Privacy-preserving record linkage for Tufts CTSI and collaborators
- Ensure that you have Python installed.
- Install uv, a Python package manager.
- Clone this PPRL repository: `git clone https://github.com/TuftsCTSI/pprl`
- Open the newly created folder: `cd pprl`
- Install dependencies with `uv sync`.
- Run `source .venv/bin/activate`.
- Verify your setup by running the test suite: `pprl test`
- Close or restart the terminal (or proceed to the Usage section).
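
The first two installation steps can be checked before cloning. The sketch below is a hedged example rather than part of the PPRL tooling: `check_prereqs` is a hypothetical helper name, it assumes the interpreter is exposed as `python3` (adjust if yours is `python`), and it also looks for `git`, which the clone step relies on.

```shell
# Report whether each prerequisite is available on PATH.
# Assumption: Python is installed as `python3`; `check_prereqs` is a
# hypothetical helper, not a command shipped with pprl.
check_prereqs() {
  for tool in python3 uv git; do
    if command -v "$tool" >/dev/null 2>&1; then
      echo "ok: $tool"
    else
      echo "missing: $tool"
    fi
  done
}

check_prereqs
```

Any tool reported as `missing` should be installed before continuing with the clone and `uv sync` steps.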
- Add all configuration files to the `my_files` subdirectory.
- Depending on your role, you might have generated an input file. This should also be placed in `my_files`.
- Run `source .venv/bin/activate`, if you haven't already.
- From the main `pprl` directory, run the appropriate `pprl` command.
  - If you're generating hashes from patient data, run `pprl create`.
  - If you're linking hashes in order to determine duplicates, run `pprl match`.
  - Run `pprl` to see a full list of options.
- Transmit output files according to the study protocol.
- Delete any sensitive files once they are no longer required.
- Close or restart the terminal.
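
The preconditions in the steps above (files placed in `my_files`, virtual environment activated, commands run from the main `pprl` directory) can be checked with a short shell sketch before running `pprl create` or `pprl match`. This is an illustration under stated assumptions, not part of the PPRL tooling: `preflight` is a hypothetical helper name, and the check relies only on the `VIRTUAL_ENV` variable that the standard activation script sets.

```shell
# Pre-flight check before running a pprl command.
# Assumptions: invoked from the main pprl directory; `preflight` is a
# hypothetical helper, not a command shipped with pprl.
preflight() {
  dir="${1:-my_files}"
  if [ ! -d "$dir" ]; then
    echo "error: $dir not found; run this from the main pprl directory"
    return 1
  fi
  if [ -z "$(ls -A "$dir")" ]; then
    echo "error: $dir is empty; add your configuration and input files"
    return 1
  fi
  # The activation script exports VIRTUAL_ENV; warn if it is unset.
  if [ -z "${VIRTUAL_ENV:-}" ]; then
    echo "warning: no virtual environment active; run: source .venv/bin/activate"
  fi
  echo "ready: $dir"
}

if preflight my_files; then
  echo "next: run 'pprl create' or 'pprl match', depending on your role"
fi
```

The helper only inspects the directory and the environment; it does not read or validate the contents of any configuration or input file.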