GitHub - StanfordHPDS/data_transformation: Repository for the paper "A Microsimulation-Based Framework for Mitigating Societal Bias in Primary Care Data"

Introduction

This repository contains code for the paper "A microsimulation-based framework for mitigating societal bias in primary care data" by Agata Foryciarz, Fernando Alarid-Escudero, Gabriela Basel, Marika Cusick, Robert L. Phillips, Andrew Bazemore, Alyce S. Adams, and Sherri Rose (2025), medRxiv:10.1101/2025.04.03.25325206.

The code generates eGFR trajectories across a person's lifetime, conditional on comorbidities, age, CKD stage, and sex.

Environment setup

Using this repo requires a local anaconda installation.

git clone https://github.com/StanfordHPDS/data_transformation
cd data_transformation
conda update -n base -c defaults conda

# set up and activate a conda environment
conda env create --name data_transformation -f environment.yml

conda activate data_transformation

# install this repo as a package
pip install -e .

Cohort extraction

All SQL scripts necessary for extracting the cohort from an OMOP BigQuery dataset are provided under egfr_microsim/cohort. Running python egfr_microsim/cohort/run_all.py generates all calibration targets and cohort summary statistics necessary for running the model and reported in the manuscript. Cell counts below 11 are suppressed and converted to 11, so all summary data saved to CSV files by run_all.py can be exported to a non-secure environment.

Results of cohort extraction were included under

cd data_transformation

egfr_microsim/cohort/calibration_targets
egfr_microsim/cohort/data

Experiment

The entire experiment can be reproduced by running

egfr_microsim/job_scripts/full_100k.sh

Given the size of the experiment, it is best ran in a distributed way:

egfr_microsim/job_scripts/full_100k/run_experiment.sh
egfr_microsim/job_scripts/full_100k/run_analyze.sh
egfr_microsim/job_scripts/full_100k/gen_figures.sh

Experiment results will be saved in

egfr_microsim/exp_results/full_100k

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
egfr_microsim		egfr_microsim
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
environment.yml		environment.yml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction

Environment setup

Cohort extraction

Experiment

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

StanfordHPDS/data_transformation

Folders and files

Latest commit

History

Repository files navigation

Introduction

Environment setup

Cohort extraction

Experiment

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages