INSTRUCTION

🚀 Project Workflow

1. Access the MIMIC-III Dataset

Follow the official instructions to obtain access to the MIMIC-III v1.4 clinical dataset from PhysioNet:

🔗 MIMIC-III Access Instructions

2. Extract Sepsis Treatment Dataset

Clone and follow the instructions from the official GitHub repository:

🔗 Microsoft/mimic_sepsis Repository

Use the provided SQL scripts and Python code to extract intermediate tables for the sepsis cohort.

3. Generate Continuous Treatment Variables

Run the following script to derive continuous treatment actions (e.g., IV fluid and vasopressor dosage):

python sepsis_cohort_continous.py

This forms the action space for RL.

4. Feature Selection and Preprocessing

Open and execute:

feat_selection.ipynb

This notebook selects relevant features and produces the final dataset for model input.

5. Split Dataset

To divide the data into training, validation, and test sets, run:

data_split.ipynb

6. Learn State Transition Model

To build the state transition model using k-Nearest Neighbors (kNN):

transition_model.ipynb

This is used for model-based RL algorithms.

7. Build OOD Guardian

Run the following script to compute the Out-of-Distribution (OOD) guardian based on a Gaussian Kernel method:

python guardian.py

This protects the RL policy from unsafe decisions on out-of-distribution states.

🧠 Train Reinforcement Learning Models

To train different RL models, run the following Python files:

Model	Script Path
CQL	`models/ddpg_cql.py`
CCQL	`models/ddpg_cql_ts.py`
GCQL	`models/ddpg_cql_guard.py`
MB-TRPO	`models/trpo.py`
GMB-TRPO	`models/trpo_guard.py`
MB-CPO	`models/cpo.py`
GMB-CPO	`models/cpo_guard.py`

🧪 Evaluate the Trained Policies

Use the following script to perform offline rollout evaluation on the test dataset:

python eval.py

📁 Project Structure Overview

├── data/                   # Intermediate and final datasets
├── models/                 # Offline RL implementations
├── notebooks/
│   ├── feat_selection.ipynb
│   ├── data_split.ipynb
│   └── transition_model.ipynb
├── sepsis_cohort_continous.py
├── guardian.py
├── eval.py
└── README.md               # (this file)

⚠️ Data Availability Notice

Due to the usage restrictions of the MIMIC-III dataset, we are unable to provide any demonstration data or preprocessed files in this repository. Access to the MIMIC-III dataset must be obtained individually through PhysioNet after completing the required credentialing process.

For more information and to request access, please visit: https://physionet.org/content/mimiciii/1.4/

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
40		40
42		42
44		44
54		54
64		64
checkpoints/cpo_guard		checkpoints/cpo_guard
logs/cpo_guard		logs/cpo_guard
mimic_sepsis		mimic_sepsis
models		models
results/rollouts_data/cpo_guard		results/rollouts_data/cpo_guard
rl_representations		rl_representations
# Code Citations.md		# Code Citations.md
.gitignore		.gitignore
README.md		README.md
chk.ipynb		chk.ipynb
data_split.ipynb		data_split.ipynb
feat_selection.ipynb		feat_selection.ipynb
g.sh		g.sh
g40_augF.txt		g40_augF.txt
g40_augT.txt		g40_augT.txt
g42_augF.txt		g42_augF.txt
g42_augT.txt		g42_augT.txt
g44_augF.txt		g44_augF.txt
g44_augT.txt		g44_augT.txt
g54_augF.txt		g54_augF.txt
g54_augT.txt		g54_augT.txt
g64_augF.txt		g64_augF.txt
g64_augT.txt		g64_augT.txt
guardian.py		guardian.py
metric2.png		metric2.png
metrics1.png		metrics1.png
results.png		results.png
seeds.json		seeds.json
sepsis_cohort_continous.py		sepsis_cohort_continous.py
sepsis_final_RAW_continuous_13.csv		sepsis_final_RAW_continuous_13.csv
sepsis_final_RAW_continuous_13_test_indices_ini.npy		sepsis_final_RAW_continuous_13_test_indices_ini.npy
sepsis_final_RAW_continuous_13_test_indices_ori.npy		sepsis_final_RAW_continuous_13_test_indices_ori.npy
sepsis_final_RAW_continuous_13_train_indices_ini.npy		sepsis_final_RAW_continuous_13_train_indices_ini.npy
sepsis_final_RAW_continuous_13_train_indices_ori.npy		sepsis_final_RAW_continuous_13_train_indices_ori.npy
sepsis_final_RAW_continuous_13_weights.csv		sepsis_final_RAW_continuous_13_weights.csv
transition_model.ipynb		transition_model.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

INSTRUCTION

🚀 Project Workflow

1. Access the MIMIC-III Dataset

2. Extract Sepsis Treatment Dataset

3. Generate Continuous Treatment Variables

4. Feature Selection and Preprocessing

5. Split Dataset

6. Learn State Transition Model

7. Build OOD Guardian

🧠 Train Reinforcement Learning Models

🧪 Evaluate the Trained Policies

📁 Project Structure Overview

⚠️ Data Availability Notice

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

INSTRUCTION

🚀 Project Workflow

1. Access the MIMIC-III Dataset

2. Extract Sepsis Treatment Dataset

3. Generate Continuous Treatment Variables

4. Feature Selection and Preprocessing

5. Split Dataset

6. Learn State Transition Model

7. Build OOD Guardian

🧠 Train Reinforcement Learning Models

🧪 Evaluate the Trained Policies

📁 Project Structure Overview

⚠️ Data Availability Notice

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages