This project aims to generate treatment predictions and clinical trajectories for patients with non-small cell lung cancer (NSCLC) using large language models (LLMs) applied to real-world clinical notes.
- Contains source code for generating consistency bench + synthetic benchmarks
- Contains source code for generating analysis regarding zero-shot model failure mode predictions
- Contains source code for generating model performance on cross-model consistency benchmark, general evaluation scripts, and ROC curve generation files.
- Includes scripts for heatmap, bar chart, ROC curves, and confusion matrix analyses from main paper.
- Contains ordered bash scripts for executing code to validate paper performance. Ordered by analyses present in paper.