DeepSurv: Cox Proportional Hazards Deep Neural Network

PyTorch implementation of DeepSurv for survival analysis research.

Paper: Katzman et al., 2018 | Original Code

🗺️ Research Roadmap

Phase	Status	Goal
1. Vanilla Baseline	✅ Complete	Establish baseline (C-index=0.7662)
2. SEER Application	✅ Complete	Apply to clinical cancer data (C-index=0.7616)
3. Comorbidity Analysis	Study dual-cancer survival patterns
4. Comorbidity-Aware Arch	Multi-input, attention, cross-stitch
5. Advanced Methods	🔮 Optional	DeepHit, RSF, Transformers

📂 Project Structure

DeepSurv/
├── README.md                      # All documentation (this file)
├── requirements.txt
├── phase1_vanilla/
│   ├── main.py                    # --generate-data to save, then train
│   ├── src/                       # Vanilla codebase
│   ├── data/                      # → train/val/test_synthetic.csv
│   └── results/                   # → checkpoints/, figures/
└── phase2_seer/
    ├── main.py                    # Training script
    ├── src/                       # SEER codebase (data_loader modified)
    ├── data/                      # → train/val/test_comorbid.csv
    └── results/                   # → checkpoints/, figures/

Design: Maximally minimal. All docs in README. Each phase self-contained.

🎯 Quick Start

# Install
python -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt

# Phase 1
cd phase1_vanilla
python main.py --generate-data  # Generate & save data
python main.py                  # Train

# Phase 2  
cd ../phase2_seer
python main.py                  # Generates SEER data & trains (vanilla)

📊 Data Generation

Phase 1: Synthetic Linear

Automatically generated in main.py:

5000 samples, 10 features (standardized normal)
Linear hazard: weights [2.0, -1.6, 1.2, ..., -0.06]
Saved to: data/synthetic_vanilla_5000_linear.csv

Phase 2: SEER-like Comorbid

Automatically generated in main.py:

5000 samples, 25 features (5 demographics + 20 comorbidities)
Demographics: age (normal), race (4 categories)
Comorbidities: 20 binary indicators (30% prevalence)
Realistic survival: exponential with mean ~36 months
Saved to: data/seer_synthetic_5000_comorbid.csv

Both datasets are generated on-the-fly during training. No separate data generation scripts needed.

🔍 What Does DeepSurv Predict?

Output: Risk score f(x) ∈ ℝ, not survival time or event probability.

Cox Framework: Learns relative risk ranking using Cox partial likelihood (Cox, 1972). For patient i with (tᵢ, δᵢ):

δᵢ = 1 (event): tᵢ is true survival time
δᵢ = 0 (censored): tᵢ is lower bound, true time unknown

Why ranking? Censoring makes regression infeasible (undefined target). Cox partial likelihood requires only ordering information.

Time prediction alternatives: Parametric (Weibull AFT), Random Survival Forests (2008), DeepHit (2018), SurvTRACE (2022).

Input flexibility: Architecture [25, 25] adapts to any input size. For input_dim=d, first layer weight matrix is W₁ ∈ ℝ^(25×d), automatically allocated by nn.Linear(d, 25). Example:

Vanilla synthetic (10 features): 10 → [25, 25] → 1
SEER comorbid (25 features): 25 → [25, 25] → 1

Output targets: Both use (survival_months, vital_status) for Cox loss supervision, not direct prediction.

📊 Phase 1 Results

Metric	Value
C-Index	0.7662 (validation) / 0.7778 (training)
Dataset	Synthetic linear, 5000 samples, 10 features
Architecture	[25, 25] ReLU (vanilla)
Hyperparameters	LR=1e-3, L2=0.01, SGD+Nesterov
Training	165 epochs (early stopped, patience=100)
Validation Split	15% (750 samples)
Event Rate	20.3% (stronger signal, less censoring)
Device	Apple M1 (MPS)

Framework Adaptations:

LR: 1e-4 → 1e-3 (10×) - PyTorch SGD dynamics
L2: 10.0 → 0.01 (÷1000) - PyTorch weight_decay convention
Synthetic signal: 2× hazard weights for reproducible learning

📊 Phase 2 Results: SEER Application

Metric	Value
C-Index	0.7616 (validation) / 0.7427 (training)
Dataset	SEER-like synthetic comorbid, 5000 samples, 25 features
Features	5 demographics (age, race) + 20 comorbidities
Architecture	[25, 25] ReLU (vanilla - unchanged)
Hyperparameters	LR=1e-3, L2=0.01, SGD+Nesterov (same as Phase 1)
Training	138 epochs (early stopped, patience=100)
Validation Split	15% (750 samples)
Event Rate	22.8% (realistic clinical rate)
Survival	Mean=55.6 months, Median=34.9 months
Device	Apple M1 (MPS)

Key Findings:

✅ Vanilla settings transfer perfectly - No modifications needed for clinical data
✅ Similar performance - C-index 0.7616 (SEER) vs 0.7662 (synthetic)
✅ Model scales naturally - 10→25 input features, parameters 951→1326
✅ Realistic clinical distribution - 22.8% event rate, ~4.6 years mean survival

Implementation Changes:

Code: Identical vanilla implementation (src/ copied from Phase 1)
Data: SEER synthetic generation added to main.py (25 features: demographics + comorbidities)
Model: No architectural changes, input_dim automatically adapts from 10→25

📋 Phase Comparison

Aspect	Phase 1 (Vanilla Baseline)	Phase 2 (SEER Application)
Data	Synthetic linear (10 features)	SEER comorbid (25 features)
Features	Standardized normal	5 demographics + 20 comorbidities
Samples	5000	5000
Event Rate	20.3%	22.8%
Mean Survival	N/A	55.6 months (~4.6 years)
Model	[25, 25] ReLU	[25, 25] ReLU (identical)
Parameters	951	1326 (auto-scaled)
Hyperparams	LR=1e-3, L2=0.01	LR=1e-3, L2=0.01 (identical)
Training	165 epochs	138 epochs
Validation C-Index	0.7662	0.7616
Training C-Index	0.7778	0.7427
Code Changes	N/A	None - vanilla code copied

Conclusion: Vanilla DeepSurv generalizes to clinical data with zero modifications. Similar performance (0.7662 vs 0.7616) demonstrates robust baseline for comorbidity research.

🔧 Configuration

Key hyperparameters in src/config.py:

Parameter	Vanilla	Adapted	Reason
Architecture	[25, 25] ReLU	✓	-
Optimizer	SGD+Nesterov (momentum=0.9)	✓	-
Batch Size	64	✓	-
Early Stop	50 epochs	✓	-
Learning Rate	1e-4	1e-3	PyTorch optimization dynamics
L2 Reg	10.0	0.01	PyTorch weight_decay semantics

� References

Cox, D. R. (1972). "Regression models and life-tables." JRSS.
Katzman, J. L., et al. (2018). "DeepSurv." BMC Medical Research Methodology, 18(1), 24.
Ishwaran, H., et al. (2008). "Random survival forests." Ann. Appl. Stat.
Lee, C., et al. (2018). "DeepHit." AAAI.

Status: Phase 1 ✅ Complete | Phase 2 🔄 In Progress | Research code for PhD thesis

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
papers		papers
phase1_vanilla		phase1_vanilla
phase2_seer		phase2_seer
.gitignore		.gitignore
README.md		README.md
ToDo		ToDo
deepsurvk-Notes.txt		deepsurvk-Notes.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DeepSurv: Cox Proportional Hazards Deep Neural Network

🗺️ Research Roadmap

📂 Project Structure

🎯 Quick Start

📊 Data Generation

Phase 1: Synthetic Linear

Phase 2: SEER-like Comorbid

🔍 What Does DeepSurv Predict?

📊 Phase 1 Results

📊 Phase 2 Results: SEER Application

📋 Phase Comparison

🔧 Configuration

� References

About

Uh oh!

Releases

Packages

Languages

nhabibi/DeepSurv

Folders and files

Latest commit

History

Repository files navigation

DeepSurv: Cox Proportional Hazards Deep Neural Network

🗺️ Research Roadmap

📂 Project Structure

🎯 Quick Start

📊 Data Generation

Phase 1: Synthetic Linear

Phase 2: SEER-like Comorbid

🔍 What Does DeepSurv Predict?

📊 Phase 1 Results

📊 Phase 2 Results: SEER Application

📋 Phase Comparison

🔧 Configuration

� References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages