PathBench-MIL

Downstream evaluation scripts and benchmark pipelines for PathBench: multiple-instance learning (MIL) based WSI classification and survival prediction.

Overview

PathBench-MIL contains the downstream evaluation code used by the PathBench benchmark. It provides training and evaluation pipelines for:

WSI classification
Survival prediction

The repository expects features to be extracted from whole-slide images (WSIs) using a separate extraction pipeline (for example, PrePATH). See the Data Preparation section below.

Features

Modular training/evaluation scripts for classification and survival tasks
Example splits and dataset templates
Simple shell wrappers (run.sh) for reproducible experiments

Repository Layout

README.md
classification/
    ├── main.py
    ├── run.sh
    ├── datasets/
    ├── models/
    ├── splits/
    └── utils/
survival/
    ├── main_kfold.py
    ├── run.sh
    ├── models/
    ├── splits/
    └── utils/

Data Preparation

Recommended workflow:

Split each WSI into patches and extract features with a pretrained model (outside this repo, e.g., PrePATH).
Store features as .pt or .h5 files and organize them under a cohort directory.

Suggested layout:

DATA_ROOT/
└── CohortName/
    ├── pt_files/
    │   ├── resnet50/
    │   │   ├── slide_1.pt
    │   │   ├── slide_2.pt
    │   │   └── ...
    │   └── ...
    └── patches/
        ├── slide_1.h5
        ├── slide_2.h5
        └── ...
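
To make the layout concrete, the following minimal sketch indexes the per-slide feature files for one cohort and encoder. The function name `index_feature_files` and its parameters are illustrative assumptions, not part of the repository; the sketch relies only on the directory layout suggested above.

```python
from pathlib import Path


def index_feature_files(data_root, cohort, encoder, suffix=".pt"):
    """Map slide IDs to feature files under <data_root>/<cohort>/pt_files/<encoder>/.

    Hypothetical helper: it assumes the suggested layout and is not the
    repository's own data loader.
    """
    feature_dir = Path(data_root) / cohort / "pt_files" / encoder
    if not feature_dir.is_dir():
        raise FileNotFoundError(f"feature directory not found: {feature_dir}")
    # The slide ID is the file name without its extension (slide_1.pt -> slide_1).
    return {p.stem: p for p in sorted(feature_dir.glob(f"*{suffix}"))}
```

For example, `index_feature_files("DATA_ROOT", "CohortName", "resnet50")` would return a dictionary keyed by `slide_1`, `slide_2`, and so on. Each file could then be read with `torch.load`, assuming it stores a tensor of patch features.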

Add dataset paths and labels to classification/splits/datasets.xlsx for classification experiments.
For survival experiments, follow the survival/splits/example.xlsx format (time-to-event and event flag columns).
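
Before training, it can help to sanity-check the survival sheet. The sketch below validates rows that have already been read into dictionaries (for instance with `pandas.read_excel(...).to_dict("records")`). The column names `time` and `event`, and the 0/1 coding of the event flag, are placeholders chosen for illustration; substitute the headers and coding actually used in survival/splits/example.xlsx.

```python
def check_survival_rows(rows, time_col="time", event_col="event"):
    """Return a list of human-readable problems found in survival records.

    Column names and the 0/1 event coding are assumptions for illustration.
    """
    problems = []
    for i, row in enumerate(rows):
        try:
            time = float(row.get(time_col))
        except (TypeError, ValueError):
            problems.append(f"row {i}: {time_col} is not numeric")
        else:
            # NaN fails this comparison too, so empty cells read as NaN are caught.
            if not time >= 0:
                problems.append(f"row {i}: {time_col} must be non-negative")
        # Assumed coding: 1 = event observed, 0 = censored.
        if row.get(event_col) not in (0, 1):
            problems.append(f"row {i}: {event_col} must be 0 or 1")
    return problems
```

An empty return value means every row has a non-negative time and a valid event flag under these assumptions.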

Note: If you change the storage format or file layout, update the corresponding data loader (for classification, under classification/datasets/).

Quick Start

We recommend using conda with Python 3.10 and installing dependencies from requirements.txt for reproducible environments.

# create and activate a conda environment with Python 3.10
conda create -n pathbench python=3.10 -y
conda activate pathbench

# install from requirements.txt (preferred)
pip install --upgrade pip
pip install -r requirements.txt

WSI Classification

Prepare feature files and update classification/splits/datasets.xlsx.
Edit classification/run.sh to set the dataset root, hyperparameters, and other options.
Run the training/benchmark:

cd classification
bash run.sh

Survival Prediction

Prepare an Excel file with survival information following the survival/splits/example.xlsx format.
Edit survival/run.sh (paths, hyperparameters).
Run:

cd survival
bash run.sh
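
When both tasks are run back to back, a small driver can call each wrapper from its own directory, mirroring the `cd` and `bash run.sh` steps. This is a sketch under the assumption that `bash` is on the PATH and that each run.sh has already been edited; `run_task` is a hypothetical helper, not a script shipped with the repository.

```python
import subprocess
from pathlib import Path


def run_task(repo_root, task):
    """Run <repo_root>/<task>/run.sh with the task directory as working directory."""
    task_dir = Path(repo_root) / task
    if not (task_dir / "run.sh").is_file():
        raise FileNotFoundError(f"no run.sh in {task_dir}")
    # check=True raises CalledProcessError if the script exits with a non-zero status.
    return subprocess.run(["bash", "run.sh"], cwd=task_dir, check=True)
```

Calling `run_task(".", "classification")` followed by `run_task(".", "survival")` from the repository root would run the two pipelines in sequence and stop at the first failure.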

Citation

If you use this code or the benchmark in your research, please cite:

@article{ma2025pathbench,
    title={PathBench: A comprehensive comparison benchmark for pathology foundation models towards precision oncology},
    author={Ma, Jiabo and Xu, Yingxue and Zhou, Fengtao and Wang, Yihui and Jin, Cheng and Guo, Zhengrui and Wu, Jianfeng and Tang, On Ki and Zhou, Huajun and Wang, Xi and others},
    journal={arXiv preprint arXiv:2505.20202},
    year={2025}
}

How to evaluate new models

To evaluate a new MIL model using this benchmark, please visit the PrePATH repository.

License

This project is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).

See the full license text in the LICENSE file at the repository root or online:

https://creativecommons.org/licenses/by-nc/4.0/
