# White Blood Cells Detection Challenge
An overview of the solution is available as a PDF document here.

You need to copy the images from the challenge into the images_cytologia/ folder. The training notebooks below generate intermediate parquet/CSV files; the output files of one notebook are used as inputs to subsequent notebooks.
```
├── README.md
│
├── data
│   ├── images_cytologia
│   ├── background_images.csv
│   ├── train.csv
│   └── test.csv
│
├── notebooks               <- Training/inference notebooks
│
├── code                    <- Related code used by notebooks
│   ├── src
│   ├── src_object_detector <- YOLOX fork
│   └── requirements.txt
│
├── yolo_models             <- Materials for object detection (YOLOX)
│
└── models                  <- Materials for classifier
```
Install all Python 3.10 dependencies:

```
cd code/
pip install -r requirements.txt
cd src_object_detector/YOLOX/
pip install -v -e .
```

Run the following notebook:
- Full-Inference.ipynb. Run all trained models on the test data. It takes around 2h40m for 20,751 images (462 ms per image) on a single standard GPU (RTX 3090). Download the ready-to-use full package (including the same notebook, named FULL-INFERENCE-v6.ipynb) with weights for all trained models. You need to copy the images from the challenge into the images_cytologia/ folder before running it. This package is the one uploaded for the winning submission.
Run the following notebooks in the given order. You need to copy the images from the challenge into the data/images_cytologia/ folder before running them.
1 - Cross-Validation.ipynb. Clean up duplicated data and generate the cross-validation split. It generates two files: BB_cleaned_v1.parquet (cleaned data) and cv4_seed42_multilabel.parquet (cross-validation split). It takes around 30 minutes to run. A sketch of the split strategy is shown below.
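The split file name suggests a 4-fold multilabel-stratified split with seed 42. Below is a minimal sketch of how such a split can be produced, assuming the iterative-stratification package (MultilabelStratifiedKFold); the column names ("image_id", "label") are hypothetical, not the notebook's actual schema:

```python
import pandas as pd
from iterstrat.ml_stratifiers import MultilabelStratifiedKFold

# Sketch: assign a fold to each image, stratified over per-image label indicators.
df = pd.read_parquet("BB_cleaned_v1.parquet")
y = (pd.crosstab(df["image_id"], df["label"]) > 0).astype(int)  # image x class indicators

mskf = MultilabelStratifiedKFold(n_splits=4, shuffle=True, random_state=42)
folds = pd.Series(-1, index=y.index, name="fold")
for fold, (_, val_idx) in enumerate(mskf.split(y, y.values)):
    folds.iloc[val_idx] = fold

folds.reset_index().to_parquet("cv4_seed42_multilabel.parquet")
```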
2 - Train-YOLOX-S-512.ipynb. Train the first YOLOX model on the training data. Download the YOLOX-S COCO weights first and copy yolox_s.pth into the code/src_object_detector/YOLOX folder. It will generate a COCO structure under the yolo_models/ folder and weights under the code/src_object_detector/YOLOX/YOLOX_outputs folder. Training takes around 80 hours (150 epochs) on a single standard GPU (RTX 3090). A sketch of the experiment configuration is shown below.
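YOLOX trainings are driven by an experiment file. The following is a minimal sketch of such a file using the standard yolox.exp.Exp pattern; the class count, data path, and epoch count are assumptions inferred from the description above, not the repository's actual configuration:

```python
# Sketch of a custom YOLOX experiment file (assumed values, standard Exp pattern).
import os
from yolox.exp import Exp as BaseExp

class Exp(BaseExp):
    def __init__(self):
        super().__init__()
        self.num_classes = 1                 # assumption: one generic WBC box class
        self.input_size = (512, 512)         # matches the YOLOX-S-512 model
        self.test_size = (512, 512)
        self.max_epoch = 150                 # training duration quoted above
        self.data_dir = "yolo_models"        # COCO structure generated by the notebook
        self.exp_name = os.path.split(os.path.realpath(__file__))[1].split(".")[0]
```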
3 - Infer-YOLOX-S-512.ipynb. Run inference with the YOLOX-S-512 model. It will generate the cv4_seed42_multilabel_oof_yolox_s_512_v3.1.parquet file used by the next notebook. It takes around 2 hours.
4 - Background images.ipynb. Generate background images from the YOLOX model and an Otsu filter. It will generate the background images used by the next notebooks under the data/background folder. It takes around 10 minutes to run. A sketch of the Otsu step is shown below.
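Otsu thresholding picks a global binarization threshold automatically, which separates stained cells from the bright slide background. A minimal sketch with OpenCV follows; the file name is hypothetical, and how the mask is combined with the YOLOX detections is the notebook's logic, not shown here:

```python
import cv2

# Sketch: Otsu threshold on a grayscale image; bright pixels are treated as
# background candidates (assumption about how background patches are picked).
img = cv2.imread("data/images_cytologia/example.jpg")  # hypothetical file name
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
thresh, mask = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)

background_ratio = (mask == 255).mean()  # fraction of bright (background) pixels
print(f"Otsu threshold={thresh:.0f}, background ratio={background_ratio:.2%}")
```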
5 - Train-YOLOX-S-640.ipynb. Train the second YOLOX model on the training data with noisy boxes removed. It will generate a COCO structure under the yolo_models/ folder and weights under the code/src_object_detector/YOLOX/YOLOX_outputs folder. Training takes around 70 hours (110 epochs) on a single standard GPU (RTX 3090).
6 - Infer-YOLOX-S-640.ipynb. Run inference with the YOLOX-S-640 model. It will generate a boxes/ folder with extracted bounding-box crops ready for training the classifiers, and also generate files used by the next notebook. It takes around 2 hours. A sketch of the crop extraction is shown below.
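Extracting classifier training crops from detector output amounts to cutting each predicted box out of its source image. A minimal sketch, assuming a hypothetical detection schema ("image_path", "x1", "y1", "x2", "y2"):

```python
from pathlib import Path

import pandas as pd
from PIL import Image

# Sketch: crop each detected box and save it for classifier training.
boxes = pd.read_parquet("detections.parquet")  # hypothetical file and schema
out_dir = Path("boxes")
out_dir.mkdir(exist_ok=True)

for i, row in boxes.iterrows():
    img = Image.open(row["image_path"])
    crop = img.crop((row["x1"], row["y1"], row["x2"], row["y2"]))
    crop.save(out_dir / f"crop_{i}.png")
```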
7 - Train-Classifiers. Train/fine-tune/infer the multi-class and multi-label models and perform the ensemble. Make sure you have downloaded the DinoBloom-B pretrained weights into the notebooks/ folder before running. It will generate models under the models/ folder. Training takes around 170 hours on a single standard GPU (RTX 3090). A sketch of classifier creation is shown below.
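The classifiers are built from pretrained backbones (see the weights table below). A minimal sketch of creating one such model with timm; the architecture name and class count are taken from the EffNetV2m row of the table, but the rest is an assumption about the notebook's internals:

```python
import timm
import torch

# Sketch: a 23-class classifier head on a pretrained backbone (assumed setup).
model = timm.create_model("tf_efficientnetv2_m", pretrained=True, num_classes=23)

x = torch.randn(1, 3, 512, 512)  # 512x512 crops, as in the EffNetV2m table row
logits = model(x)                # shape: (1, 23)
print(logits.shape)
```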
8 - Full-Inference. Run the YOLOX models followed by the classifier models. Update DATA_ROOT and MODELS_HOME if you run it from the notebooks/ folder. Runtime durations are given in the Full-Inference.ipynb entry above.
Note: The training-time estimates given above are for:
- A single standard GPU (NVIDIA RTX 3090, 24 GB VRAM), CUDA version 12.6.
- 64 GB RAM, Intel Core i9, 10 CPU cores, SSD disk.
- Linux Ubuntu 22.04.3 LTS (GNU/Linux 5.15.167.4)/WSL2, Python 3.10.
- The random seed is fixed to 42; however, 100% reproducibility is not guaranteed, as certain operations (e.g., PyTorch/torch.nn.functional, DinoV2/Xformers) may still behave nondeterministically due to hardware and low-level implementation differences. Final results should nevertheless be very close. A seeding sketch is shown below.
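For reference, a typical way to fix the seed at 42 across the libraries involved (a minimal sketch; the notebooks may seed differently):

```python
import random

import numpy as np
import torch

def seed_everything(seed: int = 42) -> None:
    """Seed all common sources of randomness (sketch)."""
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    # Prefer deterministic cuDNN kernels; some ops remain nondeterministic anyway.
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False

seed_everything(42)
```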
You can download each model's weights (results of the CV4 trainings above) if needed. Intermediate results (cross-validation, inference, submissions) are all included.
| Model | Image size | Weights |
|---|---|---|
| YOLOX-S-512 | 512x512 | Download |
| YOLOX-S-640 | 640x640 | Download |
| Multi-class ViT large (23 classes) | 224x224 | Download |
| Multi-class ViT large with background (24 classes) | 224x224 | Download |
| Multi-class DinoV2/DinoBloom (23 classes) | 224x224 | Download |
| Multi-class NextViT (23 classes) | 384x384 | Download |
| Multi-class TinyViT (23 classes) | 512x512 | Download |
| Multi-class EffNetV2m (23 classes) | 512x512 | Download |
| Multi-label EffNetV2m with background (24 classes) | 512x512 | Download |
The best submission (with the best CV score) is an ensemble of multiple models. While no formal ablation study was conducted, observations from experiments suggest the following key contributors to improved cross-validation performance:
- A two-stage pipeline: White Blood Cell bounding box detector followed by classifiers.
- Augmentations such as MixUp and CutMix.
- Extended training with more epochs and fine-tuning.
- Test-Time Augmentation (TTA).
- Diverse model architectures (CNNs and Transformers).
- Weighted averaging of models in the ensemble (see the sketch after this list).
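As an illustration of the last point, a weighted average over per-model class probabilities might look like the following; the weights, model count, and random inputs are hypothetical, and the actual ensemble weighting is defined in the notebooks:

```python
import numpy as np

# Sketch: weighted average of class probabilities from several classifiers.
# probs_list holds one (n_crops, n_classes) array per model (hypothetical data).
probs_list = [np.random.rand(4, 23) for _ in range(3)]
probs_list = [p / p.sum(axis=1, keepdims=True) for p in probs_list]  # normalize rows

weights = np.array([0.5, 0.3, 0.2])  # hypothetical per-model weights, summing to 1
ensemble = sum(w * p for w, p in zip(weights, probs_list))

predictions = ensemble.argmax(axis=1)  # final class per crop
print(predictions)
```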
Further improvements could likely be achieved by fine-tuning on external datasets such as the Peripheral Blood Cell dataset, which were not used in this work.
- HuggingFace Timm vision models: https://huggingface.co/timm
- DinoBloom foundation models: https://github.com/marrlab/DinoBloom