This repository contains the official PyTorch implementation of our cross-modal transformer framework for weakly supervised prostate cancer grading from H&E whole slide images (WSIs). The model aligns slide-level visual representations with grade-specific textual descriptions generated by a frozen LLM and performs classification via cosine similarity in a shared latent space.
- Patch feature input (e.g., CLAM / ResNet features)
- Optional IRM attention scores for top-M patch refinement
- Frozen LLM prompt token embeddings (precomputed)
- Cross-attention transformer (text queries, visual keys/values)
- Convex aggregation for slide and text embeddings
- Cosine similarity + temperature scaling for grading
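As a concrete illustration of how these components compose, here is a minimal PyTorch sketch. It is illustrative only: the class name `CrossModalGradingHead`, the shared dimension `d`, and the learned pooling layers are assumptions for exposition, not the repository's actual modules.

```python
# Minimal sketch of the grading head; names and dimensions are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F

class CrossModalGradingHead(nn.Module):  # hypothetical name, not the repo's API
    def __init__(self, d_patch, d_text_in, d=256, n_heads=4, top_m=512):
        super().__init__()
        self.top_m = top_m
        self.vis_proj = nn.Linear(d_patch, d)    # patch features -> shared space
        self.txt_proj = nn.Linear(d_text_in, d)  # prompt tokens  -> shared space
        # Text tokens act as queries; patch features act as keys/values.
        self.cross_attn = nn.MultiheadAttention(d, n_heads, batch_first=True)
        # Scorers for convex (softmax-normalized) aggregation weights.
        self.vis_pool = nn.Linear(d, 1)
        self.txt_pool = nn.Linear(d, 1)
        self.log_temp = nn.Parameter(torch.zeros(()))  # learnable temperature

    def convex_agg(self, x, scorer):
        w = scorer(x).softmax(dim=1)   # [B, T, 1], weights sum to 1 over tokens
        return (w * x).sum(dim=1)      # [B, d] convex combination

    def forward(self, feats, prompts, attn_scores=None):
        # feats: [N, d_patch]; prompts: [C, L, d_text_in]; attn_scores: [N]|None
        if attn_scores is not None and feats.size(0) > self.top_m:
            idx = attn_scores.topk(self.top_m).indices  # IRM top-M refinement
            feats = feats[idx]
        v = self.vis_proj(feats).unsqueeze(0)           # [1, M, d]
        slide = self.convex_agg(v, self.vis_pool)       # [1, d] slide embedding
        sims = []
        for c in range(prompts.size(0)):
            q = self.txt_proj(prompts[c]).unsqueeze(0)  # [1, L, d] text queries
            t, _ = self.cross_attn(q, v, v)             # attend text -> patches
            text = self.convex_agg(t, self.txt_pool)    # [1, d] class embedding
            sims.append(F.cosine_similarity(slide, text, dim=-1))
        return torch.cat(sims) * self.log_temp.exp()    # [C] scaled similarities
```

At training time the C temperature-scaled similarities would presumably feed a standard cross-entropy loss against the slide-level label.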
Expected per-slide feature files:
`patches_dir/<slide_id>.pt` containing:
- `features`: `Tensor[N, d_patch]`
- `attn` (optional): `Tensor[N]`
Labels file:
`labels.csv` with columns `slide_id,label` (label in {0, 1, 2, 3}).
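For reference, a minimal loading sketch under the layout above; the slide id `SLIDE_001` is a hypothetical placeholder, and the `attn` key is read defensively since it is optional.

```python
import torch
import pandas as pd

# Load one slide's precomputed feature file (slide id is a placeholder).
data = torch.load("patches_dir/SLIDE_001.pt", map_location="cpu")
features = data["features"]        # Tensor[N, d_patch]
attn = data.get("attn")            # Tensor[N], or None if not provided

# Look up the slide's grade label from labels.csv.
labels = pd.read_csv("labels.csv")  # columns: slide_id, label (0-3)
label = int(labels.set_index("slide_id").loc["SLIDE_001", "label"])
```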
- Precompute prompt token embeddings and save them as a `.pt` dict: `{class_id: Tensor[L, d_text_in]}` (a sketch of this step follows the train command below).
- Train:
  ```bash
  python scripts/train.py --cfg configs/default.yaml --fold 0 --prompt_pt /llm/prompts.pt
  ```
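A hedged sketch of the precompute step, using a generic frozen HuggingFace encoder as a stand-in for the frozen LLM. The model name and prompt texts are placeholders; only the saved dict format `{class_id: Tensor[L, d_text_in]}` matters to the trainer.

```python
import torch
from transformers import AutoTokenizer, AutoModel

MODEL = "bert-base-uncased"  # assumption: stand-in for the paper's frozen LLM
# Placeholder grade descriptions; replace with the actual per-class prompts.
prompts = {c: f"textual description of grade class {c}" for c in range(4)}

tok = AutoTokenizer.from_pretrained(MODEL)
enc = AutoModel.from_pretrained(MODEL).eval()  # keep the encoder frozen

out = {}
with torch.no_grad():
    for cid, text in prompts.items():
        inputs = tok(text, return_tensors="pt")
        hidden = enc(**inputs).last_hidden_state  # [1, L, d_text_in]
        out[cid] = hidden.squeeze(0)              # Tensor[L, d_text_in]; L may vary

torch.save(out, "/llm/prompts.pt")  # path matches the --prompt_pt flag above
```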