Official implementation of SoccerMaster: A Vision Foundation Model for Soccer Understanding (CVPR 2026 Oral)
Haolin Yang, Jiayuan Rao, Haoning Wu, Weidi Xie
SoccerMaster is a unified soccer-specific vision foundation model that leverages diverse soccer content, including images and videos, to support a wide range of soccer understanding tasks, such as commentary generation, detection, tracking, classification, etc.
SoccerMaster Architecture. (a) The architecture of SoccerMaster, which encodes both soccer videos and images through spatial and temporal attention modules to generate semantically rich representations. (b) The pretraining tasks and downstream adaptations of SoccerMaster across both spatial perception and semantic understanding tasks.
First, clone the SigLIP2 backbone model to the pretrained models directory:
cd SoccerMaster/codes/SoccerMaster/pretrained_models/google
git lfs install
git clone https://huggingface.co/google/siglip2-large-patch16-512Clone the SoccerMaster model checkpoints from Hugging Face:
cd SoccerMaster/codes/SoccerMaster/pretrained_models
git lfs install
git clone https://huggingface.co/xleprime/SoccerMasterAfter completing these steps, you should have the following directory structure:
root/codes/SoccerMaster/pretrained_models/
├── google/
│ └── siglip2-large-patch16-512/
└── SoccerMaster/
- Add pretraining code.
- [] Refine pretraining code.
- Release SoccerMaster checkpoints.
- Add instructions for quick start.
- Release datasets.
- Release data pepeline.
If you find our work useful, please cite:
@article{yang2025soccermaster,
title={SoccerMaster: A Vision Foundation Model for Soccer Understanding},
author={Yang, Haolin and Rao, Jiayuan and Wu, Haoning and Xie, Weidi},
journal={arXiv preprint arXiv:2512.11016},
year={2025}
}
