Mask4Former: Mask Transformer for 4D Panoptic Segmentation (Renamed from MASK4D)

Kadir Yilmaz, Jonas Schult, Alexey Nekrasov, Bastian Leibe

RWTH Aachen University

Mask4Former is a transformer-based model for 4D Panoptic Segmentation, achieving a new state-of-the-art performance on the SemanticKITTI test set.

[Project Webpage] [arXiv]

News

2025-05-25: MinkowskiEngine has been replaced with Spconv to simplify installation and enable 16-bit operations. Training now should be roughly twice as fast. See Minkowski in Tags for the previous version of the code.
2024-01-29: Mask4Former accepted to ICRA 2024
2023-09-28: Mask4Former on arXiv

Dependencies

The main dependencies of the project are the following:

python: 3.11
cuda: 12.4

We use uv as Python package and project manager. You can also set up your environment as you prefer and just use pip install command.

You can set up a uv virtual environment in your local directory as follows:

uv venv --python 3.11
source .venv/bin/activate

You can install the python packages as follows:

uv pip install -r requirements.txt --no-deps --extra-index-url https://download.pytorch.org/whl/cu124

Data preprocessing

After installing the dependencies, we preprocess the SemanticKITTI dataset. This can take some time.

python -m datasets.preprocessing.semantic_kitti_preprocessing preprocess \
--data_dir $SEMANTICKITTI_DIR/SemanticKITTI/dataset \
--save_dir data/semantic_kitti \
--generate_instances True

Training and testing

Train Mask4Former:

python main_panoptic_4d.py

In the simplest case the inference command looks as follows:

python main_panoptic_4d.py \
general.mode="validate" \
general.ckpt_path='PATH_TO_CHECKPOINT.ckpt'

Or you can use DBSCAN to boost the scores even further:

python main_panoptic_4d.py \
general.mode="validate" \
general.ckpt_path='PATH_TO_CHECKPOINT.ckpt' \
general.dbscan_eps=1.0

Trained checkpoint

Mask4Former

The provided model, trained after the submission, achieves 71.3 LSTQ without DBSCAN and 72.0 with DBSCAN post-processing on the valiidation set.

BibTeX

@inproceedings{yilmaz24mask4former,
  title     = {{Mask4Former: Mask Transformer for 4D Panoptic Segmentation}},
  author    = {Yilmaz, Kadir and Schult, Jonas and Nekrasov, Alexey and Leibe, Bastian},
  booktitle = {{International Conference on Robotics and Automation (ICRA)}},
  year      = {2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
conf		conf
data		data
datasets		datasets
docs		docs
models		models
scripts		scripts
trainer		trainer
utils		utils
.gitignore		.gitignore
README.md		README.md
main_panoptic_4d.py		main_panoptic_4d.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Mask4Former: Mask Transformer for 4D Panoptic Segmentation (Renamed from MASK4D)

News

Dependencies

Data preprocessing

Training and testing

Trained checkpoint

BibTeX

About

Uh oh!

Uh oh!

Contributors 2

Languages

YilmazKadir/Mask4Former

Folders and files

Latest commit

History

Repository files navigation

Mask4Former: Mask Transformer for 4D Panoptic Segmentation (Renamed from MASK4D)

News

Dependencies

Data preprocessing

Training and testing

Trained checkpoint

BibTeX

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors 2

Languages