Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle

The official repository introducing MaskPose and BBox-Mask-Pose methods.

Project webpage | ArXiv | Video

Description

This repository provides the code and model weights for the paper Detection, Segmentation, and Pose Estimation for Multiple Bodies: Closing the Virtuous Circle.

The code is a modified version of MMPose 2.0 with the following key changes:

Support for multi-dataset training of ViTPose, previously implemented in the official ViTPose repository but absent in MMPose.
Added MaskBackground data augmentation to train MaskPose.
Includes weights for MaskPose and RTMDet to reproduce BBox-Mask-Pose results from the paper.

Note: The code has not undergone extensive cross-platform testing and may contain bugs. If you encounter issues, please report them via the repository's issue tracker.

If you use this work, kindly cite it using the reference provided below.

News

02 Dec 2024: The code is available
23 Nov 2024: The project website is on

Installation

The code builds on MMPose. Install its dependencies simply with:

pip install mmengine
mim install "mmcv>=2.0.1"
mim install "mmdet>=3.1.0"

And then install our code using

pip install -r requirements.txt
pip install -e .

Results & Weights

Pose

Results on COCO val and OCHuman val of different Human Pose Estimation (HPE) methods. All results are with detection from RTMDet-l from MMDetection.

We provide trained weights for both MaskPose (introduced in the paper) and ViTPose trained in the multi-dataset setup. Multi-dataset ViTPose is not new but their weights are not compatible with popular MMPose 2.0 codebase. We retrained theme in the MMPose 2.0 environment.

Model	Datasets	COCO AP	OCHuman AP	weights	notes
ViTPose-b	COCO+AIC+MPII	76.3	42.5	download	multi-dataset training compatible with MMPose 2.0
MaskPose-b	COCO+AIC+MPII	76.4	45.3	download

Detection

To run BBox-Mask-Pose loop, you also need to adapt a detector. We fine-tuned RTMDet-l with mask-out data augmentation as shown in the paper. Weights of the fine-tuned model (compatible with MMDetection config) is here.

Licence

Please read carefully the terms and conditions and any accompanying documentation before you download and/or use the RePoGen model, data and software, (the "Model & Software"). By downloading and/or using the Model & Software (including downloading, cloning, installing, and any other use of this github repository), you acknowledge that you have read these terms and conditions, understand them, and agree to be bound by them. If you do not agree with these terms and conditions, you must not download and/or use the Model & Software. Any infringement of the terms of this agreement will automatically terminate your rights under this License.

Acknowledgements

The code combines MMDetection, MMPose 2.0 and ViTPose.

Citation and Contact

The code was implemented by Miroslav Purkrábek.

For questions, please use the Issues of Discussion.

BibTeX will be here once submitted to ArXiv

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
configs		configs
images		images
mmpose		mmpose
requirements		requirements
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle

Project webpage | ArXiv | Video

Table of Contents

Description

News

Installation

Results & Weights

Pose

Detection

Licence

Acknowledgements

Citation and Contact

About

Uh oh!

Releases

Packages

Languages

License

geodesicDavid/BBoxMaskPose

Folders and files

Latest commit

History

Repository files navigation

Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle

Project webpage | ArXiv | Video

Table of Contents

Description

News

Installation

Results & Weights

Pose

Detection

Licence

Acknowledgements

Citation and Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages