- MtpNet is an efficient multitasking network used to solve three important tasks in autonomous driving: traffic object detection, drivable area segmentation, and lane detection.
- In all three tasks, both accuracy and speed have reached advanced levels.
- Integrated into Huggingface Spaces 🤗 using Gradio. Try out the Web Demo !
We used the BDD100K as our datasets,and experiments are run on NVIDIA 3090.
Model : trained on the BDD100k dataset and test on the bilibili: Cam 看世界 video.
| input | output by yolop | output by mtpnet |
|---|---|---|
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
| Example: output by mtpnet | ||
|---|---|---|
![]() |
![]() |
![]() |
| Result | Visualization | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| Result | Visualization | ||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| Model | Size | Params | Batch4 | Batch8 | Batch16 | Batch32 | Average |
|---|---|---|---|---|---|---|---|
YOLOP |
640 | 7.9M | 50 | 47 | 50 | 44 | 47 |
HybridNets |
640 | 12.8M | 29 | 32 | 47 | 59 | 41 |
YOLOPv2 |
640 | 38.9M | 66 | 79 | 79 | 75 | 74 |
MtpNet |
640 | 50.7M | 55 | 63 | 65 | 66 | 62 |
You can get the model from here.
You can detect images or videos.
python demo_onnx.py --source demo/images/example.png # Identify the specified image
python demo_onnx.py --source demo/images # Identify all images in the folder
python demo_onnx.py --source demo/videos/example.mp4 # Identify designated videos
python demo_onnx.py --source demo/videos # Identify all videos in the folder
python demo_onnx.py --source 0 # Real recognition through camera- Download the images from images.
- Download the annotations of detection from det_annotations.
- Download the annotations of drivable area segmentation from da_seg_annotations.
- Download the annotations of lane line segmentation from ll_seg_annotations.
MtpNet is released under the MIT Licence.











