PyTorch code for our paper "End-to-end Training of CNN-CRF via Differentiable Dual-Decomposition".
- image_segmentation/: training and validation scripts.
- lib/: core functions, data preparation, custom layers, model definitions, and utility functions.
- experiments/: *.yaml configuration files to run experiments.
The code was developed with Python 3.7.2 on Ubuntu 18.04.1. NVIDIA GPUs are needed for training and testing.
See requirements.txt for other dependencies.
- Install PyTorch v1.0.0 with CUDA >= 9, following the official instructions.
- Clone this repo; we will refer to the directory that you cloned as ${DDD_ROOT}.
- Install dependencies:
pip install -r requirements.txt
- Add the current project directory (${DDD_ROOT}) to the PYTHONPATH environment variable:
export PYTHONPATH=${PYTHONPATH}:${PWD}
- Compile the custom layers in ./lib/layers/dp-extension and ./lib/models/sync_bn/inplace_abn/src/ by going into each of these directories and running the following command (a combined sketch follows this list):
python setup.py install
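Since both extensions are built the same way, the two compile steps can be combined. Below is a minimal shell sketch, assuming it is run from ${DDD_ROOT} with the Python environment above active:

cd ${DDD_ROOT}/lib/layers/dp-extension && python setup.py install   # compile the dp-extension custom layer
cd ${DDD_ROOT}/lib/models/sync_bn/inplace_abn/src && python setup.py install   # compile the inplace_abn extension
cd ${DDD_ROOT}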
- Download the original PASCAL VOC2012 dataset and extract the tarball onto your disk. We denote the location where the dataset was extracted as ${VOC_ROOT}; it should contain a folder called VOCdevkit.
- Create a new directory named data at the root directory of this project.
- Create a symbolic link to the VOC2012 dataset via the following command:
ln -s ${VOC_ROOT}/VOCdevkit/VOC2012/ ${DDD_ROOT}/data/pascal_voc
- After all of the above steps you should have the following structure in ./data/pascal_voc:
${DDD_ROOT}
`-- data
    `-- pascal_voc
        |-- Annotations
        |-- ImageSets
        |-- JPEGImages
        |-- SegmentationClass
        `-- SegmentationObject
- Download the Berkeley augmented dataset (SBD) for additional annotations on VOC2012. Extract the .tgz file to an arbitrary location which we will denote as ${SBD_ROOT}; it should contain a folder called benchmark_RELEASE.
- Create a symbolic link to the SBD dataset via the following command:
ln -s ${SBD_ROOT}/benchmark_RELEASE/ ${DDD_ROOT}/data/sbd
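As an optional sanity check (not required by the pipeline), you can verify that both symbolic links resolve:

ls ${DDD_ROOT}/data/pascal_voc/JPEGImages | head    # should list VOC2012 images
ls ${DDD_ROOT}/data/sbd                             # should show the benchmark_RELEASE contents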
- Download the ImageNet-pretrained ResNet-50 model and put it under ${DDD_ROOT}/models/pytorch/imagenet/ (see the sketch below).
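A minimal sketch of this step, assuming the checkpoint was downloaded to your current directory; the filename resnet50-imagenet.pth is hypothetical and should match whatever name the configuration files expect:

mkdir -p ${DDD_ROOT}/models/pytorch/imagenet/
mv resnet50-imagenet.pth ${DDD_ROOT}/models/pytorch/imagenet/   # hypothetical filename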
To train the baseline DeepLabV3 model (with ASPP), run:
CUDA_VISIBLE_DEVICES=$GPU_IDS python image_segmentation/train_voc.py --cfg experiments/pascal_voc/resnet50-aspp_513x513_head-lr-1x_sgd-poly-lr7e-3_2gpus.yaml
To train DeepLabV3 with dual-decomposition end-to-end, run:
CUDA_VISIBLE_DEVICES=$GPU_IDS python image_segmentation/train_voc.py --cfg experiments/pascal_voc/resnet50-aspp_513x513_ne-fpi-iter5_head-lr-1x_sgd-poly_lr7e-3_2gpus.yaml
Two GPUs, each with >= 11 GB of memory, are required to train either model.
Model checkpoints and logs will be saved into the output folder, while TensorBoard logs will be saved into the log folder.
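To monitor training progress, you can point TensorBoard at the log directory (assuming TensorBoard is installed in your environment):

tensorboard --logdir log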
To test the model after training, run:
CUDA_VISIBLE_DEVICES=$GPU_ID python image_segmentation/validate_voc.py --cfg ${PATH_TO_CONFIG_FILE}
where ${PATH_TO_CONFIG_FILE} is the same configuration file used during training. TensorBoard logs will be saved into the log folder.
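For example, to evaluate the baseline DeepLabV3 model trained above on a single GPU (GPU 0 here is an arbitrary choice):

CUDA_VISIBLE_DEVICES=0 python image_segmentation/validate_voc.py --cfg experiments/pascal_voc/resnet50-aspp_513x513_head-lr-1x_sgd-poly-lr7e-3_2gpus.yaml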
If you use our code or models in your research, please cite:
@article{wang2019DDD,
author = {Shaofei Wang and Vishnu Lokhande and Maneesh Singh and Konrad Kording and Julian Yarkony},
title = {End-to-end Training of CNN-CRF via Differentiable Dual-Decomposition},
journal = {CoRR},
volume = {abs/1912.02937},
year = {2019},
url = {http://arxiv.org/abs/1912.02937}
}
- The overall structure of the code follows Simple Baselines for Human Pose Estimation and Tracking.
- Xception backbone definition and weights: Pretrained models for Pytorch.
- Synchronized BatchNorm: In-Place Activated BatchNorm.