A modular framework for training and evaluating deep learning models on image segmentation tasks. It does not offer fine-grained, out-of-the-box solutions; instead, it is well suited to learning, rapid prototyping, and validating research ideas. The codebase is well documented and kept deliberately simple to support both learning and easy extension.
This repository provides tools for image segmentation using deep learning. It features a clean, modular codebase with support for common segmentation architectures and standard training workflows.
- Multiple Architectures: U-Net and Attention U-Net implementations
- Modular Design: Organized code structure with separate modules for datasets, networks, and utilities
- Dynamic Configuration: Reflection-based mechanism that automatically loads the classes named in the configuration, making the framework easy to extend
- Configuration Management: YAML-based hyperparameter configuration
- Training Pipeline: Standard training loop with validation and checkpointing
- Experiment Tracking: Optional Weights & Biases integration
- Model Export: ONNX format support for deployment
- Clone the repository:

  ```bash
  git clone https://github.com/liu-bodong/CV-Lab.git
  cd CV-Lab
  ```

- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```
For a consistent environment with GPU support:
- Clone the repository:

  ```bash
  git clone https://github.com/liu-bodong/CV-Lab.git
  cd CV-Lab
  ```

- Using Docker Compose (Recommended):

  ```bash
  docker-compose up -d dev
  docker-compose exec dev bash
  ```

- Using Docker directly:

  ```bash
  # Build the image
  docker build -t cv-lab .

  # Run with GPU support
  docker run --gpus all -v $(pwd):/app -it cv-lab bash
  ```
Use pre-built images without building locally:
- From GitHub Container Registry:

  ```bash
  # Pull the image
  docker pull ghcr.io/liu-bodong/cv-lab:latest

  # Clone repository for code and configs
  git clone https://github.com/liu-bodong/CV-Lab.git
  cd CV-Lab

  # Run with pre-built image
  docker run --gpus all -v $(pwd):/app -it ghcr.io/liu-bodong/cv-lab:latest bash
  ```

- From Docker Hub:

  ```bash
  # Pull the image
  docker pull liubodong/cv-lab:latest

  # Clone repository for code and configs
  git clone https://github.com/liu-bodong/CV-Lab.git
  cd CV-Lab

  # Run with pre-built image
  docker run --gpus all -v $(pwd):/app -it liubodong/cv-lab:latest bash
  ```

- Using Docker Compose with a pre-built image:

  ```yaml
  # Modify docker-compose.yml to use the pre-built image:
  services:
    dev:
      image: ghcr.io/liu-bodong/cv-lab:latest
      # Remove the 'build' section
  ```
Benefits of pre-built images:
- No build time required
- Consistent environment across different machines
- Faster setup for CI/CD pipelines
- Pre-tested configurations
- Start training:

  ```bash
  python train.py --config hyper.yaml
  ```
- Base Image: PyTorch 2.7.1 with CUDA 12.8 and cuDNN 9
- GPU Support: Automatic GPU detection and usage
- Volume Mounting: Code, data, and outputs are mounted for persistence
- Development: Interactive development with live code changes
- Pre-built Images: Available on GitHub Container Registry and Docker Hub
- Image Options: Build locally or use pre-built images for faster setup
- Python 3.8+
- PyTorch 1.9.0+
- torchvision 0.10.0+
- CUDA (optional, for GPU acceleration)
- Docker
- Docker Compose
- NVIDIA Docker runtime (for GPU support)
Note: Docker installation includes all Python dependencies and CUDA support automatically.
Each sub-directory functions as a module that exports specific components, which are looked up by reflection (a loading sketch follows the layout below).
CV-Lab/
├── datasets/ # Dataset classes and data loading utilities
├── networks/ # Model architectures (U-Net, Attention U-Net)
├── utils/ # Training utilities (logging, metrics, loss functions)
├── notebooks/ # Jupyter notebooks for analysis and visualization
├── runs/ # Training outputs and model checkpoints
├── train.py # Main training script
└── hyper.yaml # Configuration file
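The class names written in hyper.yaml are resolved against these modules at runtime. The snippet below is a minimal sketch of how such reflection-based loading can work; the helper name, package layout, and constructor arguments are illustrative assumptions, not the repository's exact implementation.

```python
# Illustrative sketch of reflection-based loading (not the repo's exact code).
# It assumes each package (e.g. networks, datasets) exposes its public classes
# at package level, so a name from hyper.yaml can be looked up directly.
import importlib

import yaml


def build_from_config(module_name: str, class_name: str, **kwargs):
    """Resolve `class_name` inside `module_name` and instantiate it."""
    module = importlib.import_module(module_name)  # e.g. "networks"
    cls = getattr(module, class_name)              # e.g. "UNet" -> class object
    return cls(**kwargs)


with open("hyper.yaml") as f:
    cfg = yaml.safe_load(f)

# The strings in the config must match the exported class names exactly.
# Constructor argument names below are assumptions for illustration.
model = build_from_config("networks", cfg["model_type"],
                          in_channels=cfg["input_channels"],
                          out_channels=cfg["output_channels"])
dataset = build_from_config("datasets", cfg["dataset"], data_dir=cfg["data_dir"])
```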
Implement your dataset class in datasets/ or use the provided datasets as a reference. The framework expects dataset classes to return image-mask pairs.
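As a rough starting point, a custom dataset returning image-mask pairs might look like the sketch below; the directory layout, file naming, and constructor arguments are illustrative assumptions and should be adapted to your data and transforms.

```python
# Hypothetical custom dataset returning (image, mask) pairs.
# Directory layout and file naming are assumptions for illustration.
import os

from PIL import Image
from torch.utils.data import Dataset


class MySegmentationDataset(Dataset):
    def __init__(self, data_dir, image_transform=None, mask_transform=None):
        self.image_dir = os.path.join(data_dir, "images")
        self.mask_dir = os.path.join(data_dir, "masks")
        self.files = sorted(os.listdir(self.image_dir))
        self.image_transform = image_transform
        self.mask_transform = mask_transform

    def __len__(self):
        return len(self.files)

    def __getitem__(self, idx):
        name = self.files[idx]
        image = Image.open(os.path.join(self.image_dir, name)).convert("RGB")
        mask = Image.open(os.path.join(self.mask_dir, name)).convert("L")
        if self.image_transform:
            image = self.image_transform(image)
        if self.mask_transform:
            mask = self.mask_transform(mask)
        return image, mask
```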
Edit hyper.yaml to set your parameters. The framework uses a dynamic configuration system implemented via reflection in Python: each sub-module exports the classes and methods that can be used, and the configuration refers to them by name, so pay close attention to the exact spelling of exported components. The following example gives a glimpse of typical hyperparameter settings:
```yaml
# Model configuration
model_type: ModelClass
image_size: [256, 256]
input_channels: 3
output_channels: 1
channels: [64, 128, 256, 512] # for architectures that require specific channel sizes

# Training configuration
batch_size: 16
epochs: 100
init_lr: 0.0003
dataset: DatasetClass
data_dir: ./data/your_dataset
```

Local Environment:
```bash
python train.py --config hyper.yaml
```

Docker Environment:

```bash
# Start container and enter interactive shell
docker-compose up -d dev
docker-compose exec dev bash

# Run training inside container
python train.py --config hyper.yaml
```

Training outputs are saved to the runs/ directory:
runs/[model_name]_[timestamp]/
├── metrics.csv # Training metrics
├── best.pth # Best model checkpoint
├── last.pth # Final checkpoint
├── summary.yaml # Run configuration
└── plot.png # Training curves
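To reuse a finished run, the saved checkpoint can be loaded back into a model instance. The sketch below assumes best.pth stores a plain state_dict and uses illustrative constructor arguments and an assumed export name; check the run's summary.yaml for its actual configuration.

```python
# Sketch: reload the best checkpoint from a run for evaluation/inference.
# Assumes best.pth holds a state_dict and that UNet takes these arguments;
# adjust both to the actual run (see its summary.yaml).
import torch

from networks import UNet  # assumed export name

model = UNet(in_channels=3, out_channels=1, channels=[64, 128, 256, 512])
state = torch.load("runs/<model_name>_<timestamp>/best.pth",  # replace with a real run directory
                   map_location="cpu")
model.load_state_dict(state)
model.eval()

with torch.no_grad():
    dummy = torch.randn(1, 3, 256, 256)          # one RGB image at the configured size
    prediction = torch.sigmoid(model(dummy))     # per-pixel probabilities for a single output class
```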
The Docker setup is optimized for development with live code changes:
```bash
# Start development container
docker-compose up -d dev

# Enter container shell
docker-compose exec dev bash

# Code changes on the host are immediately reflected in the container
# Data and outputs persist in mounted volumes
```

- Code: `.:/app` - Live code editing
- Data: `./data:/app/data` - Persistent dataset storage
- Outputs: `./runs:/app/runs` - Training results persist on host
- Wandb: `./wandb:/app/wandb` - Experiment tracking logs
The container automatically detects and uses available GPUs:
```yaml
deploy:
  resources:
    reservations:
      devices:
        - driver: nvidia
          capabilities: [gpu]
```

```bash
# Stop container
docker-compose down

# Rebuild after requirements.txt changes
docker-compose build dev

# View logs
docker-compose logs dev
```

Currently, the framework supports the following models:
- ResNet50/101
- MobileNetV1
- U-Net
- Attention U-Net
- More models are under development; feel free to add your own (see the sketch below)!
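Adding a model amounts to dropping a new nn.Module into networks/, exporting it from the package, and referencing its class name as model_type in hyper.yaml. The module name and constructor signature below are illustrative assumptions; align them with how the existing networks are defined.

```python
# networks/tiny_segnet.py -- hypothetical minimal model added to the framework.
# Export it from the networks package (e.g. in networks/__init__.py) so it can
# be found by name via the configuration.
import torch.nn as nn


class TinySegNet(nn.Module):
    """Toy encoder plus 1x1 head, used only to illustrate the plug-in pattern."""

    def __init__(self, in_channels=3, out_channels=1, channels=None):
        super().__init__()
        channels = channels or [16, 32]
        self.encoder = nn.Sequential(
            nn.Conv2d(in_channels, channels[0], 3, padding=1), nn.ReLU(),
            nn.Conv2d(channels[0], channels[1], 3, padding=1), nn.ReLU(),
        )
        self.head = nn.Conv2d(channels[1], out_channels, 1)

    def forward(self, x):
        return self.head(self.encoder(x))
```

With the class exported, setting `model_type: TinySegNet` in hyper.yaml would let the reflection mechanism pick it up.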
The framework includes a BrainMRIDataset class, corresponding to a dataset on Kaggle, as an example implementation. You can create custom dataset classes for your own needs.
- Loads image-mask pairs from organized directory structure
- Applies separate transforms for images and masks
- Supports common medical image formats
- `model_type`: Model architecture
- `image_size`: Input dimensions `[height, width]`
- `input_channels`: Number of input channels
- `output_channels`: Number of output classes
- `channels`: A list of channel sizes (channel progression for the encoder/decoder)

- `batch_size`: Training batch size
- `epochs`: Number of training epochs
- `init_lr`: Initial learning rate
- `dataset`: Dataset class name
- `data_dir`: Path to dataset
- `val_split`: Validation split ratio
- Weights & Biases: Set
wandb_project,wandb_entity, andwandb_modefor experiment tracking - Early Stopping: Configure
patiencefor convergence early stopping - Learning Rate Scheduling: Use
lr_rampdown_epochsfor learning rate decay
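These keys live alongside the other entries in hyper.yaml. The values in the snippet below are only illustrative; the key names are the ones listed above.

```yaml
# Optional keys in hyper.yaml (values are illustrative)
wandb_project: cv-lab-experiments
wandb_entity: your-username
wandb_mode: online         # commonly "online", "offline", or "disabled"
patience: 10               # epochs without improvement before early stopping
lr_rampdown_epochs: 80     # epochs over which the learning rate decays
```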
The notebooks/ directory contains tools for analysis and visualization:
- `main.ipynb`: Model validation and output visualization
- `plot_csv.ipynb`: Generate plots from training metrics
- `export.ipynb`: Convert models to ONNX format (see the sketch below)
- `model_sanity.ipynb`: Architecture testing and debugging
- `wandb.ipynb`: Weights & Biases integration testing
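Model export is handled by export.ipynb; for reference, a minimal ONNX export with PyTorch looks like the sketch below. The model class, checkpoint path, input size, and file names are assumptions for illustration rather than the notebook's exact code.

```python
# Sketch of exporting a trained model to ONNX (see export.ipynb for the repo's workflow).
# Model class, checkpoint path, and input size are illustrative assumptions.
import torch

from networks import UNet  # assumed export name

model = UNet(in_channels=3, out_channels=1, channels=[64, 128, 256, 512])
model.load_state_dict(torch.load("runs/<model_name>_<timestamp>/best.pth", map_location="cpu"))
model.eval()

dummy_input = torch.randn(1, 3, 256, 256)   # batch of one RGB image at the configured size
torch.onnx.export(
    model,
    dummy_input,
    "model.onnx",
    input_names=["input"],
    output_names=["mask"],
    dynamic_axes={"input": {0: "batch"}, "mask": {0: "batch"}},  # allow variable batch size
    opset_version=17,
)
```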
This project is licensed under the MIT License - see the LICENSE file for details.
