Note: Development is currently paused. For context on lessons learned and next steps, see SHORTCOMINGS.md.
This repository contains the code and experiments for building a modular, multi-task self-driving system based on a Mixture-of-Experts (MoE) architecture. The goal is to develop a robust model capable of navigating complex environments in the CARLA simulator.
As part of this project, I created and released a collection of datasets to support research in autonomous driving.
- CARLA Autopilot Multimodal – 82K frames (~365 GB) with semantic segmentation, LiDAR, bounding boxes, and environment metadata for sensor fusion and RL research.
- CARLA Autopilot Images – 68K frames (~188 GB) of multi-camera data with synchronized ego state and controls for imitation learning and vision-to-control tasks.
The core of this project is a Mixture-of-Experts model. Instead of a single, monolithic network, this architecture uses:
- Specialized "Expert" Models: A collection of smaller, fine-tuned neural networks, each mastering a specific perception task (e.g., object detection, drivable area segmentation).
- Gating Network: A lightweight network that learns to weigh and combine the outputs of the experts based on the current driving context.
This approach is designed to be more modular, interpretable, and efficient than end-to-end models. The final integrated system will be tested and refined in a high-fidelity simulated environment.
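As a concrete illustration, here is a minimal sketch of the gating idea in PyTorch. The module name, feature dimension, and the driving-context input are assumptions for exposition, not this project's actual interfaces.

```python
import torch
import torch.nn as nn

class GatedPerception(nn.Module):
    """Fuse expert outputs with context-dependent weights (illustrative sketch)."""

    def __init__(self, experts, feat_dim=256):
        super().__init__()
        # Hypothetical experts: each maps an image to a (B, feat_dim) feature vector.
        self.experts = nn.ModuleList(experts)
        # Lightweight gate: scores each expert from a driving-context embedding.
        self.gate = nn.Sequential(
            nn.Linear(feat_dim, 64),
            nn.ReLU(),
            nn.Linear(64, len(experts)),
        )

    def forward(self, image, context):
        feats = torch.stack([e(image) for e in self.experts], dim=1)  # (B, E, feat_dim)
        weights = torch.softmax(self.gate(context), dim=-1)           # (B, E)
        return (weights.unsqueeze(-1) * feats).sum(dim=1)             # (B, feat_dim)
```

One appeal of this design is interpretability: the softmax weights can be logged per frame, making it visible which expert the system leans on in a given scene.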
✅ Completed:
- Data collection and preprocessing pipelines for BDD100K, nuScenes, and CARLA have been established. (Waymo and Cosmos datasets are not yet included due to technical issues.)
- All expert models have been trained and evaluated on their primary datasets (Stage 2), establishing strong baselines.
- Pre-trained expert models have been fine-tuned on CARLA data (Stage 3) to adapt them to the simulator environment.
- The gating network and its training infrastructure have been built (Stages 5-6).
⏸️ Paused:
- Integrated MoE + Policy simulation testing (Stage 7).
This project follows a structured, multi-stage development plan that separates perception (seeing) from control (driving).
- ✅ Stage 1: Data Collection & Preprocessing
- Collect and process all primary datasets (BDD100K, nuScenes, CARLA raw data).
- ✅ Stage 2: Expert Training & Evaluation
- Train and evaluate the expert models (detection, segmentation, drivable) on their respective primary datasets to create strong, specialized baselines.
- ✅ Stage 3: CARLA Expert Adaptation
- Fine-tune experts on CARLA to reduce domain gap and produce clean outputs in the simulator environment.
- ✅ Stage 4: Policy Head Development
- Train a CARLA-specific control module (BC, IL, or RL) to turn perception outputs into {steer, throttle, brake} commands.
- ✅ Stage 5: Gating Network Implementation
- Design and implement the gating network architecture responsible for combining expert outputs before the policy head.
- ✅ Stage 6: Gating Network Training
- Train the gating network on CARLA-adapted expert outputs to improve expert routing in the target domain.
- ⏸️ Stage 7: Integrated MoE + Policy Simulation
- Wire perception experts, gating network, and control module into CARLA’s synchronous simulation loop.
- Evaluate closed-loop driving performance (route completion, infractions/km, jerk).
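To make the Stage 4 policy head concrete, here is a minimal PyTorch sketch. The layer sizes and the fused-feature input are illustrative assumptions, not the project's actual module.

```python
import torch
import torch.nn as nn

class PolicyHead(nn.Module):
    """Map fused perception features to {steer, throttle, brake} (illustrative sizes)."""

    def __init__(self, feat_dim=256):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(feat_dim, 128),
            nn.ReLU(),
            nn.Linear(128, 3),
        )

    def forward(self, fused_features):
        steer_raw, throttle_raw, brake_raw = self.mlp(fused_features).unbind(dim=-1)
        steer = torch.tanh(steer_raw)           # steer in [-1, 1]
        throttle = torch.sigmoid(throttle_raw)  # throttle in [0, 1]
        brake = torch.sigmoid(brake_raw)        # brake in [0, 1]
        return steer, throttle, brake
```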
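And a minimal sketch of the Stage 7 closed-loop wiring, using CARLA's Python API in synchronous mode. Here `get_camera_frame`, `gated_perception`, `policy_head`, `vehicle`, and `context` are hypothetical placeholders standing in for the pieces sketched above.

```python
import carla
import torch

client = carla.Client("localhost", 2000)
client.set_timeout(10.0)
world = client.get_world()

# Synchronous mode keeps sensor data, inference, and control in lock-step.
settings = world.get_settings()
settings.synchronous_mode = True
settings.fixed_delta_seconds = 0.05  # 20 Hz simulation step
world.apply_settings(settings)

try:
    for _ in range(1000):
        world.tick()                                     # advance the simulator one step
        image, context = get_camera_frame()              # hypothetical sensor helper
        with torch.no_grad():
            fused = gated_perception(image, context)     # MoE fusion (sketch above)
            steer, throttle, brake = policy_head(fused)  # policy head (sketch above)
        vehicle.apply_control(carla.VehicleControl(
            throttle=float(throttle),
            steer=float(steer),
            brake=float(brake),
        ))
finally:
    settings.synchronous_mode = False  # restore asynchronous mode on exit
    world.apply_settings(settings)
```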
For accepted arguments and environment variables, open each referenced script file.
```bash
# Environment setup
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```

```bash
# Data preprocessing
python3 scripts/preprocess_bdd100k.py
python3 scripts/preprocess_nuscenes.py
python3 scripts/preprocess_carla.py
```

```bash
# Expert training, CARLA fine-tuning, gating training, and inference
bash training/train_bdd100k_experts_ddp.sh
bash training/train_nuscenes_expert_ddp.sh
bash training/finetune_experts_carla.sh
bash training/train_gating_network.sh
bash inference/run_automoe.sh
```