AI Dev OS - Unified AI Development Platform

A complete software development workflow for autonomous AI agents combining Deep Agents orchestration, Superpowers skills enforcement, Newton physics simulation, Unsloth model training, BitNet inference, and real-time Claude HUD observability.

🎯 What is AI Dev OS?

AI Dev OS is an integrated platform where autonomous AI agents can handle complete engineering workflows—from design through deployment—with human oversight at key checkpoints. It's built on the same architecture used by leading orgs like Stripe, Ramp, and Coinbase for internal coding agents.

Key components:

🤖 Open SWE - Agent orchestration with sandboxed execution
🧠 Superpowers - Mandatory workflow enforcement (brainstorming → planning → TDD → review)
🎓 Unsloth - Fast model training (2x speedup, 70% less VRAM)
⚙️ Newton - GPU-accelerated physics simulation for robotics
⚡ BitNet.cpp - Efficient 1-bit LLM inference on CPU/GPU
📊 Claude HUD - Real-time observability of context, tools, agents, and progress

🏗️ Visual Architecture

I have integrated the official Unified AI Platform Architecture from your design board into the project documentation.

Developer Request (Slack/Linear/CLI)
    ↓
    └─→ Open SWE Harness (Deep Agents + Middleware)
        ├─→ Superpowers:brainstorming (Design refinement)
        ├─→ Superpowers:using-git-worktrees (Isolated branch)
        ├─→ Superpowers:writing-plans (Task breakdown)
        └─→ Subagent Orchestration
            ├─→ Agent-A (Sandbox-1): Code + Testing
            ├─→ Agent-B (Sandbox-2): Training (Unsloth)
            └─→ Agent-C (Sandbox-3): Simulation (Newton)
                    ↓
            Real-time Feedback (Claude HUD)
                    ↓
            Superpowers:verification + Code Review
                    ↓
            Auto-PR + Merge to Production

🚀 Quick Start

Prerequisites

Python 3.10+
Node.js 18+ (for Claude Code)
Docker (recommended for sandboxes)
NVIDIA GPU (for Newton + Unsloth training, optional for inference)

Installation

1. Clone and Setup

git clone https://github.com/Imposter-zx/ai-dev-os.git
cd ai-dev-os

2. Install Dependencies (Powered by `uv`)

# We use uv for lightning-fast dependency management
uv sync --all-groups

3. Initialize Sandboxes (Modal, Daytona, or Runloop)

python scripts/setup-sandboxes.py --provider modal
# or
python scripts/setup-sandboxes.py --provider daytona

4. Configure Claude Code & Plugins

# In Claude Code, run:
/plugin marketplace add obra/superpowers-marketplace
/plugin install superpowers@superpowers-marketplace
/plugin install claude-hud
/claude-hud:setup

5. Set Up AGENTS.md (Repo Conventions)

cp templates/AGENTS.md.template ./AGENTS.md
# Edit AGENTS.md with your repo's conventions

6. Start AI Dev OS

python -m ai_dev_os start --mode development

📋 Project Structure

ai-dev-os/
├── README.md                          # This file
├── AGENTS.md                          # Repo conventions for agents
├── requirements.txt                   # Core dependencies
├── requirements-dev.txt               # Development tools
├── pyproject.toml                     # Project metadata
├── .github/
│   └── workflows/
│       └── agent-validation.yml       # CI/CD for agent runs
├── src/
│   ├── ai_dev_os/
│   │   ├── __init__.py
│   │   ├── core.py                    # Main orchestration engine
│   │   ├── agents.py                  # Deep Agents wrapper
│   │   ├── sandbox.py                 # Sandbox abstraction
│   │   ├── skills.py                  # Superpowers skill loader
│   │   ├── hud.py                     # Claude HUD integration
│   │   ├── models.py                  # Training + inference (Unsloth + BitNet)
│   │   └── simulation.py              # Newton integration
│   ├── integrations/
│   │   ├── slack.py                   # Slack bot
│   │   ├── linear.py                  # Linear issue integration
│   │   └── github.py                  # GitHub PR automation
│   └── utils/
│       ├── context.py                 # Context window management
│       └── logger.py                  # Structured logging
├── scripts/
│   ├── setup-sandboxes.py             # Sandbox initialization
│   ├── create-skill.py                # Skill generation helper
│   ├── run-benchmark.py               # Performance benchmarking
│   └── migrate-to-bitnet.py           # Model quantization
├── templates/
│   ├── AGENTS.md.template             # Agent rules template
│   ├── skill-template/                # Superpowers skill scaffold
│   └── example-project/               # Complete example (robot walker)
├── tests/
│   ├── test_agents.py
│   ├── test_sandbox.py
│   ├── test_skills.py
│   └── test_models.py
├── docs/
│   ├── ARCHITECTURE.md                # Detailed architecture
│   ├── SETUP_GUIDE.md                 # Step-by-step setup
│   ├── WORKFLOWS.md                   # Common workflows
│   ├── CUSTOMIZATION.md               # How to customize
│   ├── API_REFERENCE.md               # API docs
│   └── TROUBLESHOOTING.md             # Common issues
└── examples/
    ├── robot-walker/                  # Quadruped controller example
    ├── model-training/                # Fine-tuning workflow
    └── multi-agent-research/          # Parallel simulation sweep

💡 Common Workflows

Workflow 1: Build a Feature Autonomously

# In Slack:
@openswe "Build authentication modal for login page"

# AI Dev OS:
1. Brainstorms design (Superpowers)
2. Creates implementation plan
3. Spawns subagents (code, tests, docs)
4. Validates in sandbox
5. Opens PR automatically
6. You review & merge

Workflow 2: Fine-tune a Model

from ai_dev_os import UnslothTrainer, BitNetInference

# Define training task
trainer = UnslothTrainer(
    model="meta-llama/Llama-2-7b",
    dataset="path/to/your/data.csv",
    output_quantization="1-bit"  # BitNet format
)

# Agent handles it
trainer.run_with_agent(
    sandbox="modal",
    monitor_hud=True  # Real-time Claude HUD updates
)

Workflow 3: Robotics Simulation Sweep

from ai_dev_os import NewtonSimulation, SubagentOrchestrator

# Define sweep parameters
sim = NewtonSimulation(
    robot="quadruped",
    terrain="stairs",
    episodes=1000
)

# Orchestrate parallel agents
orchestrator = SubagentOrchestrator(
    tasks=[
        ("sim", sim.config),
        ("train-policy", {"model": "Llama-8B", "bits": 4}),
        ("verify", {"metric": "success_rate", "threshold": 0.9})
    ],
    parallel=True,
    monitor_context=True  # Claude HUD watches context
)

results = orchestrator.run()

🔧 Configuration

AGENTS.md (Per-Repo Rules)

# AI Dev OS Rules for This Repo

## Tools
- Newton: enabled (GPU simulation)
- Unsloth: enabled (model training)
- BitNet: enabled (inference)

## Workflow Enforcement
- brainstorming: REQUIRED (design first)
- writing-plans: REQUIRED (plan before code)
- test-driven-development: REQUIRED (tests first)
- requesting-code-review: REQUIRED (review before merge)

## Thresholds
- context_warning: 75%
- context_critical: 90%
- test_coverage_min: 80%

## Custom Hooks
- pre_execution: validate AGENTS.md syntax
- post_merge: auto-update docs

Claude HUD Config

Create ~/.claude/plugins/claude-hud/config.json:

{
  "lineLayout": "expanded",
  "pathLevels": 2,
  "elementOrder": ["project", "context", "usage", "tools", "agents", "todos"],
  "display": {
    "showModel": true,
    "showContextBar": true,
    "showTools": true,
    "showAgents": true,
    "showTodos": true,
    "showDuration": true,
    "showSpeed": true
  },
  "colors": {
    "context": "cyan",
    "usage": "cyan",
    "warning": "yellow",
    "critical": "red"
  }
}

📚 Documentation

ARCHITECTURE.md - Deep dive into system design
SETUP_GUIDE.md - Detailed installation for each OS
WORKFLOWS.md - How to trigger and manage agent workflows
CUSTOMIZATION.md - Extend with custom skills/tools
API_REFERENCE.md - Complete API documentation
TROUBLESHOOTING.md - Debug common issues

🏆 Examples

Example 1: Robot Walker

A complete end-to-end example building an autonomous quadruped controller.

cd examples/robot-walker
python run.py

See examples/robot-walker/README.md for details.

Example 2: Model Fine-tuning

Fine-tune a model with Unsloth and quantize to BitNet format.

cd examples/model-training
python train.py --dataset ./data/custom.csv --output ./models/custom.gguf

Example 3: Parallel Research

Run a research sweep across 1000 simulation configurations.

cd examples/multi-agent-research
python sweep.py --configs 1000 --parallel-agents 10

🔌 Integrations

Slack

# In Slack, mention the bot:
@openswe "Your task here"
# Supports: repo:owner/name syntax for multi-repo

Linear

# In Linear, comment on any issue:
@openswe "Fix the bug in production"
# Agent reads full context, posts results as comment

GitHub

# Tag in PR comments:
@openswe "Address the review feedback"
# Agent fixes code and pushes to same branch

📊 Monitoring & Observability

Claude HUD (Real-time)

Integrated into your terminal. Shows:

Context usage (%) and remaining
Active agents and their status
Tools being used
Todo progress
Git branch and status

Logs

# View agent execution logs
tail -f ~/.ai-dev-os/logs/agents.log

# View sandbox logs
tail -f ~/.ai-dev-os/logs/sandbox.log

# View model training progress
tail -f ~/.ai-dev-os/logs/training.log

Dashboard (Optional)

uv run streamlit run app/dashboard.py
# Opens web UI at http://localhost:8501
# Features: Multi-user Auth, running agents, context usage, completed tasks, PR history

🧪 Testing

# Run all tests
uv run pytest

# Run with coverage
uv run pytest --cov=src

# Run specific test
uv run pytest tests/test_core_comprehensive.py

🚦 Status

✅ Deep Agents orchestration
✅ Superpowers caching & workflow enforcement
✅ Unsloth training implementation (Real fine-tuning)
✅ BitNet inference (Real CPU-optimized inference)
✅ Web Dashboard (Streamlit with multi-user Authentication)
✅ Prometheus Monitoring
✅ GitHub OAuth flow & PR automation
✅ Slack/Linear invocation with strict secret validation
✅ Newton physics simulation
✅ Modern CI/CD (uv based deterministic builds + Security gating with Bandit)
✅ Daytona Sandbox Support (Remote workspace orchestration)
🔜 Runloop sandbox support
🔜 Multi-GPU training optimization

🤝 Contributing

AI Dev OS is built by the community. To contribute:

Fork the repo
Create a feature branch: git checkout -b feature/your-feature
Follow CONTRIBUTING.md
Submit a PR

See CONTRIBUTING.md for detailed guidelines.

📝 License

MIT License - see LICENSE for details.

🙏 Acknowledgments

Built on the shoulders of giants:

LangGraph - Graph-based agent orchestration
Deep Agents - Agent framework
Superpowers - Workflow skills
Claude HUD - Terminal observability
Newton - Physics simulation
Unsloth - Fast LLM training
BitNet - Efficient inference

🔗 Links

Docs: ai-dev-os.dev
GitHub: Imposter-zx/ai-dev-os
Discord: Community Server
Twitter: @ai_dev_os

Ready to build with AI Dev OS? Start with SETUP_GUIDE.md →

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
.github		.github
app		app
docs		docs
examples		examples
scripts		scripts
src		src
tests		tests
.coverage		.coverage
.gitignore		.gitignore
.python-version		.python-version
AGENTS.md		AGENTS.md
ARCHITECTURE.md		ARCHITECTURE.md
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
DEPLOYMENT.md		DEPLOYMENT.md
LICENSE		LICENSE
PROJECT_SUMMARY.md		PROJECT_SUMMARY.md
QUICK_START.md		QUICK_START.md
README.md		README.md
SECURITY.md		SECURITY.md
baseline_roadmap_tests.txt		baseline_roadmap_tests.txt
baseline_test_output.txt		baseline_test_output.txt
coverage_report.txt		coverage_report.txt
final_coverage_report.txt		final_coverage_report.txt
final_final_test_report.txt		final_final_test_report.txt
pyproject.toml		pyproject.toml
test_output.txt		test_output.txt
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

AI Dev OS - Unified AI Development Platform

🎯 What is AI Dev OS?

🏗️ Visual Architecture

🚀 Quick Start

Prerequisites

Installation

1. Clone and Setup

2. Install Dependencies (Powered by uv)

3. Initialize Sandboxes (Modal, Daytona, or Runloop)

4. Configure Claude Code & Plugins

5. Set Up AGENTS.md (Repo Conventions)

6. Start AI Dev OS

📋 Project Structure

💡 Common Workflows

Workflow 1: Build a Feature Autonomously

Workflow 2: Fine-tune a Model

Workflow 3: Robotics Simulation Sweep

🔧 Configuration

AGENTS.md (Per-Repo Rules)

Claude HUD Config

📚 Documentation

🏆 Examples

Example 1: Robot Walker

Example 2: Model Fine-tuning

Example 3: Parallel Research

🔌 Integrations

Slack

Linear

GitHub

📊 Monitoring & Observability

Claude HUD (Real-time)

Logs

Dashboard (Optional)

🧪 Testing

🚦 Status

🤝 Contributing

📝 License

🙏 Acknowledgments

🔗 Links

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

2. Install Dependencies (Powered by `uv`)

Packages