A fun, interactive web application where AI agents (DeepSeek & Gemini) collaborate as a development team to review, improve, and test your code!
Main chat interface with real-time streaming and neon theme
Live Multi-Agent Collaboration Dashboard
Tandem agents performing parallel search and synthesis
Review Agent analyzing agent performance and suggesting improvements
- ๐ง Senior Dev (Gemini) - Architecture, planning, Mission Checklist management
- ๐ฃ Junior Dev (Gemini) - Implementation, code writing, bug fixes
- ๐งช Unit Tester (Gemini) - Test creation (Pytest/Unittest), quality assurance
- ๐ Researcher (Gemini) - Targeted web searches, documentation lookups
- ๐๏ธ Research Lead (Gemini) - Deep research orchestration, report synthesis
- ๐ต๏ธโโ๏ธ Review Agent (Gemini) - Silent observer that critiques performance and suggests system improvements
The Research framework uses a Tandem Architecture for maximum depth and speed:
- Lead Architect (Gemini 3 Flash): Orchestrates the mission, performs high-speed web searches, and scrapes multiple sources in parallel.
- Synthesis Engine (DeepSeek V3): Analyzes the gathered raw data and synthesizes it into a high-impact Executive Report.
We utilize a combination of Gemini 3 Flash for development speed and DeepSeek-V3 (via deepseek-chat) for the final research synthesis and complex reasoning.
- Agents explicitly show their internal monologue using
<think>tags. - This "Show Your Work" approach allows you to see how the agent arrived at a solution before it writes any code.
- The Junior Dev and Senior Dev use it to plan architectural and implementation steps.
- The Researcher uses it to formulate search strategies and cross-reference sources.
- The Summarizer uses it to analyze multi-source data for the final report.
- ๐ฌ Real-time streaming - Watch agents think and respond live
- ๐ Mission Checklists - Agents create and track multi-step plans automatically
- ๐ป Terminal Integration - Embedded PowerShell terminal with automated task execution
- ๐งช Smart Test Runner - Agents can write AND run tests (
pytest,unittest) autonomously - โก Optimistic UI - Instant feedback for stop actions and state changes
- ๐ File Context - Intelligent file reading (only reads what is needed)
- ๐ Safe Switch Management - Dynamically switch between project folders without data loss
- ๐ Color-coded Diffs - Visual representation of code additions and removals
- ๐ Usage tracking - Monitor API usage and costs
- ๐ต๏ธโโ๏ธ Self-Improving System - Review Agent analyzes performance and suggests prompt/orchestrator improvements
cd DevSquad-AI
# Copy environment file and add your API keys
cp .env.example .env
# Edit .env and add your GEMINI_API_KEY, DEEPSEEK_API_KEY, and SERPER_API_KEYcd backend
# Create virtual environment
python -m venv venv
# Activate (Windows)
.\venv\Scripts\activate
# Activate (Mac/Linux)
source venv/bin/activate
# Install dependencies
pip install -r requirements.txt
# Install Playwright browsers (for web research)
playwright install chromium
# Start the server (Hypercorn recommended for Windows/Playwright stability)
hypercorn main:app --bind 0.0.0.0:8000 --reloadcd frontend
# Install dependencies
npm install
# Start dev server
npm run devVisit http://localhost:5173 ๐
DevSquad-AI/
โโโ backend/
โ โโโ main.py # FastAPI server
โ โโโ agents/ # AI agent personas
โ โ โโโ orchestrator.py # Manages conversations
โ โ โโโ senior_dev.py # Gemini senior dev
โ โ โโโ junior_dev.py # DeepSeek junior dev
โ โ โโโ unit_tester.py # Gemini tester
โ โ โโโ researcher.py # DeepSeek researcher
โ โ โโโ research_lead.py # Deep research coordinator
โ โโโ prompts/ # Fine-tuned system prompts
โ โโโ services/ # File manager, browser, etc.
โ โโโ requirements.txt
โโโ frontend/
โ โโโ src/
โ โ โโโ App.jsx # Main layout
โ โ โโโ components/ # React components
โ โ โโโ index.css # Neon theme
โ โโโ package.json
โโโ projects/ # Your dynamically managed project folders
โโโ .env # Your API keys
โโโ README.md
The application is designed around a State Machine pattern where each AI agent acts as a specialized node in a collaborative workflow.
- Mission Checklists: The Senior Dev breaks down complex requests into a step-by-step
[MISSION_CHECKLIST]. Agents execute one step at a time and mark it complete. - Autonomous Delegation: Agents decide who is best suited for the next step (e.g., Senior Dev hands off implementation to Junior Dev, who hands off to Unit Tester).
- The Orchestrator: The central
orchestrator.pymanages the "handoff" logic, parsing[โAGENT]cues and ensuring strict sequential execution. - Terminal & Task Safety: On Windows, PowerShell is the forced default to ensure terminal stability. The system uses log debouncing and fuzzy echo filtering to prevent terminal "noise" from cluttering the agent's context memory.
- Priority Review: Agents cannot execute code until the user has approved the file changes. The orchestrator now pauses all follow-up tasks (like
RUN_TESTS) specifically until the "Save" is confirmed.
The Review Agent operates in Shadow Mode, silently observing all agent interactions and providing:
- Performance Scoring: Each agent response is rated 0-100 based on accuracy, efficiency, and style
- Pattern Detection: Identifies recurring issues (e.g., "Junior Dev consistently forgets error handling")
- Prompt Engineering: Suggests concrete improvements to system prompts (
backend/prompts/*.md) - Handoff Optimization: Analyzes and suggests improvements to the orchestrator's cue detection and agent selection logic
- One-Click Fixes: Review suggestions appear as pending changes that can be applied with a single click
This creates a self-improving system where the AI agents get better over time based on their own performance analysis.
Agents communicate via special cues in their responses:
| Cue | Action |
|---|---|
[โSENIOR] |
Pass conversation to Senior Dev |
[โJUNIOR] |
Pass to Junior Dev |
[โTESTER] |
Pass to Unit Tester |
[โRESEARCH] |
Request web research |
[MISSION_CHECKLIST] |
Create a new task plan |
[CHECKLIST_UPDATE] |
Mark a task step as complete |
[PROJECT_COMPLETE] |
Mark the entire mission as finished |
[EDIT_FILE:path] |
Propose file edit (requires approval) |
[CREATE_FILE:path] |
Create a new file (requires approval) |
[READ_FILE:path] |
Read file content in background |
[RUN_TESTS:cmd] |
Automate test execution (e.g., pytest) |
[RUN_COMMAND:cmd] |
Run general shell commands for verification |
[DONE] |
End current turn |
- โ Create new files
- โ Edit existing files
- โ Delete files (requires explicit approval)
- ๐ All changes require user approval
- ๐ Dynamic Sandboxing - Prevents access outside the active project folder
- ๐งน Intelligent Scrubbing - Raw technical tags and oversized code blocks are moved to the Review Panel for readability
Edit .env to customize:
# Required
GEMINI_API_KEY=your_key
DEEPSEEK_API_KEY=your_key
# Optional
MAX_FILE_SIZE_MB=10 # Max upload size
ENABLE_BROWSER_AGENT=true # Enable web browsing
USAGE_LIMIT_PER_DAY=1000 # API call limit| Endpoint | Method | Description |
|---|---|---|
/upload |
POST | Upload files/folders to project |
/chat |
POST | Send message to agents |
/ws/agents |
WebSocket | Real-time agent stream |
/files |
GET | List active project files |
/create-folder |
POST | Create a new directory |
/move |
POST | Move files/folders (Drag & Drop) |
/rename |
POST | Rename files/folders inline |
/select-folder |
GET | Open native folder picker |
/set-workspace |
POST | Switch active project |
/approve |
POST | Approve/Reject file changes |
/research |
POST | Web research query |
/usage |
GET | API usage stats |
- S3 for file storage
- Lambda for agent execution
- DynamoDB for session history
- CloudFront CDN
- More agent personas (DevOps, Security, etc.)
- Git integration
- Voice chat with agents
- VS Code extension
MIT License - feel free to use and modify!
- Google Gemini - AI backbone (flash-preview-series)
- Playwright - Browser automation
- FastAPI - Backend framework
- React - Frontend framework