Skip to content

A collaborative multi-agent AI development environment powered by Gemini and DeepSeek. This tool features a virtual dev team (Senior, Junior, Tester, and Researcher) that can review code, perform web research, and edit files in real-time within a sandboxed environment.

Notifications You must be signed in to change notification settings

Andyi955/DevSquad-AI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

44 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿค– Multi-Agent Code Assistant

A fun, interactive web application where AI agents (DeepSeek & Gemini) collaborate as a development team to review, improve, and test your code!

๐Ÿ“ธ Screenshots

Main Interface
Main chat interface with real-time streaming and neon theme

Dashboard Demo
Live Multi-Agent Collaboration Dashboard

Deep Research Demo
Tandem agents performing parallel search and synthesis

Review Agent Dashboard
Review Agent analyzing agent performance and suggesting improvements

โœจ Features

  • ๐Ÿง™ Senior Dev (Gemini) - Architecture, planning, Mission Checklist management
  • ๐Ÿฃ Junior Dev (Gemini) - Implementation, code writing, bug fixes
  • ๐Ÿงช Unit Tester (Gemini) - Test creation (Pytest/Unittest), quality assurance
  • ๐Ÿ” Researcher (Gemini) - Targeted web searches, documentation lookups
  • ๐Ÿ—๏ธ Research Lead (Gemini) - Deep research orchestration, report synthesis
  • ๐Ÿ•ต๏ธโ€โ™‚๏ธ Review Agent (Gemini) - Silent observer that critiques performance and suggests system improvements

๐Ÿ”ฌ Dedicated Deep Research

The Research framework uses a Tandem Architecture for maximum depth and speed:

  1. Lead Architect (Gemini 3 Flash): Orchestrates the mission, performs high-speed web searches, and scrapes multiple sources in parallel.
  2. Synthesis Engine (DeepSeek V3): Analyzes the gathered raw data and synthesizes it into a high-impact Executive Report.

๐Ÿง  Hybrid "Thinking" Process

We utilize a combination of Gemini 3 Flash for development speed and DeepSeek-V3 (via deepseek-chat) for the final research synthesis and complex reasoning.

  • Agents explicitly show their internal monologue using <think> tags.
  • This "Show Your Work" approach allows you to see how the agent arrived at a solution before it writes any code.
  • The Junior Dev and Senior Dev use it to plan architectural and implementation steps.
  • The Researcher uses it to formulate search strategies and cross-reference sources.
  • The Summarizer uses it to analyze multi-source data for the final report.

Highlights

  • ๐ŸŽฌ Real-time streaming - Watch agents think and respond live
  • ๐Ÿ“ Mission Checklists - Agents create and track multi-step plans automatically
  • ๐Ÿ’ป Terminal Integration - Embedded PowerShell terminal with automated task execution
  • ๐Ÿงช Smart Test Runner - Agents can write AND run tests (pytest, unittest) autonomously
  • โšก Optimistic UI - Instant feedback for stop actions and state changes
  • ๐Ÿ“Ž File Context - Intelligent file reading (only reads what is needed)
  • ๐Ÿ”’ Safe Switch Management - Dynamically switch between project folders without data loss
  • ๐ŸŒˆ Color-coded Diffs - Visual representation of code additions and removals
  • ๐Ÿ“Š Usage tracking - Monitor API usage and costs
  • ๐Ÿ•ต๏ธโ€โ™‚๏ธ Self-Improving System - Review Agent analyzes performance and suggests prompt/orchestrator improvements

๐Ÿš€ Quick Start

Prerequisites

1. Clone & Setup

cd DevSquad-AI

# Copy environment file and add your API keys
cp .env.example .env
# Edit .env and add your GEMINI_API_KEY, DEEPSEEK_API_KEY, and SERPER_API_KEY

2. Backend Setup

cd backend

# Create virtual environment
python -m venv venv

# Activate (Windows)
.\venv\Scripts\activate

# Activate (Mac/Linux)
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt

# Install Playwright browsers (for web research)
playwright install chromium

# Start the server (Hypercorn recommended for Windows/Playwright stability)
hypercorn main:app --bind 0.0.0.0:8000 --reload

3. Frontend Setup

cd frontend

# Install dependencies
npm install

# Start dev server
npm run dev

4. Open the App

Visit http://localhost:5173 ๐ŸŽ‰


๐Ÿ“ Project Structure

DevSquad-AI/
โ”œโ”€โ”€ backend/
โ”‚   โ”œโ”€โ”€ main.py              # FastAPI server
โ”‚   โ”œโ”€โ”€ agents/              # AI agent personas
โ”‚   โ”‚   โ”œโ”€โ”€ orchestrator.py  # Manages conversations
โ”‚   โ”‚   โ”œโ”€โ”€ senior_dev.py    # Gemini senior dev
โ”‚   โ”‚   โ”œโ”€โ”€ junior_dev.py    # DeepSeek junior dev
โ”‚   โ”‚   โ”œโ”€โ”€ unit_tester.py   # Gemini tester
โ”‚   โ”‚   โ”œโ”€โ”€ researcher.py    # DeepSeek researcher
โ”‚   โ”‚   โ””โ”€โ”€ research_lead.py # Deep research coordinator
โ”‚   โ”œโ”€โ”€ prompts/             # Fine-tuned system prompts
โ”‚   โ”œโ”€โ”€ services/            # File manager, browser, etc.
โ”‚   โ””โ”€โ”€ requirements.txt
โ”œโ”€โ”€ frontend/
โ”‚   โ”œโ”€โ”€ src/
โ”‚   โ”‚   โ”œโ”€โ”€ App.jsx          # Main layout
โ”‚   โ”‚   โ”œโ”€โ”€ components/      # React components
โ”‚   โ”‚   โ””โ”€โ”€ index.css        # Neon theme
โ”‚   โ””โ”€โ”€ package.json
โ”œโ”€โ”€ projects/                # Your dynamically managed project folders
โ”œโ”€โ”€ .env                     # Your API keys
โ””โ”€โ”€ README.md

๐ŸŽฎ How It Works

๐Ÿง  System Logic

The application is designed around a State Machine pattern where each AI agent acts as a specialized node in a collaborative workflow.

  • Mission Checklists: The Senior Dev breaks down complex requests into a step-by-step [MISSION_CHECKLIST]. Agents execute one step at a time and mark it complete.
  • Autonomous Delegation: Agents decide who is best suited for the next step (e.g., Senior Dev hands off implementation to Junior Dev, who hands off to Unit Tester).
  • The Orchestrator: The central orchestrator.py manages the "handoff" logic, parsing [โ†’AGENT] cues and ensuring strict sequential execution.
  • Terminal & Task Safety: On Windows, PowerShell is the forced default to ensure terminal stability. The system uses log debouncing and fuzzy echo filtering to prevent terminal "noise" from cluttering the agent's context memory.
  • Priority Review: Agents cannot execute code until the user has approved the file changes. The orchestrator now pauses all follow-up tasks (like RUN_TESTS) specifically until the "Save" is confirmed.

๐Ÿ•ต๏ธโ€โ™‚๏ธ Review Agent & Self-Improvement

The Review Agent operates in Shadow Mode, silently observing all agent interactions and providing:

  • Performance Scoring: Each agent response is rated 0-100 based on accuracy, efficiency, and style
  • Pattern Detection: Identifies recurring issues (e.g., "Junior Dev consistently forgets error handling")
  • Prompt Engineering: Suggests concrete improvements to system prompts (backend/prompts/*.md)
  • Handoff Optimization: Analyzes and suggests improvements to the orchestrator's cue detection and agent selection logic
  • One-Click Fixes: Review suggestions appear as pending changes that can be applied with a single click

This creates a self-improving system where the AI agents get better over time based on their own performance analysis.

Agent Cue System

Agents communicate via special cues in their responses:

Cue Action
[โ†’SENIOR] Pass conversation to Senior Dev
[โ†’JUNIOR] Pass to Junior Dev
[โ†’TESTER] Pass to Unit Tester
[โ†’RESEARCH] Request web research
[MISSION_CHECKLIST] Create a new task plan
[CHECKLIST_UPDATE] Mark a task step as complete
[PROJECT_COMPLETE] Mark the entire mission as finished
[EDIT_FILE:path] Propose file edit (requires approval)
[CREATE_FILE:path] Create a new file (requires approval)
[READ_FILE:path] Read file content in background
[RUN_TESTS:cmd] Automate test execution (e.g., pytest)
[RUN_COMMAND:cmd] Run general shell commands for verification
[DONE] End current turn

File Safety

  • โœ… Create new files
  • โœ… Edit existing files
  • โœ… Delete files (requires explicit approval)
  • ๐Ÿ”’ All changes require user approval
  • ๐Ÿ“ Dynamic Sandboxing - Prevents access outside the active project folder
  • ๐Ÿงน Intelligent Scrubbing - Raw technical tags and oversized code blocks are moved to the Review Panel for readability

โš™๏ธ Configuration

Edit .env to customize:

# Required
GEMINI_API_KEY=your_key
DEEPSEEK_API_KEY=your_key

# Optional
MAX_FILE_SIZE_MB=10          # Max upload size
ENABLE_BROWSER_AGENT=true    # Enable web browsing
USAGE_LIMIT_PER_DAY=1000     # API call limit

๐Ÿ› ๏ธ API Endpoints

Endpoint Method Description
/upload POST Upload files/folders to project
/chat POST Send message to agents
/ws/agents WebSocket Real-time agent stream
/files GET List active project files
/create-folder POST Create a new directory
/move POST Move files/folders (Drag & Drop)
/rename POST Rename files/folders inline
/select-folder GET Open native folder picker
/set-workspace POST Switch active project
/approve POST Approve/Reject file changes
/research POST Web research query
/usage GET API usage stats

๐Ÿ”ฎ Roadmap

Phase 2: AWS Deployment

  • S3 for file storage
  • Lambda for agent execution
  • DynamoDB for session history
  • CloudFront CDN

Future Ideas

  • More agent personas (DevOps, Security, etc.)
  • Git integration
  • Voice chat with agents
  • VS Code extension

๐Ÿ“ License

MIT License - feel free to use and modify!


๐Ÿ™ Acknowledgments


๐Ÿ“ž Contact

Andrew Ivory


About

A collaborative multi-agent AI development environment powered by Gemini and DeepSeek. This tool features a virtual dev team (Senior, Junior, Tester, and Researcher) that can review code, perform web research, and edit files in real-time within a sandboxed environment.

Resources

Stars

Watchers

Forks

Packages

No packages published