Task Orchestration System

Sugar's Task Orchestration system enables intelligent decomposition and execution of complex features through staged workflows and specialist agent routing.

Overview

When Sugar encounters a large feature request, the orchestration system:

Detects that the task requires decomposition
Researches context via web search and codebase analysis
Plans the implementation and generates sub-tasks
Routes each sub-task to the appropriate specialist agent
Executes sub-tasks with parallelism where possible
Reviews the completed work before marking done

flowchart TB
    A[Large Feature Task] --> B

    subgraph B["Stage 1: RESEARCH"]
        B1["• Web search for best practices"]
        B2["• Gather relevant documentation"]
        B3["• Analyze existing codebase patterns"]
        B4["Agent: tech-lead / Explore"]
        B5["Output: context.md, research_findings.md"]
    end

    B -->|context passes forward| C

    subgraph C["Stage 2: PLANNING"]
        C1["• Create implementation plan"]
        C2["• Break into sub-tasks"]
        C3["• Identify specialist agents needed"]
        C4["Agent: tech-lead / Plan"]
        C5["Output: plan.md, sub-tasks[]"]
    end

    C -->|sub-tasks added to queue| D

    subgraph D["Stage 3: IMPLEMENTATION"]
        D1["Sub-task A: Auth UI → frontend-designer"]
        D2["Sub-task B: Auth API → backend-developer"]
        D3["Sub-task C: Auth tests → qa-engineer"]
        D4["Sub-task D: Auth docs → general-purpose"]
    end

    D -->|all sub-tasks complete| E

    subgraph E["Stage 4: REVIEW & INTEGRATION"]
        E1["• Code review all changes"]
        E2["• Run full test suite"]
        E3["• Verify feature works end-to-end"]
        E4["Agent: code-reviewer, qa-engineer"]
    end

Architecture

flowchart TB
    subgraph Orchestrator["TaskOrchestrator"]
        O1["Stage management"]
        O2["Context accumulation"]
        O3["Sub-task generation"]
    end

    subgraph Router["AgentRouter"]
        R1["Pattern matching on task content"]
        R2["Maps to specialist agents"]
    end

    subgraph Manager["SubAgentManager"]
        M1["Concurrency control"]
        M2["Isolated execution"]
    end

    subgraph Executor["AgentSDKExecutor"]
        E1["Agent SDK integration"]
    end

    Orchestrator -->|"Specialist selection"| Router
    Router -->|"Parallel execution"| Manager
    Manager -->|"Task execution"| Executor

Configuration

# .sugar/config.yaml
orchestration:
  enabled: true

  # When to trigger orchestration
  # - auto: System detects complex tasks automatically
  # - explicit: Only when task has orchestrate: true flag
  # - disabled: Never orchestrate, run tasks directly
  auto_decompose: "auto"

  # Detection rules for auto mode
  detection:
    # Task types that always trigger orchestration
    task_types: ["feature", "epic"]

    # Keywords in title/description that trigger orchestration
    keywords:
      - "implement"
      - "build"
      - "create full"
      - "add complete"
      - "redesign"
      - "refactor entire"

    # Minimum estimated complexity (future: AI-based estimation)
    min_complexity: "high"  # low, medium, high

  # Stage definitions
  stages:
    research:
      enabled: true
      agent: "Explore"
      timeout: 600  # 10 minutes
      actions:
        - web_search
        - codebase_analysis
        - doc_gathering
      output_to_context: true
      output_path: ".sugar/orchestration/{task_id}/research.md"

    planning:
      enabled: true
      agent: "Plan"
      timeout: 300  # 5 minutes
      depends_on: ["research"]
      creates_subtasks: true
      output_path: ".sugar/orchestration/{task_id}/plan.md"

    implementation:
      parallel: true
      max_concurrent: 3
      timeout_per_task: 1800  # 30 minutes per sub-task
      agent_routing:
        # Pattern → Agent mapping
        "*ui*|*frontend*|*component*|*design*": "frontend-designer"
        "*api*|*backend*|*endpoint*|*service*": "backend-developer"
        "*test*|*spec*|*coverage*": "qa-engineer"
        "*security*|*auth*|*permission*": "security-engineer"
        "*devops*|*deploy*|*ci*|*docker*": "devops-engineer"
        "*doc*|*readme*|*guide*": "general-purpose"
        "default": "general-purpose"

    review:
      enabled: true
      depends_on: ["implementation"]
      agents:
        - "code-reviewer"
        - "qa-engineer"
      run_tests: true
      require_passing: true

Components

TaskOrchestrator

The main orchestration engine that manages the workflow.

class TaskOrchestrator:
    """
    Orchestrates complex tasks through staged execution.

    Responsibilities:
    - Detect if task requires orchestration
    - Manage stage transitions
    - Accumulate context between stages
    - Track sub-task completion
    - Trigger review stage when implementation complete
    """

    async def should_orchestrate(self, task: Task) -> bool:
        """Determine if task needs orchestration based on config."""

    async def orchestrate(self, task: Task) -> OrchestrationResult:
        """Run full orchestration workflow for a task."""

    async def run_stage(self, stage: Stage, context: Context) -> StageResult:
        """Execute a single stage of the workflow."""

    async def generate_subtasks(self, plan: Plan) -> List[Task]:
        """Generate sub-tasks from planning stage output."""

AgentRouter

Routes tasks to appropriate specialist agents.

class AgentRouter:
    """
    Routes tasks to specialist agents based on content analysis.

    Supports:
    - Pattern matching on task title/description
    - Task type based routing
    - Fallback to default agent
    """

    def route(self, task: Task) -> str:
        """Return the agent name for a task."""

    def get_available_agents(self) -> List[str]:
        """List available specialist agents."""

Available Specialist Agents

These map to Claude Code's built-in agent types:

Agent	Use Case
`general-purpose`	Default for most tasks
`tech-lead`	Architecture, planning, complex decisions
`code-reviewer`	Code review, refactoring feedback
`frontend-designer`	UI/UX, components, styling
`backend-developer`	APIs, databases, server logic
`qa-engineer`	Testing, test strategies, coverage
`security-engineer`	Security audits, auth, vulnerabilities
`devops-engineer`	CI/CD, infrastructure, deployment
`Explore`	Codebase exploration, research
`Plan`	Implementation planning

Workflow Example

Input Task

sugar add "Add user authentication with OAuth support" --type feature

Stage 1: Research

The Explore agent:

Searches web for "OAuth 2.0 best practices 2025"
Analyzes codebase for existing auth patterns
Checks for existing user models
Reviews dependencies (existing auth libraries)

Output saved to .sugar/orchestration/{task_id}/research.md:

# Research: OAuth Authentication

## Web Research
- OAuth 2.0 recommended flow: Authorization Code with PKCE
- Popular libraries: authlib (Python), passport (Node)
- Security considerations: token storage, CSRF protection

## Codebase Analysis
- Existing User model in `app/models/user.py`
- No current auth implementation
- Using FastAPI framework
- SQLAlchemy for ORM

## Recommendations
- Use authlib for OAuth implementation
- Add OAuth provider configuration
- Implement token refresh mechanism

Stage 2: Planning

The Plan agent reads research context and creates:

# Implementation Plan: OAuth Authentication

## Sub-tasks

1. **Create OAuth Configuration**
   - Add OAuth provider settings
   - Environment variables for client ID/secret
   - Agent: backend-developer

2. **Implement OAuth Routes**
   - /auth/login - Initiate OAuth flow
   - /auth/callback - Handle OAuth callback
   - /auth/logout - Clear session
   - Agent: backend-developer

3. **Create Login UI**
   - Login page with OAuth buttons
   - Loading states
   - Error handling
   - Agent: frontend-designer

4. **Add Session Management**
   - JWT token generation
   - Token refresh logic
   - Session storage
   - Agent: security-engineer

5. **Write Tests**
   - Unit tests for OAuth flow
   - Integration tests for routes
   - E2E login flow test
   - Agent: qa-engineer

6. **Update Documentation**
   - Auth setup guide
   - Environment variables
   - API documentation
   - Agent: general-purpose

## Dependencies
- Tasks 1 must complete before 2, 3, 4
- Tasks 2, 3, 4 can run in parallel
- Task 5 depends on 2, 3, 4
- Task 6 can run anytime

Stage 3: Implementation

Sub-tasks added to queue with relationships:

Parent: "Add user authentication with OAuth support" (orchestrating)
  ├── Sub-task 1: "Create OAuth Configuration" (pending)
  ├── Sub-task 2: "Implement OAuth Routes" (blocked by 1)
  ├── Sub-task 3: "Create Login UI" (blocked by 1)
  ├── Sub-task 4: "Add Session Management" (blocked by 1)
  ├── Sub-task 5: "Write Tests" (blocked by 2,3,4)
  └── Sub-task 6: "Update Documentation" (pending)

Execution order:

Tasks 1 and 6 start (no blockers)
When 1 completes → Tasks 2, 3, 4 start in parallel
When 2, 3, 4 complete → Task 5 starts
When all complete → Stage 4 triggers

Stage 4: Review

The code-reviewer agent:

Reviews all file changes from sub-tasks
Checks for code quality issues
Verifies patterns are consistent

The qa-engineer agent:

Runs full test suite
Verifies OAuth flow works end-to-end
Reports any failures

If review passes → Parent task marked complete If review fails → Issues added as new tasks

Task Schema Extensions

@dataclass
class Task:
    id: str
    title: str
    description: str
    type: str  # bug_fix, feature, epic, etc.
    priority: int

    # Orchestration fields
    orchestrate: bool = False  # Explicit orchestration flag
    parent_task_id: Optional[str] = None  # Link to parent
    stage: Optional[str] = None  # Current stage
    blocked_by: List[str] = field(default_factory=list)  # Task IDs

    # Context accumulation
    context_path: Optional[str] = None  # Path to accumulated context

    # Routing
    assigned_agent: Optional[str] = None  # Specialist agent

Context Accumulation

Each stage can read context from previous stages:

class OrchestrationContext:
    """Accumulated context across orchestration stages."""

    task_id: str
    base_path: Path  # .sugar/orchestration/{task_id}/

    def add_research(self, content: str) -> None:
        """Add research findings."""

    def add_plan(self, content: str) -> None:
        """Add implementation plan."""

    def add_subtask_result(self, subtask_id: str, result: str) -> None:
        """Add result from completed sub-task."""

    def get_full_context(self) -> str:
        """Get accumulated context for current stage."""

    def get_files_modified(self) -> List[str]:
        """Get all files modified across sub-tasks."""

CLI Commands

# Add task with explicit orchestration
sugar add "Build payment system" --type feature --orchestrate

# View orchestration status
sugar status --orchestration

# View specific task's orchestration
sugar show <task_id> --stages

# Skip to implementation (bypass research/planning)
sugar add "Add logout button" --type feature --skip-stages research,planning

# Re-run a stage
sugar orchestrate <task_id> --stage planning

# View orchestration context
sugar context <task_id>

Future Enhancements

AI-Based Complexity Detection

Instead of keyword matching, use AI to estimate task complexity:

async def estimate_complexity(task: Task) -> ComplexityScore:
    """Use AI to estimate if task needs orchestration."""
    prompt = f"""
    Analyze this task and estimate complexity:
    Title: {task.title}
    Description: {task.description}

    Consider:
    - Number of files likely affected
    - Number of different concerns (UI, API, DB, etc.)
    - Integration complexity
    - Testing requirements

    Return: low, medium, or high
    """

Learning from History

Track orchestration outcomes to improve:

Which task types benefit most from orchestration
Optimal stage configurations
Agent routing accuracy
Time savings from parallelization

Custom Stage Definitions

Allow users to define custom stages:

orchestration:
  custom_stages:
    security_audit:
      enabled: true
      agent: "security-engineer"
      after: "implementation"
      before: "review"
      actions:
        - security_scan
        - vulnerability_check

Relationship to SubAgentManager

SubAgentManager is the low-level execution primitive used by the orchestration system:

Layer	Component	Responsibility
High	TaskOrchestrator	Workflow stages, context
Mid	AgentRouter	Specialist selection
Low	SubAgentManager	Parallel execution
Base	AgentSDKExecutor	Individual task execution

The orchestration system uses SubAgentManager when:

Running multiple sub-tasks in parallel during implementation stage
Executing parallel research queries
Running multiple review checks simultaneously

# Orchestrator using SubAgentManager for parallel execution
async def run_implementation_stage(self, subtasks: List[Task]) -> List[Result]:
    manager = SubAgentManager(
        parent_config=self.config,
        max_concurrent=self.stages["implementation"]["max_concurrent"]
    )

    # Group subtasks by dependency level
    ready_tasks = [t for t in subtasks if not t.blocked_by]

    # Execute ready tasks in parallel
    results = await manager.spawn_parallel([
        {
            "task_id": t.id,
            "prompt": t.to_prompt(),
            "agent": self.router.route(t)
        }
        for t in ready_tasks
    ])

    return results

Real-World Example Scenarios

Example 1: E-Commerce Payment Integration

A team needs to integrate Stripe payment processing into their e-commerce platform.

sugar add "Integrate Stripe payment processing" --type feature

Stage 1: Research The Explore agent:

Searches for Stripe API best practices and PCI compliance requirements
Analyzes existing checkout flow in the codebase
Identifies current order and cart models
Reviews existing payment placeholders

Stage 2: Planning The Plan agent creates subtasks:

1. backend-developer: Create Stripe configuration and API wrapper
2. backend-developer: Implement payment intent creation endpoint
3. backend-developer: Add webhook handlers for payment events
4. frontend-designer: Build payment form with Stripe Elements
5. security-engineer: Implement PCI-compliant token handling
6. qa-engineer: Write payment flow tests (success, failure, refunds)
7. general-purpose: Update checkout documentation

Stage 3: Implementation Parallel execution groups:

Group 1: Config + Webhook handlers (no dependencies)
Group 2: Payment endpoints + UI (after config)
Group 3: Tests + Docs (after implementation)

Stage 4: Review

Security review of token handling
Test payment flow end-to-end with Stripe test mode
Verify PCI compliance measures

Example 2: API Platform with GraphQL

Building a new GraphQL API layer on top of existing REST services.

sugar add "Build GraphQL API layer for user management" --type feature

How Orchestration Helps:

Stage	What Happens	Value Added
Research	Analyzes existing REST endpoints, identifies data models, reviews GraphQL schema patterns	Understands existing architecture before proposing changes
Planning	Creates schema design, identifies resolvers needed, plans authentication integration	Ensures comprehensive coverage of all endpoints
Implementation	Parallel work on schema, resolvers, auth middleware, client SDK	3x faster than sequential development
Review	Schema validation, resolver testing, performance benchmarking	Catches N+1 queries and auth gaps

Subtask Breakdown:

Schema & Types (backend-developer)
├── Define User type and queries
├── Define mutations (createUser, updateUser, deleteUser)
└── Add input types and validation

Resolvers (backend-developer)
├── User query resolvers
├── Mutation resolvers with REST service calls
└── DataLoader for batching

Auth & Middleware (security-engineer)
├── JWT validation middleware
├── Permission directives
└── Rate limiting

Testing (qa-engineer)
├── Schema snapshot tests
├── Resolver unit tests
└── Integration tests with mocked REST services

Documentation (general-purpose)
├── Schema documentation
├── Authentication guide
└── Example queries and mutations

Example 3: CI/CD Pipeline Setup

Setting up a complete CI/CD pipeline for a new microservice.

sugar add "Set up CI/CD pipeline with GitHub Actions and Kubernetes" --type feature

Orchestration Flow:

flowchart TB
    subgraph Research["RESEARCH: Explore agent"]
        R1["Analyze existing deployment scripts"]
        R2["Review Kubernetes cluster configuration"]
        R3["Check current GitHub Actions workflows"]
        R4["Identify environment requirements"]
    end

    subgraph Planning["PLANNING: Plan agent"]
        P1["1. devops-engineer: Dockerfile"]
        P2["2. devops-engineer: K8s manifests"]
        P3["3. devops-engineer: GH Actions"]
        P4["4. security-engineer: Secrets"]
        P5["5. qa-engineer: Tests"]
        P6["6. general-purpose: Docs"]
    end

    subgraph Implementation["IMPLEMENTATION: Parallel"]
        subgraph DevOps["devops-engineer"]
            D1["Dockerfile"]
            D2["K8s manifests"]
            D3["GH Actions"]
        end
        subgraph Security["security-engineer"]
            S1["Secrets setup"]
            S2["RBAC config"]
            S3["Image scanning"]
        end
        subgraph Docs["general-purpose"]
            Doc1["Documentation"]
        end
        subgraph QA["qa-engineer"]
            Q1["Smoke tests"]
            Q2["Rollback tests"]
        end
    end

    subgraph Review["REVIEW: Validate"]
        Rev1["Dry-run deployment to staging"]
        Rev2["Verify rollback mechanism"]
        Rev3["Check monitoring and alerting"]
    end

    Research --> Planning
    Planning --> Implementation
    DevOps --> QA
    Security --> QA
    Implementation --> Review

Example 4: Full-Stack Dashboard Feature

Building an analytics dashboard with real-time updates.

sugar add "Build real-time analytics dashboard" --type feature --orchestrate

Agent Routing in Action:

Subtask	Assigned Agent	Reasoning
Design dashboard layout and components	frontend-designer	Contains "dashboard", "design", "components"
Create WebSocket server for real-time data	backend-developer	Contains "server", "data"
Implement chart components with D3.js	frontend-designer	Contains "components"
Add user preference storage	backend-developer	Contains "storage"
Write component tests	qa-engineer	Contains "tests"
Set up WebSocket authentication	security-engineer	Contains "authentication"
Configure CDN for static assets	devops-engineer	Contains "CDN", "assets"

Parallel Execution Benefits:

Frontend and backend work happens simultaneously
Tests are written as features are built
Security review runs in parallel with documentation
Total time: ~40% of sequential execution

When Orchestration Shines

Best Use Cases

Cross-cutting features - Features that touch UI, API, database, and tests
Complex integrations - Third-party API integrations requiring research
New subsystems - Building entirely new functionality from scratch
Refactoring epics - Large-scale code modernization efforts
Security implementations - Auth, encryption, compliance features

Signs a Task Needs Orchestration

Description includes multiple technical domains (frontend + backend + database)
Requires research before implementation can begin
Will touch more than 10 files
Needs coordination between different specialties
Has complex dependencies between subtasks

When to Skip Orchestration

Simple bug fixes (one file, clear solution)
Documentation updates
Minor UI tweaks
Configuration changes
Single-endpoint API additions

Architecture Integration

flowchart TB
    subgraph App["Your Application"]
        A[Task Request]
    end

    subgraph Sugar["Sugar"]
        B[Work Queue<br/>SQLite] --> C[Task Executor]
        C --> D{Needs<br/>Orchestration?}

        D -->|No| E[Direct Execution]
        D -->|Yes| F[TaskOrchestrator]

        F --> G[Research<br/>Explore Agent]
        G --> H[Planning<br/>Plan Agent]
        H --> I[Implementation<br/>Parallel Agents]
        I --> J[Review<br/>Code Reviewer]

        subgraph Specialists["Specialist Agents"]
            S1[frontend-designer]
            S2[backend-developer]
            S3[qa-engineer]
            S4[security-engineer]
            S5[devops-engineer]
        end

        I --> Specialists
    end

    subgraph SDK["Claude Agent SDK"]
        K[Agent Execution]
    end

    subgraph API["Claude API"]
        L[Claude Models]
    end

    A --> B
    E --> K
    J --> K
    Specialists --> K
    K --> L

This architecture enables Sugar to handle everything from simple one-liner fixes to complex multi-day feature implementations, automatically choosing the right level of sophistication for each task.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Task Orchestration System

Overview

Architecture

Configuration

Components

TaskOrchestrator

AgentRouter

Available Specialist Agents

Workflow Example

Input Task

Stage 1: Research

Stage 2: Planning

Stage 3: Implementation

Stage 4: Review

Task Schema Extensions

Context Accumulation

CLI Commands

Future Enhancements

AI-Based Complexity Detection

Learning from History

Custom Stage Definitions

Relationship to SubAgentManager

Real-World Example Scenarios

Example 1: E-Commerce Payment Integration

Example 2: API Platform with GraphQL

Example 3: CI/CD Pipeline Setup

Example 4: Full-Stack Dashboard Feature

When Orchestration Shines

Best Use Cases

Signs a Task Needs Orchestration

When to Skip Orchestration

Architecture Integration

FilesExpand file tree

task_orchestration.md

Latest commit

History

task_orchestration.md

File metadata and controls

Task Orchestration System

Overview

Architecture

Configuration

Components

TaskOrchestrator

AgentRouter

Available Specialist Agents

Workflow Example

Input Task

Stage 1: Research

Stage 2: Planning

Stage 3: Implementation

Stage 4: Review

Task Schema Extensions

Context Accumulation

CLI Commands

Future Enhancements

AI-Based Complexity Detection

Learning from History

Custom Stage Definitions

Relationship to SubAgentManager

Real-World Example Scenarios

Example 1: E-Commerce Payment Integration

Example 2: API Platform with GraphQL

Example 3: CI/CD Pipeline Setup

Example 4: Full-Stack Dashboard Feature

When Orchestration Shines

Best Use Cases

Signs a Task Needs Orchestration

When to Skip Orchestration

Architecture Integration