PolyAgent Agent Core - Architecture Documentation

Overview

The PolyAgent Agent Core is a high-performance Rust implementation of the agent execution layer, providing secure sandboxing, efficient memory management, and intelligent tool orchestration. This document describes the modernized architecture following 2025 best practices.

Architecture Principles

Separation of Concerns: Intelligence (Python) vs Execution (Rust)
Zero-Copy Operations: Minimize string cloning and memory allocations
Modern Concurrency: Use OnceLock and std::sync::Once instead of lazy_static
Comprehensive Error Handling: Result<T> types with structured errors via thiserror
Observable Systems: OpenTelemetry tracing and Prometheus metrics
Security First: WASI sandboxing for untrusted code execution

Component Architecture

┌─────────────────────────────────────────────────────────────┐
│                     gRPC Server (port 50051)                │
├─────────────────────────────────────────────────────────────┤
│                    Enforcement Gateway                     │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────────┐ │
│  │  Timeouts    │  │  Rate Limits │  │ Circuit Breakers │ │
│  └──────────────┘  └──────────────┘  └──────────────────┘ │
├─────────────────────────────────────────────────────────────┤
│                    Tool Execution Layer                     │
│  ┌──────────┐  ┌──────────┐  ┌──────────┐  ┌──────────┐  │
│  │   Tool   │  │   Tool   │  │   Tool   │  │   WASI   │  │
│  │ Registry │  │   Cache  │  │ Executor │  │  Sandbox │  │
│  └──────────┘  └──────────┘  └──────────┘  └──────────┘  │
├─────────────────────────────────────────────────────────────┤
│                    Infrastructure Layer                     │
│  ┌──────────┐  ┌──────────┐  ┌──────────┐  ┌──────────┐  │
│  │  Memory  │  │  Config  │  │ Tracing  │  │ Metrics  │  │
│  │   Pool   │  │  Manager │  │  (OTEL)  │  │  (Prom)  │  │
│  └──────────┘  └──────────┘  └──────────┘  └──────────┘  │
└─────────────────────────────────────────────────────────────┘

Core Components

1. Enforcement Gateway (`src/enforcement.rs`)

Uniform per-request policy enforcement for every code path:

Timeouts: hard wall clock limit per request
Token ceiling: reject requests with excessive estimated tokens
Rate limiting: simple per-key token bucket
Circuit breaker: rolling error window per key
- Optional distributed limiter: set ENFORCE_RATE_REDIS_URL to enable a Redis-backed token bucket shared across instances.

Configuration lives under enforcement in config/agent.yaml with environment variable overrides (ENFORCE_*).

2. Tool System

Tool Registry (`src/tool_registry.rs`)

Centralized tool capability management
Discovery API with filtering and relevance scoring
Metadata including schemas, permissions, and TTL

Tool Cache (`src/tool_cache.rs`)

LRU caching with configurable TTL
Deterministic cache key generation
Automatic expiration and sweeping
Comprehensive statistics tracking

Tool Executor (`src/tools.rs`)

Unified interface for tool execution
Integration with Python LLM service
WASI sandbox routing for code execution
Automatic result caching

3. WASI Sandbox (`src/wasi_sandbox.rs`)

Secure WebAssembly execution environment with:

Filesystem isolation (read-only /tmp access)
Memory limits (configurable, default 256MB)
Execution timeouts (default 30s)
Fuel metering for CPU usage control

4. Memory Management (`src/memory.rs`)

Efficient memory pool with:

Pre-allocated memory blocks
Automatic garbage collection
Pressure-based rejection
Thread-safe allocation/deallocation

5. Configuration (`src/config.rs`)

Centralized configuration management:

YAML-based configuration files
Environment variable overrides (including enforcement: ENFORCE_*)
Hot-reload support (future)
Structured configuration types

6. Observability

Tracing (`src/tracing.rs`)

OpenTelemetry integration
W3C trace context propagation
Active span context injection
Cross-service tracing support

Metrics (`src/metrics.rs`)

Prometheus metrics export
Tool execution metrics
Memory usage tracking
Cache performance stats
Enforcement metrics: drops by reason, allowed outcomes

API Contracts

gRPC API

The agent exposes the following gRPC services:

service AgentService {
  rpc ExecuteTask(ExecuteTaskRequest) returns (ExecuteTaskResponse);
  rpc StreamExecuteTask(ExecuteTaskRequest) returns (stream TaskUpdate);
  rpc GetCapabilities(GetCapabilitiesRequest) returns (GetCapabilitiesResponse);
  rpc HealthCheck(HealthCheckRequest) returns (HealthCheckResponse);
  rpc DiscoverTools(DiscoverToolsRequest) returns (DiscoverToolsResponse);
  rpc GetToolCapability(GetToolCapabilityRequest) returns (GetToolCapabilityResponse);
}

Python-Rust Contract

The Rust agent communicates with Python LLM service via HTTP:

Tool Selection

POST /tools/select
{
  "task": "string",
  "context": {},
  "exclude_dangerous": boolean,
  "max_tools": number
}

Tool Execution

POST /tools/execute
{
  "tool_name": "string",
  "parameters": {}
}

Task Analysis

POST /analyze_task
{
  "query": "string",
  "context": {}
}

Error Handling

Comprehensive error taxonomy using thiserror:

pub enum AgentError {
    ToolNotFound { name: String },
    ToolExecutionFailed { tool: String, reason: String },
    MemoryExhausted { requested: usize, available: usize },
    SandboxViolation { operation: String },
    ConfigurationError(String),
    NetworkError(String),
    // ... 20+ error variants
}

Performance Optimizations

1. Zero-Copy Strings

Using Cow<str> for string handling to avoid unnecessary allocations:

pub fn process_text<'a>(input: &'a str) -> Cow<'a, str>

2. Lazy Initialization

Modern OnceLock pattern for metrics:

static METRICS: OnceLock<HashMap<String, Counter>> = OnceLock::new();

3. Parallel Tool Execution

Concurrent tool execution with tokio:

let futures = tools.iter().map(|tool| executor.execute_tool(tool));
let results = futures::future::join_all(futures).await;

4. Cache-First Architecture

Tool result caching with configurable TTL
LLM response caching for simple queries
Discovery result caching

Security Model

WASI Sandbox Isolation

No network access
Limited filesystem access (read-only /tmp)
Memory limits enforced
CPU usage controlled via fuel metering

Tool Permission System

pub struct ToolCapability {
    pub required_permissions: Vec<String>,
    pub is_dangerous: bool,
    pub requires_confirmation: bool,
}

Input Validation

Parameter schema validation
Size limits on inputs
Timeout protection

Testing Strategy

Unit Tests

Component-level testing
Mock dependencies
Property-based testing for complex logic

Integration Tests

Python-Rust contract validation
End-to-end tool execution
Cache behavior verification
Error handling scenarios

Performance Tests

Benchmark critical paths
Memory usage profiling
Concurrent execution stress tests

Deployment

Docker Container

FROM rust:1.75 as builder
WORKDIR /app
COPY . .
RUN cargo build --release

FROM debian:bookworm-slim
COPY --from=builder /app/target/release/shannon-agent-core /usr/local/bin/
EXPOSE 50051 2113
CMD ["shannon-agent-core"]

Configuration

Environment variables:

RUST_LOG: Logging level
OTEL_EXPORTER_OTLP_ENDPOINT: Tracing endpoint
MEMORY_POOL_SIZE_MB: Memory pool size
WASI_MEMORY_LIMIT_MB: WASI sandbox memory limit
TOOL_CACHE_TTL_SECONDS: Default cache TTL

Health Checks

gRPC health endpoint: :50051/health
Metrics endpoint: :2113/metrics

Migration Guide

From Legacy Patterns

Replace `lazy_static!`

// Old
lazy_static! {
    static ref METRICS: Mutex<HashMap<String, Counter>> = Mutex::new(HashMap::new());
}

// New
static METRICS: OnceLock<Mutex<HashMap<String, Counter>>> = OnceLock::new();

Error Handling

// Old
let result = operation().unwrap();

// New
let result = operation().context("Failed to perform operation")?;

String Operations

// Old
fn process(input: String) -> String

// New
fn process(input: &str) -> Cow<str>

Future Enhancements

WebAssembly Component Model: Support for WASI Preview 2
Distributed Caching: Redis integration for cache sharing
GPU Acceleration: CUDA/ROCm support for ML operations
Multi-Region Support: Geo-distributed agent deployment
Advanced Monitoring: Custom metrics and tracing spans

Contributing

Please follow these guidelines:

Use cargo fmt and cargo clippy before commits
Add tests for new functionality
Update documentation for API changes
Follow error handling best practices
Minimize unnecessary allocations

FilesExpand file tree

agent-core-architecture.md

Latest commit

History

agent-core-architecture.md

File metadata and controls

PolyAgent Agent Core - Architecture Documentation

Overview

Architecture Principles

Component Architecture

Core Components

1. Enforcement Gateway (src/enforcement.rs)

2. Tool System

Tool Registry (src/tool_registry.rs)

Tool Cache (src/tool_cache.rs)

Tool Executor (src/tools.rs)

3. WASI Sandbox (src/wasi_sandbox.rs)

4. Memory Management (src/memory.rs)

5. Configuration (src/config.rs)

6. Observability

Tracing (src/tracing.rs)

Metrics (src/metrics.rs)

API Contracts

gRPC API

Python-Rust Contract

Tool Selection

Tool Execution

Task Analysis

Error Handling

Performance Optimizations

1. Zero-Copy Strings

2. Lazy Initialization

3. Parallel Tool Execution

4. Cache-First Architecture

Security Model

WASI Sandbox Isolation

Tool Permission System

Input Validation

Testing Strategy

Unit Tests

Integration Tests

Performance Tests

Deployment

Docker Container

Configuration

Health Checks

Migration Guide

From Legacy Patterns

Replace lazy_static!

Error Handling

String Operations

Future Enhancements

Contributing

License

1. Enforcement Gateway (`src/enforcement.rs`)

Tool Registry (`src/tool_registry.rs`)

Tool Cache (`src/tool_cache.rs`)

Tool Executor (`src/tools.rs`)

3. WASI Sandbox (`src/wasi_sandbox.rs`)

4. Memory Management (`src/memory.rs`)

5. Configuration (`src/config.rs`)

Tracing (`src/tracing.rs`)

Metrics (`src/metrics.rs`)

Replace `lazy_static!`