A flexible agent runtime with sandboxed execution, vLLM integration, and extensible tool system.
Agent-first design where tools (IRC, code execution, web, files) are integrated into a unified agent runtime, rather than building tool-specific bots.
- HTTP API Server: OpenAI-compatible REST API for external integration (port 8080)
- Agent Runtime: Core loop with context management and tool orchestration
- vLLM Integration: GLM-4.5-Air-AWQ-4bit and Qwen3-Next-80B-A3B-Instruct-AWQ-4bit via vLLM Docker containers (port 8000)
- Tool System: Pluggable tools with clean interfaces
- Context Swapping: Different personas/contexts per domain (IRC ambassador, coder, etc.)
- Session Management: Persistent conversation storage with multi-context support
- Harness System: Structured game/task environments (chess, CTF, coding challenges)
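A minimal sketch of how the context-swapping and session-management bullets above could fit together. `CONTEXTS`, `SessionStore`, and the on-disk JSON layout are illustrative assumptions, not the project's actual implementation:

```python
# Illustrative sketch: each domain gets its own persona (system prompt),
# and conversations are persisted per (session_id, context) pair.
# All names here are assumptions for illustration only.
import json
from pathlib import Path

CONTEXTS = {
    "irc-ambassador": "You are a friendly IRC ambassador.",
    "coder": "You are a focused coding assistant.",
}

class SessionStore:
    def __init__(self, root: str = "sessions"):
        self.root = Path(root)
        self.root.mkdir(exist_ok=True)

    def path(self, session_id: str, context: str) -> Path:
        return self.root / f"{session_id}.{context}.json"

    def load(self, session_id: str, context: str) -> list[dict]:
        p = self.path(session_id, context)
        if p.exists():
            return json.loads(p.read_text())
        # A new session starts with the context's persona as system prompt.
        return [{"role": "system", "content": CONTEXTS[context]}]

    def save(self, session_id: str, context: str, messages: list[dict]) -> None:
        self.path(session_id, context).write_text(json.dumps(messages))
```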
- Tool calling: The agent runtime follows the OpenAI tool/function call spec. Assistant messages include `tool_calls`, tool results are fed back as `role: "tool"` messages with a `tool_call_id`, and the loop continues until there are no more tool calls or the iteration cap is reached.
- Prompt sizing: A lightweight estimator trims the oldest turns before sending to vLLM, keeping headroom for completions (default prompt budget ~6k tokens, completion cap ~2k). vLLM still enforces the true `--max-model-len`.
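The prompt-sizing behavior described above can be sketched as follows. The chars/4 heuristic and the function names (`estimate_tokens`, `trim_history`) are assumptions for illustration, not the project's actual estimator:

```python
# Sketch of oldest-first history trimming against a rough token budget.

def estimate_tokens(message: dict) -> int:
    """Very rough token estimate: ~4 characters per token (assumed heuristic)."""
    return max(1, len(message.get("content") or "") // 4)

def trim_history(messages: list[dict], budget: int = 6000) -> list[dict]:
    """Drop the oldest non-system turns until the estimate fits the budget."""
    system = [m for m in messages if m["role"] == "system"]
    turns = [m for m in messages if m["role"] != "system"]
    total = sum(estimate_tokens(m) for m in system + turns)
    while turns and total > budget:
        total -= estimate_tokens(turns.pop(0))  # discard the oldest turn
    return system + turns
```

Keeping the system prompt while dropping old turns preserves the persona even in long conversations; vLLM's own `--max-model-len` remains the hard backstop.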
terrarium-agent/
├── agent/ # Core agent runtime
├── llm/ # vLLM client and prompt management
├── tools/ # Tool implementations (IRC, shell, python, files)
├── config/ # Configuration and context definitions
├── main.py # Entry point
└── requirements.txt
Integrates with terrarium-irc for reading/sending IRC messages and accessing chat history.
Execute shell commands in sandboxed environment.
Execute Python code with resource limits.
Read/write files with access controls.
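One way to run Python code with a wall-clock timeout and best-effort CPU/memory limits, in the spirit of the tool descriptions above. `run_python` and the specific limits are illustrative assumptions, not the project's actual sandbox:

```python
# Sketch: execute untrusted Python in a subprocess with a timeout and,
# on POSIX, rlimits applied in the child before exec.
import subprocess
import sys

def _limit_resources():
    try:
        import resource
        resource.setrlimit(resource.RLIMIT_CPU, (5, 5))        # 5s of CPU time
        resource.setrlimit(resource.RLIMIT_AS, (2**30, 2**30))  # 1 GiB address space
    except Exception:
        pass  # best-effort: limits unavailable on this platform

def run_python(code: str, timeout: float = 10.0) -> str:
    proc = subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True, text=True, timeout=timeout,
        preexec_fn=_limit_resources if sys.platform != "win32" else None,
    )
    return proc.stdout if proc.returncode == 0 else proc.stderr
```

A real sandbox would add filesystem and network isolation (e.g. a container); rlimits alone only bound resource use.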
- Docker + nvidia-container-toolkit (for vLLM)
- NVIDIA GPU: GB10 (Blackwell) or compatible with driver 580+
- Models: Place downloaded models under `models/`:
  - GLM-4.5-Air-AWQ-4bit
  - (optional) Qwen3-Next-80B-A3B-Instruct-AWQ-4bit
# 1. Check model is downloaded
./check_model.sh
# 2. Install Python client dependencies
pip install -r requirements.txt
# 3a. Start GLM (8k context, tool+reasoning parsers)
./start_vllm_docker.sh --num-agents 1 --max-model-len 8192
# (override --gpu-mem if you want a larger/smaller KV pool)
# 3b. Start Qwen3 (long-context)
./start_vllm_docker_qwen3.sh --num-agents 1 --max-model-len 32768 --enforce-eager
# Defaults: dtype bf16, tool parser hermes (reasoning parser unset).
# Increase --max-model-len or --num-agents if you need more parallel contexts;
# reduce --gpu-mem to lower VRAM.
# 4. Choose how to run the agent:
# Option A: HTTP API Server (recommended for external integration)
source venv/bin/activate
python server.py # Starts on http://localhost:8080
# Option B: Interactive chat with persistent sessions
source venv/bin/activate
python chat.py
# Option C: Full agent runtime with tools and harnesses
source venv/bin/activate
python main.py
vLLM scripts and sizing:
- `start_vllm_docker.sh` (GLM) and `start_vllm_docker_qwen3.sh` (Qwen3) accept `--max-model-len` and `--num-agents`; if you omit `--gpu-mem`, the scripts auto-size the KV pool based on those. Lower `--gpu-mem` to reduce VRAM; higher `--max-model-len` trades concurrency for longer prompts.
- Qwen3 defaults: bf16, tool parser `hermes`, `--enforce-eager` on to avoid torch.compile issues in this image.
Documentation:
- QUICKSTART.md - Detailed setup instructions
- INTEGRATION.md - How to integrate external apps (IRC, web, games)
- AGENT_API.md - HTTP API specification
- DOCKER_SETUP.md - Docker setup and troubleshooting
- Copy `systemd/terrarium-agent.service` to `terrarium-agent.service.local` (kept out of git) and update `User`, `Group`, `WorkingDirectory`, and `ExecStart` to match your host.
- Install it with `sudo cp terrarium-agent.service.local /etc/systemd/system/terrarium-agent.service` and reload with `sudo systemctl daemon-reload`.
- Enable/start via `sudo systemctl enable --now terrarium-agent` and tail logs using `sudo journalctl -u terrarium-agent -f`.
- Set secrets or overrides in `/etc/terrarium-agent.env` (optional) and restart the service whenever the environment changes.
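As a hedged illustration, an edited `terrarium-agent.service.local` might look like the following. The user, paths, and venv location are placeholders for your host, not values shipped with the project:

```ini
[Unit]
Description=Terrarium Agent HTTP API server
After=network-online.target

[Service]
User=youruser
Group=youruser
WorkingDirectory=/opt/terrarium-agent
EnvironmentFile=-/etc/terrarium-agent.env
ExecStart=/opt/terrarium-agent/venv/bin/python server.py
Restart=on-failure

[Install]
WantedBy=multi-user.target
```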
See config/ directory for:
- `agent.yaml` - Main agent configuration
- `tools.yaml` - Tool-specific settings
- `contexts/` - Context definitions for different domains
🚧 Early Development - Core architecture being built
- vLLM client integration
- Base tool interface
- IRC tool (wrapping terrarium-irc)
- Agent runtime loop
- Context management
- Sandbox implementation
Terrarium Agent provides an HTTP API for integration with external applications.
For IRC Integration:
- See INTEGRATION.md for complete guide
- Start agent server:
  - Start agent server: `python server.py`
- Make HTTP requests from terrarium-irc to `http://localhost:8080/v1/chat/completions`
- Client manages conversation history (stateless server)
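A minimal client sketch for the stateless pattern above: the caller keeps the full message history and resends it each turn. The `model` value is an assumption; the endpoint is the one documented above:

```python
# Stateless-server client sketch: history lives on the client side.
import json
from urllib import request

API_URL = "http://localhost:8080/v1/chat/completions"  # from the docs above

def build_request(history: list[dict], user_msg: str) -> dict:
    """Append the new user turn and build an OpenAI-style request body."""
    history.append({"role": "user", "content": user_msg})
    return {"model": "terrarium-agent", "messages": history}  # model name assumed

def chat(history: list[dict], user_msg: str) -> str:
    body = json.dumps(build_request(history, user_msg)).encode()
    req = request.Request(API_URL, data=body,
                          headers={"Content-Type": "application/json"})
    with request.urlopen(req) as resp:
        reply = json.load(resp)["choices"][0]["message"]
    history.append(reply)  # keep the assistant turn for the next call
    return reply["content"]
```

Because the server keeps no state, retrying or resuming a conversation is just a matter of replaying the stored history.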
Other Use Cases:
- Web chat applications
- Game environments (harnesses)
- Custom tools and bots
- terrarium-irc - IRC bot (integrates via HTTP API)