TypeScript LLM API harness — tool execution, streaming, edits, context management.
Heddle gives LLMs the ability to read, write, edit, and search files, run shell commands, and maintain persistent conversation sessions. Built on OpenRouter's OpenAI-compatible API with a headless JSON-over-stdio mode for embedding in other applications.
```shell
bun install

# Add your API key
echo 'OPENROUTER_API_KEY=sk-or-v1-your-key' > .env.local

# Run the interactive CLI
bun run dev
```

The default model is `openrouter/free`. Override it with `HEDDLE_MODEL` or in your config file.
- Bun (runtime, test runner, package manager)
- An OpenRouter API key
Heddle uses layered TOML configuration: defaults -> global -> local -> env vars (last wins).
| Location | Purpose |
|---|---|
| `~/.heddle/config.toml` | Global user settings |
| `.heddle/config.toml` | Project-specific overrides |
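The layering can be sketched as a last-wins merge over the file layers, with env vars applied on top (illustrative field subset and helper names; not Heddle's actual loader):

```typescript
// Sketch of layered config resolution: defaults -> global -> local -> env.
// Field and env-var names follow this README; the function is hypothetical.

type HeddleConfig = {
  model?: string;
  api_key?: string;
  temperature?: number;
};

const ENV_OVERRIDES: Record<string, keyof HeddleConfig> = {
  HEDDLE_MODEL: "model",
  OPENROUTER_API_KEY: "api_key",
  HEDDLE_TEMPERATURE: "temperature",
};

function mergeConfig(
  layers: Partial<HeddleConfig>[],
  env: Record<string, string | undefined>,
): HeddleConfig {
  // Later layers overwrite earlier ones; undefined values are skipped.
  const merged: HeddleConfig = {};
  for (const layer of layers) {
    for (const [k, v] of Object.entries(layer)) {
      if (v !== undefined) (merged as any)[k] = v;
    }
  }
  // Env vars win over every file layer.
  for (const [envKey, field] of Object.entries(ENV_OVERRIDES)) {
    const raw = env[envKey];
    if (raw === undefined) continue;
    (merged as any)[field] = field === "temperature" ? Number(raw) : raw;
  }
  return merged;
}
```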
```toml
# ~/.heddle/config.toml
model = "anthropic/claude-sonnet-4"
weak_model = "openrouter/free"
api_key = "sk-or-v1-..."
system_prompt = "You are a helpful coding assistant."
approval_mode = "suggest"
temperature = 0.7
max_tokens = 128000
```

| Field | Type | Default | Description |
|---|---|---|---|
| `model` | string | `openrouter/free` | Primary LLM model |
| `weak_model` | string | — | Weak model for context compaction |
| `editor_model` | string | — | Specialized editing model |
| `api_key` | string | — | OpenRouter API key |
| `base_url` | string | — | Custom API endpoint |
| `max_tokens` | number | — | Token limit |
| `temperature` | number | — | Generation temperature |
| `system_prompt` | string | — | Custom system prompt |
| `approval_mode` | string | — | Permission mode (see Permissions) |
| `instructions` | string[] | — | Additional instruction files to inject |
| `tools` | string[] | — | Allowlist of tools to enable |
| `doom_loop_threshold` | number | 3 | Identical tool call iterations before stopping |
| `budget_limit` | number | — | Cost limit for session |
| `compact_trigger` | number | 0.80 | Context usage ratio that triggers compaction |
| `prune_protect` | number | 40000 | Token window protected from pruning |
| `prune_minimum` | number | 4 | Minimum messages before compaction runs |
| `compact_buffer` | number | 0.50 | Target context usage after compaction |
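The `doom_loop_threshold` guard can be sketched as a check over the trailing tool calls: stop once the same tool is invoked with the same arguments that many times in a row (hypothetical helper, not Heddle's actual code):

```typescript
// Sketch of doom-loop detection. Args are compared as serialized JSON.
type ToolCall = { name: string; args: string };

// Returns true when the last `threshold` calls are identical — the signal
// used to break out of a repeating tool loop.
function isDoomLoop(calls: ToolCall[], threshold = 3): boolean {
  if (calls.length < threshold) return false;
  const tail = calls.slice(-threshold);
  const [first, ...rest] = tail;
  return rest.every((c) => c.name === first.name && c.args === first.args);
}
```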
Feature flags control which capabilities are active. Override them in the `[features]` section of your config:
```toml
[features]
history = false
file_history = false
```

| Flag | Default (interactive) | Description |
|---|---|---|
| `history` | true | Cross-session message history |
| `usage_data` | true | Token usage tracking |
| `facets` | true | Contextual features |
| `file_history` | true | Backup files before write/edit |
| `paste_cache` | true | Paste caching |
| `status_line` | true | Status line display |
| `hooks` | true | Hook execution |
| `tasks` | true | Task tracking |
Feature defaults vary by execution mode — see Execution Modes.
All config fields have env var overrides:
| Variable | Overrides |
|---|---|
| `HEDDLE_MODEL` | `model` |
| `OPENROUTER_API_KEY` | `api_key` |
| `HEDDLE_BASE_URL` | `base_url` |
| `HEDDLE_MAX_TOKENS` | `max_tokens` |
| `HEDDLE_TEMPERATURE` | `temperature` |
| `HEDDLE_WEAK_MODEL` | `weak_model` |
| `HEDDLE_APPROVAL_MODE` | `approval_mode` |
| `HEDDLE_TOOLS` | `tools` (comma-separated) |
| `HEDDLE_HOME` | Global config directory (default `~/.heddle`) |
Start the interactive CLI with `bun run dev`. Type messages to chat with the agent.
| Command | Description |
|---|---|
| `/help` | List available commands |
| `/clear` | Clear conversation context |
| `/exit`, `/quit` | Exit heddle |
| `/cost` | Show token usage and cost |
| `/status` | Show session status |
| `/context` | Show context size estimate |
| `/model [name]` | Switch model or show current |
| `/tools` | List available tools |
| `/history [--limit N] [--search term]` | Show message history |
| `/compact` | Force context compaction |
| `/sessions` | List recent sessions |
| `/name <name>` | Name the current session |
| `/fork` | Fork the current session |
| `/restore <file> [version]` | Restore a file from backup |
| Prefix | Behavior |
|---|---|
| `!command` | Run shell command, print output (not added to context) |
| `!!command` | Run shell command, print output and inject into agent context |
Reference files with `@` to inject their contents into your message:

```
you> Can you refactor @src/config/loader.ts to use async file reads?
[injected] src/config/loader.ts (159 lines)
```
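@-reference expansion can be sketched as a pure string transform, with file reads injected so the logic is testable (hypothetical helper; the `[injected]` line format mirrors the example above):

```typescript
// Replace each @path token with an injected copy of that file's contents.
// Unknown paths are left untouched.
function expandFileRefs(
  message: string,
  readFile: (path: string) => string | undefined,
): string {
  return message.replace(/@([\w./-]+)/g, (match: string, path: string) => {
    const contents = readFile(path);
    if (contents === undefined) return match; // leave unknown refs alone
    const lines = contents.split("\n").length;
    return `[injected] ${path} (${lines} lines)\n${contents}`;
  });
}
```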
The agent has access to 9 built-in tools:
| Tool | Category | Description |
|---|---|---|
| `read_file` | read | Read file contents |
| `write_file` | write | Write/overwrite a file (creates parent dirs) |
| `edit_file` | write | Find-and-replace with fuzzy matching fallback |
| `glob` | read | Find files by glob pattern |
| `grep` | read | Search file contents by regex |
| `bash` | execute | Run shell commands |
| `web_fetch` | network | Fetch and extract content from URLs |
| `ask_user` | read | Ask the user a question (interactive only) |
| `save_memory` | write | Save persistent notes to project memory |
Tools can be filtered via the `tools` config field or the `HEDDLE_TOOLS` env var.
The `edit_file` tool tries four match levels in order, falling back to the next when the current one fails:
- Exact — literal string match
- Whitespace-normalized — collapse runs of whitespace
- Indent-flexible — ignore leading indentation differences
- Line-fuzzy — fuzzy per-line matching
If all levels fail, it reports the closest match and its line number.
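A minimal sketch of the first three levels, under the assumption that whitespace normalization keeps line structure while collapsing interior runs (helper names and exact normalization rules are assumptions, not Heddle's implementation):

```typescript
// Collapse interior runs of spaces/tabs; keep leading indentation.
function normalizeWs(s: string): string {
  return s.split("\n").map((line) => {
    const indent = line.match(/^[ \t]*/)![0];
    return indent + line.slice(indent.length).replace(/[ \t]+/g, " ").trimEnd();
  }).join("\n");
}

// Drop leading indentation entirely.
function stripIndent(s: string): string {
  return s.split("\n").map((line) => line.trimStart()).join("\n");
}

// Returns which level matched, or null when even indent-flexible fails.
function findMatchLevel(
  haystack: string,
  needle: string,
): "exact" | "whitespace" | "indent" | null {
  if (haystack.includes(needle)) return "exact";
  if (normalizeWs(haystack).includes(normalizeWs(needle))) return "whitespace";
  if (stripIndent(normalizeWs(haystack)).includes(stripIndent(normalizeWs(needle))))
    return "indent";
  return null; // the real tool falls through to per-line fuzzy matching
}
```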
Sessions are persisted as JSONL files in `~/.heddle/projects/{encoded-path}/sessions/`.
Pass `resume` in session options to continue an existing session by ID or name:

```typescript
const ctx = await createSession({ resume: "abc123" });
```

Create a new session branched from an existing one:

```typescript
const ctx = await createSession({ fork: "abc123" });
```

Or use `/fork` in the CLI to fork the current session.
- `/sessions` — list recent sessions with message counts and first user message
- `/name my-feature` — name the current session for easy recall
- `/fork` — fork the current session into a new one
Heddle manages context automatically to stay within model limits.
Old tool result messages are replaced with [pruned — original: N chars] placeholders. Recent messages within a protection window are preserved. Pruning is automatic after each agent turn.
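The pruning pass can be sketched as follows, using an illustrative message shape rather than Heddle's actual types:

```typescript
// Replace tool-result content outside the protection window with the
// "[pruned — original: N chars]" placeholder described above.
type Msg = { role: "user" | "assistant" | "tool"; content: string };

function pruneToolResults(messages: Msg[], protectLast: number): Msg[] {
  const cutoff = messages.length - protectLast;
  return messages.map((m, i) =>
    m.role === "tool" && i < cutoff
      ? { ...m, content: `[pruned — original: ${m.content.length} chars]` }
      : m,
  );
}
```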
When context usage exceeds the compact_trigger ratio (default 80%), heddle summarizes older messages using the weak_model into a [Context Summary] anchor message. This preserves key decisions, file paths, and tool outcomes while dramatically reducing token count.
- Automatic: triggers after each agent turn if a `weak_model` is configured
- Manual: use `/compact` to force compaction
- Depth cap: only one level of summarization (existing summaries are included in new ones, not nested)
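The trigger and target arithmetic from the config table (`compact_trigger`, `compact_buffer`) can be sketched as a pair of hypothetical helpers; the real compactor additionally performs the weak-model summarization:

```typescript
// Compaction fires once context usage reaches the trigger ratio.
function shouldCompact(usedTokens: number, contextLimit: number, trigger = 0.8): boolean {
  return usedTokens / contextLimit >= trigger;
}

// Tokens that must be freed to land at the compact_buffer target ratio.
function tokensToFree(usedTokens: number, contextLimit: number, buffer = 0.5): number {
  return Math.max(0, usedTokens - Math.floor(contextLimit * buffer));
}
```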
The `save_memory` tool lets the agent persist notes across sessions. Memory is stored in `MEMORY.md` files:
- Project memory: `~/.heddle/projects/{encoded-path}/memory/MEMORY.md`
- Global memory: `~/.heddle/memory/MEMORY.md`
Both are automatically loaded into the system prompt at session start (global first, then project).
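The load order (global first, then project) can be sketched with file reads injected; paths and joining are illustrative, not Heddle's exact format:

```typescript
// Concatenate whichever memory files exist, global before project,
// for injection into the system prompt.
function buildMemoryPrompt(
  readFile: (path: string) => string | undefined,
  globalPath: string,
  projectPath: string,
): string {
  const parts: string[] = [];
  for (const path of [globalPath, projectPath]) {
    const contents = readFile(path);
    if (contents) parts.push(contents.trim());
  }
  return parts.join("\n\n");
}
```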
Before every write_file or edit_file operation, heddle backs up the file's current contents. Backups are stored per-project using UUID-based versioning:
```
~/.heddle/projects/{path}/file-history/
  meta.json       # maps UUID -> file path + version count
  {uuid}/v1.bak   # first backup
  {uuid}/v2.bak   # second backup (after content changed)
```
- Identical content is deduplicated (no new version if hash matches latest)
- Use `/restore <file>` to list available versions
- Use `/restore <file> <version>` to restore a specific version
- Old backups are cleaned up automatically (100MB default limit, oldest first)
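Hash-based deduplication can be sketched with an in-memory store standing in for the `{uuid}/vN.bak` files (illustrative shapes, not Heddle's actual storage code):

```typescript
import { createHash } from "node:crypto";

type Versions = { hashes: string[]; contents: string[] };

// Write a new version only when the content hash differs from the latest;
// returns the new version number, or null when deduplicated.
function backup(store: Versions, content: string): number | null {
  const hash = createHash("sha256").update(content).digest("hex");
  if (store.hashes.at(-1) === hash) return null; // identical — skip
  store.hashes.push(hash);
  store.contents.push(content);
  return store.hashes.length;
}
```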
The approval_mode setting controls which tool categories require user approval:
| Mode | Read | Network | Write | Execute |
|---|---|---|---|---|
| `plan` | allow | allow | deny | deny |
| `suggest` | allow | allow | ask | ask |
| `auto-edit` | allow | allow | allow | ask |
| `full-auto` | allow | allow | allow | allow |
| `yolo` | allow | allow | allow | allow |
Hardcoded protections (active in all modes):
- Writing to `.env*` files is always denied
- `rm` commands in bash are always denied
When a tool requires approval, the CLI prompts with [y/n/always]. Choosing "always" approves that tool for the rest of the session.
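The mode table plus the hardcoded protections can be sketched as a single decision function (names and the exact protection regexes are illustrative):

```typescript
type Category = "read" | "network" | "write" | "execute";
type Decision = "allow" | "ask" | "deny";

// Direct transcription of the approval-mode table above.
const MODE_TABLE: Record<string, Record<Category, Decision>> = {
  plan:        { read: "allow", network: "allow", write: "deny",  execute: "deny" },
  suggest:     { read: "allow", network: "allow", write: "ask",   execute: "ask" },
  "auto-edit": { read: "allow", network: "allow", write: "allow", execute: "ask" },
  "full-auto": { read: "allow", network: "allow", write: "allow", execute: "allow" },
  yolo:        { read: "allow", network: "allow", write: "allow", execute: "allow" },
};

function checkPermission(
  mode: string,
  category: Category,
  detail: { path?: string; command?: string },
): Decision {
  // Hardcoded protections apply in every mode, even yolo.
  if (category === "write" && detail.path && /(^|\/)\.env/.test(detail.path)) return "deny";
  if (category === "execute" && detail.command && /\brm\b/.test(detail.command)) return "deny";
  return MODE_TABLE[mode]?.[category] ?? "ask"; // unknown mode: be cautious
}
```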
Heddle loads AGENTS.md files by walking up from the working directory to the home directory, plus ~/.heddle/AGENTS.md. Files are concatenated farthest-first into the system prompt, so project-level instructions take precedence.
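The walk-up and ordering can be sketched as pure path logic, with no filesystem access (hypothetical helper; a real loader would then read whichever candidates exist):

```typescript
// Candidate AGENTS.md paths from cwd up to home, plus ~/.heddle/AGENTS.md,
// ordered farthest-first so the project-level file is concatenated last
// and takes precedence.
function agentsFilePaths(cwd: string, home: string): string[] {
  const paths: string[] = [];
  let dir = cwd;
  while (true) {
    paths.push(`${dir}/AGENTS.md`);
    if (dir === home || dir === "/" || !dir.includes("/")) break;
    dir = dir.slice(0, dir.lastIndexOf("/")) || "/";
  }
  paths.push(`${home}/.heddle/AGENTS.md`);
  return paths.reverse(); // farthest-first
}
```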
| Mode | Entry Point | Feature Defaults |
|---|---|---|
| Interactive | `bun run dev` | All features enabled |
| Non-interactive | Scripted / piped input | history=off, statusLine=off |
| Headless | `bun run headless` | history=off, facets=off, statusLine=off, pasteCache=off |
See docs/headless.md for the headless JSON-over-stdio protocol.
```
src/
  types.ts        # Core message/tool types (TypeBox schemas)
  config/         # TOML config loading, paths, feature flags
  provider/       # LLM API client (OpenRouter)
  agent/          # Agent loop (streaming + non-streaming)
  tools/          # Tool implementations + registry
  session/        # JSONL session persistence, resume, fork
  context/        # Pruning + compaction
  memory/         # Agent memory loader
  file-history/   # File backup, restore, cleanup
  history/        # Cross-session message history
  commands/       # Slash command framework
  permissions/    # Tool approval + permission checking
  cost/           # Token cost tracking
  cli/            # Interactive REPL
  headless/       # JSON-over-stdio adapter
  ipc/            # IPC types, codec, protocol versioning
```
Agent loop: Send messages to the LLM. If it responds with tool calls, execute them, append results, and send again. Repeat until the LLM responds with text only. Both streaming (`runAgentLoopStreaming`) and non-streaming (`runAgentLoop`) variants exist.
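A minimal sketch of that loop, with the LLM call and tool executor injected; message and reply shapes here are assumptions, not Heddle's actual types:

```typescript
type ToolCall = { name: string; args: unknown };
type LlmReply = { text?: string; toolCalls?: ToolCall[] };
type Message = { role: "user" | "assistant" | "tool"; content: string };

// Loop: call the LLM, run any requested tools, append results, repeat
// until the reply contains text only.
async function runAgentLoop(
  messages: Message[],
  callLlm: (msgs: Message[]) => Promise<LlmReply>,
  execTool: (call: ToolCall) => Promise<string>,
): Promise<string> {
  while (true) {
    const reply = await callLlm(messages);
    if (!reply.toolCalls?.length) return reply.text ?? "";
    for (const call of reply.toolCalls) {
      // Simplified transcript entries; the real loop records structured
      // tool-call and tool-result messages.
      messages.push({ role: "assistant", content: `call ${call.name}` });
      messages.push({ role: "tool", content: await execTool(call) });
    }
  }
}
```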
TypeBox: Every type is defined once using TypeBox, producing both a TypeScript type and a JSON Schema. Tool parameter schemas double as OpenAI function definitions.
```shell
bun test                   # unit tests
bun run test:integration   # include provider integration tests
bun run test:all           # everything including slow multi-turn tests
bun test test/tools/       # specific directory
bun run tsc --noEmit       # type check
bun run lint               # lint + format (biome, auto-fixes)
```

Integration tests require `HEDDLE_INTEGRATION_TESTS=1` and real API credentials in `.env.test`.
- @sinclair/typebox — TypeScript type + JSON Schema from a single definition
- smol-toml — TOML parser for config files
- @biomejs/biome — Lint + format (dev)
MIT