ClampAI Framework Guide

What ClampAI Does

ClampAI is a safety framework for AI agents. It prevents agents from breaking declared rules or exceeding budgets by enforcing those limits at the execution layer rather than in prompts. See the Known Limitations section and docs/VULNERABILITIES.md for the cases where coverage is partial.

The key insight: Safety happens at the execution level, not the prompt level. We let the LLM think freely, but we verify every action before it runs.

The Three Safety Layers

Layer 1: Formal Safety Kernel

The foundation that everything else builds on.

State (Immutable)

States are snapshots of the world. You can't modify them directly - any change creates a new state.

from clampai import State

# Create
state = State({"balance": 100, "deployed": False})

# Read
balance = state.get("balance")  # 100

# "Modify" (creates new state)
new_state = state.with_updates({"balance": 95})
# Original state is still {"balance": 100}

Why this matters: If state is immutable, undo is trivial. We just go back to the previous state.
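To make the undo argument concrete, here is a minimal sketch that does not use ClampAI at all. `FrozenState` and the `history` list are hypothetical stand-ins written for illustration; the real `State` class may be implemented differently.

```python
from types import MappingProxyType


class FrozenState:
    """Hypothetical stand-in for an immutable snapshot (not ClampAI's State)."""

    def __init__(self, data):
        # Copy the input so later changes to the caller's dict cannot leak in.
        self._data = MappingProxyType(dict(data))

    def get(self, key, default=None):
        return self._data.get(key, default)

    def with_updates(self, updates):
        # Build a new snapshot; the current one is left untouched.
        return FrozenState({**self._data, **updates})


# Keep every snapshot; undo is just stepping back in the list.
history = [FrozenState({"balance": 100, "deployed": False})]
history.append(history[-1].with_updates({"balance": 95}))

current = history[-1]    # snapshot with balance 95
history.pop()            # undo: drop the newest snapshot
restored = history[-1]   # snapshot with balance 100
```

Because no snapshot is ever mutated, restoring a prior state needs no inverse computation in this sketch.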

Effects (Declarative)

Actions don't contain code - they describe what changes:

from clampai import Effect

Effect("balance", "decrement", 50)      # balance -= 50
Effect("status", "set", "deployed")     # status = "deployed"
Effect("log", "append", "done")         # log.append("done")
Effect("counter", "increment", 1)       # counter += 1

The consequence: Effects can be verified before any code runs. No surprises, no side effects.
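The following sketch shows one way a declarative effect could be interpreted. `SketchEffect` and `apply_effect` are hypothetical names, and only four of the modes are covered; this is not the library's implementation.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class SketchEffect:
    """Hypothetical (variable, mode, value) triple mirroring the examples above."""
    variable: str
    mode: str
    value: Any


def apply_effect(state: dict, effect: SketchEffect) -> dict:
    """Return a new dict with the effect applied; the input dict is not mutated."""
    new_state = dict(state)
    current = new_state.get(effect.variable)
    if effect.mode == "set":
        new_state[effect.variable] = effect.value
    elif effect.mode == "increment":
        new_state[effect.variable] = (current or 0) + effect.value
    elif effect.mode == "decrement":
        new_state[effect.variable] = (current or 0) - effect.value
    elif effect.mode == "append":
        new_state[effect.variable] = list(current or []) + [effect.value]
    else:
        raise ValueError(f"unsupported mode: {effect.mode}")
    return new_state


before = {"balance": 100, "log": []}
after = apply_effect(before, SketchEffect("balance", "decrement", 50))
after = apply_effect(after, SketchEffect("log", "append", "done"))
```

Since an effect is plain data, a verifier can compute `after` and inspect it without running any agent-supplied code.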

Invariants (Always True)

Safety rules that must hold after every action:

from clampai import Invariant

Invariant(
    name="budget_safe",
    predicate=lambda s: s.get("balance", 0) >= 0,
    description="Balance must never go negative"
)

How it works: Before accepting an action, we simulate it on a copy of the state. If any invariant fails on the simulated result, we reject the action and the original state is left unmodified.
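A minimal sketch of simulate-then-commit, using plain dicts and a hypothetical `try_commit` helper rather than the kernel's real entry point:

```python
def try_commit(state, proposed_changes, invariants):
    """Return (accepted, resulting_state). All names here are illustrative."""
    candidate = {**state, **proposed_changes}   # simulate on a copy
    for _name, predicate in invariants:
        if not predicate(candidate):
            return False, state                 # reject: caller keeps the old state
    return True, candidate


invariants = [("budget_safe", lambda s: s.get("balance", 0) >= 0)]
state = {"balance": 30}

ok_good, state_after_good = try_commit(state, {"balance": 10}, invariants)
ok_bad, state_after_bad = try_commit(state, {"balance": -20}, invariants)
```

The rejected call hands back the very same `state` object, which is the property the paragraph above describes.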

Layer 2: Reference Monitor

Enforces constraints across multiple dimensions.

Information Flow Control (IFC)

Prevents data from flowing to places it shouldn't:

from clampai import DataLabel, SecurityLevel

# Mark sensitive data
secret_label = DataLabel(SecurityLevel.SECRET)
public_label = DataLabel(SecurityLevel.PUBLIC)

# Monitor ensures:
# - Secret data can flow to Secret variables ✓
# - Secret data cannot flow to Public variables ✗
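The flow rule in the comments above can be sketched as an ordering check on levels. The two-level `Level` enum and `can_flow` are hypothetical; the assumption that public data may flow into secret variables is the usual lattice convention and is not stated in this guide.

```python
from enum import IntEnum


class Level(IntEnum):
    """Hypothetical two-point lattice; the real SecurityLevel may have more levels."""
    PUBLIC = 0
    SECRET = 1


def can_flow(source: Level, destination: Level) -> bool:
    # Data may only move to a destination at the same or a higher level.
    return source <= destination


secret_to_secret = can_flow(Level.SECRET, Level.SECRET)
secret_to_public = can_flow(Level.SECRET, Level.PUBLIC)
public_to_secret = can_flow(Level.PUBLIC, Level.SECRET)   # assumed allowed
```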

Resource Barriers

Prevents overconsumption via Control Barrier Functions:

from clampai import ControlBarrierFunction

def budget_h(state):
    """Barrier function: how much budget margin do we have?"""
    remaining = state.get("budget", 0)
    return remaining / 100.0  # Normalized to [0,1]

barrier = ControlBarrierFunction(h=budget_h, alpha=0.1)
# alpha=0.1 means: budget can decay 10% per step, but no faster

What it does: Prevents actions that would erode the resource boundary too fast.
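One common discrete-time form of a barrier condition is h(next) >= (1 - alpha) * h(current). Assuming that is what `alpha` means here (it matches the "10% per step" comment), the check can be sketched as follows. `barrier_allows` is a hypothetical helper, not part of the ClampAI API.

```python
def budget_h(state):
    """Barrier function: remaining budget normalized to [0, 1]."""
    return state.get("budget", 0) / 100.0


def barrier_allows(h, state, next_state, alpha):
    """Discrete-time barrier check: h may shrink by at most a fraction alpha per step."""
    return h(next_state) >= (1.0 - alpha) * h(state)


state = {"budget": 80}
small_spend = {"budget": 75}   # h drops from 0.80 to 0.75; the floor is about 0.72
large_spend = {"budget": 60}   # h drops from 0.80 to 0.60, below the floor

small_ok = barrier_allows(budget_h, state, small_spend, alpha=0.1)
large_ok = barrier_allows(budget_h, state, large_spend, alpha=0.1)
```

Under this reading, a smaller `alpha` forces the agent to approach the boundary more slowly.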

Capture Basins (Forbidden Regions)

Defines areas of state space to avoid:

from clampai import CaptureBasin

CaptureBasin(
    name="bankruptcy",
    is_bad=lambda s: s.get("balance", 0) < 0,
    max_steps=5  # Will reach bad region within 5 steps?
)

Prevention: System actively rejects actions that would lead into these regions.
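The guide does not spell out how `max_steps` is evaluated. One possible reading is that a state is "captured" when the bad region becomes unavoidable within `max_steps`, whatever the agent does. The sketch below implements that reading with hypothetical names and plain functions as actions; the library may use a different definition.

```python
def is_captured(state, actions, is_bad, max_steps):
    """True if the bad region is unavoidable within max_steps (one possible reading)."""
    if is_bad(state):
        return True
    if max_steps == 0 or not actions:
        return False
    # Captured only if every available action leads to a captured state.
    return all(
        is_captured(act(state), actions, is_bad, max_steps - 1) for act in actions
    )


def is_bad(s):
    return s["balance"] < 0


def pay_fee(s):
    return {"balance": s["balance"] - 50}


def wait(s):
    return dict(s)


# With only a forced fee, a balance of 40 goes negative on the next step.
forced = is_captured({"balance": 40}, [pay_fee], is_bad, max_steps=5)
# With a safe alternative available, the bad region can be avoided.
avoidable = is_captured({"balance": 40}, [pay_fee, wait], is_bad, max_steps=5)
```

The recursion is exponential in `max_steps`, so this form only suits small horizons and small action sets.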

Layer 3: Execution Flow

The orchestrator ties it all together:

1. Agent proposes action
   ↓
2. Boundary check
   "Is any variable getting close to a limit?"
   ↓
3. Barrier enforcement
   "Would this enter a forbidden region?"
   ↓
4. Reference monitor
   "Information flow? Resource limits? Can we repair it?"
   ↓
5. Simulate on copy
   "Does invariant hold after this action?"
   ↓
6. Execute (if all pass)
   "Actually apply the effects"
   ↓
7. Record for undo
   "Save what we did, in case we need to reverse it"

If any check fails → action is rejected, state unchanged.
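The flow above can be sketched as an ordered list of checks that stops at the first failure. The check names and the `step` helper are hypothetical and stand in for steps 2 through 7; actions are modeled as plain functions from state to state.

```python
def run_checks(state, action, checks):
    """Run checks in order; return (accepted, reason). Stops at the first failure."""
    for name, check in checks:
        if not check(state, action):
            return False, f"rejected by {name}"
    return True, "all checks passed"


def step(state, action, checks, undo_log):
    accepted, reason = run_checks(state, action, checks)
    if not accepted:
        return state, reason        # state unchanged
    new_state = action(state)       # apply the effects
    undo_log.append(state)          # record the prior snapshot for undo
    return new_state, reason


# Hypothetical checks standing in for the boundary and invariant stages.
checks = [
    ("boundary", lambda s, a: s["balance"] > 0),
    ("invariant", lambda s, a: a(s)["balance"] >= 0),
]


def spend_50(s):
    return {**s, "balance": s["balance"] - 50}


undo_log = []
s0 = {"balance": 80}
s1, msg1 = step(s0, spend_50, checks, undo_log)   # accepted
s2, msg2 = step(s1, spend_50, checks, undo_log)   # rejected by the invariant check
```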

Current Features (v0.4.0)

Boundary Detection

Detects when variables are approaching constraint violations:

from clampai import JacobianFusion

jacobian = JacobianFusion()
report = jacobian.compute_gradients(state)

# Check which variables are near limits
for variable, severity in report.severity_scores.items():
    if severity > 0.7:  # Getting close
        print(f"Warning: {variable} is near its limit")

Severity levels:

0.0-0.3: Safe, plenty of margin
0.3-0.6: Caution, monitor this variable
0.6-0.9: Warning, very close to limit
0.9-1.0: Critical, one small step from violation
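The bands above share their endpoints (0.3, 0.6, 0.9). The sketch below assumes a boundary value belongs to the higher band; `severity_label` is a hypothetical helper and not part of the `JacobianFusion` API.

```python
def severity_label(score: float) -> str:
    """Map a severity score in [0, 1] to one of the four bands listed above."""
    if not 0.0 <= score <= 1.0:
        raise ValueError("severity must be in [0, 1]")
    if score < 0.3:
        return "safe"
    if score < 0.6:
        return "caution"
    if score < 0.9:
        return "warning"
    return "critical"


labels = [severity_label(x) for x in (0.1, 0.3, 0.7, 0.95)]
```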

Enforcement Barriers

Actively prevents dangerous actions:

from clampai import AuthoritativeHJBBarrier

barrier = AuthoritativeHJBBarrier()

# Check if action would be dangerous
check = barrier.check_action_leads_to_danger(
    state=current_state,
    action=proposed_action,
    available_actions=all_actions
)

if not check.safe:
    # Action is rejected, state unchanged
    print(f"Action rejected: {check.reason}")

Task Composition

Combine verified subtasks safely using the SuperTask / TaskComposer API in clampai/operadic_composition.py. Interface compatibility is checked at composition time (OC-1); budget and invariants are inherited by the composed task (OC-2).

# Conceptual sketch: see clampai/operadic_composition.py for full API
# prepare_supertask and deploy_supertask are SuperTask instances built elsewhere
from clampai import Orchestrator, SuperTask, TaskComposer

# Register two independently-verified subtasks
composer = TaskComposer()
composer.register_task(prepare_supertask)   # verified=True in certificate
composer.register_task(deploy_supertask)    # verified=True in certificate

# Compose sequentially — checks interface compatibility
combined = composer.compose_chain(["prepare", "deploy"])

if combined:
    # combined is a new SuperTask with inherited invariants and budget
    engine = Orchestrator(combined.to_task_definition(), llm=your_llm)
    result = engine.run()

The 8 Theorems

ClampAI proves these theorems by construction:

| Theorem | What It Guarantees | How |
| --- | --- | --- |
| T1: Budget Safety | You cannot spend more than your budget | Check before commit |
| T2: Termination | Execution will eventually stop | Budget runs out → no more actions |
| T3: Invariant Preservation | Safety rules always hold | Simulate before commit |
| T4: Monotone Resources | Spending only increases | Non-negative cost assertion |
| T5: Atomicity | Rejected actions don't change anything | Simulate on copy only |
| T6: Trace Integrity | Execution log can't be tampered with | Cryptographic hash chain |
| T7: Rollback Exactness | Undo restores exact prior state | Immutable state + inverse effects |
| T8: Emergency Halt | Designated emergency actions always execute regardless of budget | Emergency action set checked before budget check |

See docs/THEOREMS.md for detailed proofs.
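As an illustration of the mechanism named for T6, here is a minimal hash-chain sketch built on the standard library. The entry layout, the `GENESIS` value, and the function names are assumptions made for this example; ClampAI's actual trace format may differ. Note that a hash chain makes tampering detectable on verification rather than physically impossible.

```python
import hashlib
import json

GENESIS = "0" * 64   # assumed starting value for the first link


def entry_hash(prev_hash: str, payload: dict) -> str:
    # Each hash covers the previous hash plus a canonical encoding of the payload.
    body = json.dumps(payload, sort_keys=True)
    return hashlib.sha256((prev_hash + body).encode("utf-8")).hexdigest()


def append_entry(trace: list, payload: dict) -> None:
    prev_hash = trace[-1]["hash"] if trace else GENESIS
    trace.append({"payload": payload, "hash": entry_hash(prev_hash, payload)})


def verify(trace: list) -> bool:
    prev_hash = GENESIS
    for entry in trace:
        if entry["hash"] != entry_hash(prev_hash, entry["payload"]):
            return False
        prev_hash = entry["hash"]
    return True


trace = []
append_entry(trace, {"action": "spend", "cost": 1.0})
append_entry(trace, {"action": "invest", "cost": 2.0})
valid_before = verify(trace)

trace[0]["payload"]["cost"] = 0.0   # tamper with an earlier entry
valid_after = verify(trace)
```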

Working Example

from clampai import (
    State, Effect, ActionSpec, Invariant, CaptureBasin,
    TaskDefinition, Orchestrator
)

# 1. Define actions
spend_action = ActionSpec(
    id="spend",
    name="Spend money",
    effects=(Effect("balance", "decrement", 50),),
    cost=1.0
)

invest_action = ActionSpec(
    id="invest",
    name="Invest money",
    effects=(Effect("invested", "increment", 100),),
    cost=2.0
)

# 2. Define safety rules
safety_rule = Invariant(
    "no_negative_balance",
    lambda s: s.get("balance", 0) >= 0,
    "Balance must never go negative"
)

danger_zone = CaptureBasin(
    "bankruptcy",
    is_bad=lambda s: s.get("balance", 0) < 0,
    max_steps=2
)

# 3. Create task
task = TaskDefinition(
    goal="Invest exactly 100 units",
    initial_state=State({"balance": 150, "invested": 0}),
    available_actions=[spend_action, invest_action],
    invariants=[safety_rule],
    capture_basins=[danger_zone],
    budget=10.0,  # Can't spend more than 10 tokens
    goal_predicate=lambda s: s.get("invested", 0) == 100
)

# 4. Run with LLM
orch = Orchestrator(task, llm=your_llm)
result = orch.run()

print(f"Goal achieved: {result.goal_achieved}")
print(f"Steps: {result.step_count}")
print(f"Budget used: {result.spent}")

# Safety guarantees:
# ✓ Balance never went negative (T3: Invariant Preservation)
# ✓ Never entered bankruptcy zone (Barrier enforcement)
# ✓ Never exceeded 10-token budget (T1: Budget Safety)
# ✓ Reversible actions can be undone exactly (T7: Rollback Exactness)
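The budget arithmetic behind T1 and T2 for this example can be worked through in a few lines. The helpers below are hypothetical, and the step bound assumes every action has a strictly positive cost, which this guide does not state explicitly.

```python
import math


def can_afford(spent: float, cost: float, budget: float) -> bool:
    # T1-style check: evaluated before the action is committed.
    return spent + cost <= budget


def max_steps_bound(budget: float, min_cost: float) -> int:
    # T2-style bound: holds only if every action costs at least min_cost > 0.
    if min_cost <= 0:
        raise ValueError("min_cost must be positive for the bound to hold")
    return math.floor(budget / min_cost)


budget = 10.0
costs = {"spend": 1.0, "invest": 2.0}
bound = max_steps_bound(budget, min(costs.values()))

# Repeating the more expensive action exhausts the budget sooner than the bound.
spent = 0.0
steps = 0
while can_afford(spent, costs["invest"], budget):
    spent += costs["invest"]
    steps += 1
```

With a budget of 10.0 and a cheapest action cost of 1.0, no run in this sketch can exceed 10 committed actions.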

API Quick Reference

State Management

from clampai import State

s = State({"key": value})
s.get("key")                          # Read
s.with_updates({"key": new_value})    # Update (immutable)
s.without_keys(["key"])               # Remove

Actions

from clampai import Effect, ActionSpec

effect = Effect(variable, mode, value)
# Modes: set, increment, decrement, multiply, append, remove, delete

action = ActionSpec(
    id="action_id",
    name="Human name",
    description="What does it do?",
    effects=(effect1, effect2),
    cost=1.0,
    risk_level="low"  # low, medium, high, critical
)

# Simulate without executing
next_state = action.simulate(current_state)

Safety Rules

from clampai import Invariant, CaptureBasin, ControlBarrierFunction

# Rule that must always hold
invariant = Invariant("name", predicate, "description")

# Region to avoid
basin = CaptureBasin("name", is_bad, max_steps=5)

# Smooth constraint boundary
barrier = ControlBarrierFunction(h=function, alpha=0.1)

Execution

from clampai import TaskDefinition, Orchestrator

task = TaskDefinition(
    goal="What should happen",
    initial_state=State({...}),
    available_actions=[...],
    invariants=[...],
    capture_basins=[...],
    budget=100.0,
    goal_predicate=lambda s: condition,
    dependencies={...}
)

orch = Orchestrator(task, llm=your_llm)
result = orch.run()

Key Properties

Non-bypassable — LLM cannot talk its way past the safety kernel
Immutable state — Every change creates a new snapshot
Simulate before commit — Nothing real changes until verification passes
Reversible — Actions declared with reversible=True can be undone exactly via T7 rollback
Formal guarantees — All 8 theorems proven by construction
No token overhead — Safety happens at execution layer, not in prompts

When to Use ClampAI

✓ When you need hard safety guarantees
✓ When actions cost money or have side effects
✓ When you need formal verification
✓ When you can't afford mistakes

✗ When you need extreme performance
✗ When you have no constraints
✗ When safety requirements are vague

Next Steps

Read README.md for overview
Check docs/THEOREMS.md for formal proofs
Review docs/API.md for complete API
Look at examples/ for working code
See docs/VULNERABILITIES.md for limitations

All code is tested: python -m pytest tests/ -v

Real-World Action Guarantees

The formal theorems (T1-T8) operate over declared Effect objects in the kernel's formal model. The table below shows exactly what is and is not guaranteed for each category of real-world action.

| Action Type | Example | T1 Budget | T3 Invariants | T7 Rollback | Mechanism |
| --- | --- | --- | --- | --- | --- |
| Abstract state only | `pages_written += 1` | PROVEN | PROVEN | PROVEN | Full Effect declarations, kernel simulation |
| Real-world with proxy tracking | `emails_sent += 1` proxy for `send_email()` | PROVEN | PROVEN on proxy | PROVEN on proxy | Proxy Effect + EnvironmentProbe reconciliation |
| Real-world, no proxy declared | `http_request()` with no declared effects | PROVEN | NOT APPLICABLE | NOT APPLICABLE | Budget enforced; invariants cannot simulate undeclared effects |
| Opaque shell/code execution | `execute_shell("rm ...")` | PROVEN | PATTERN-MATCH ONLY | NOT APPLICABLE | Pattern classifier feeds ActionSpec; kernel checks declared effects |

The rows where T3 and T7 show NOT APPLICABLE are not bugs - they reflect an honest boundary: ClampAI's kernel can only simulate and invert effects it knows about. Actions that touch the real world without a declared proxy leave the kernel's model intact but create a spec-reality gap that the EnvironmentReconciler (hardening.py:413-469) is designed to detect at the next reconciliation cycle.
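The idea of detecting a spec-reality gap can be sketched as a comparison between the kernel's model and values observed from the environment. `find_drift` and the probe result are hypothetical; this is not the `EnvironmentReconciler` implementation.

```python
def find_drift(model_state: dict, probed: dict) -> dict:
    """Return {key: (model_value, observed_value)} for keys where the two disagree."""
    return {
        key: (model_state.get(key), observed)
        for key, observed in probed.items()
        if model_state.get(key) != observed
    }


model_state = {"emails_sent": 3, "pages_written": 7}
probed = {"emails_sent": 4}   # hypothetical probe result from the real system
drift = find_drift(model_state, probed)
```

Only keys that a probe can actually observe are compared, so variables without a probe stay unchecked in this sketch.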
