
OpenClaw Baseline Evaluation

Use this workflow to generate a repeatable OpenClaw-first baseline snapshot for the v3 learning loop.

If the baseline remains too cold to say anything meaningful about candidate creation or repeated-task intervention, use the higher-signal companion workflow:

Preconditions

  • openclaw CLI is installed and the gateway can load experienceengine
  • ee doctor openclaw reports a healthy host wiring state
  • the current local ExperienceEngine database already contains OpenClaw task data
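The first two preconditions can be checked with a short script. This is a hedged sketch, not part of the tool: it assumes `openclaw` and `ee` are expected on `PATH`, and that `ee doctor openclaw` exits non-zero on an unhealthy wiring state (the exit-code convention is an assumption, not documented here).

```python
import shutil
import subprocess

def check_preconditions():
    """Check the baseline preconditions; return a list of problems found."""
    problems = []
    # Precondition: the openclaw CLI is installed.
    if shutil.which("openclaw") is None:
        problems.append("openclaw CLI not found on PATH")
    # Precondition: ee doctor openclaw reports a healthy host wiring state.
    # Assumption: a non-zero exit code signals an unhealthy state.
    if shutil.which("ee") is None:
        problems.append("ee CLI not found on PATH")
    else:
        result = subprocess.run(["ee", "doctor", "openclaw"], capture_output=True)
        if result.returncode != 0:
            problems.append("ee doctor openclaw reported an unhealthy state")
    return problems
```

The third precondition (existing OpenClaw task data in the local ExperienceEngine database) has no documented probe here, so it is not checked above.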

Command

```shell
ee evaluate openclaw-baseline
```

Optional flags:

```shell
ee evaluate openclaw-baseline --lookback-hours 168
ee evaluate openclaw-baseline --output-dir ./artifacts/evaluations/openclaw/manual-run
```
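If you script the evaluation, the invocation can be assembled programmatically. A minimal sketch: the flag names come from the examples above, but the wrapper function itself is hypothetical, not part of the `ee` CLI.

```python
import shutil
import subprocess

def build_baseline_command(lookback_hours=None, output_dir=None):
    """Assemble the documented evaluate invocation; both flags are optional."""
    cmd = ["ee", "evaluate", "openclaw-baseline"]
    if lookback_hours is not None:
        cmd += ["--lookback-hours", str(lookback_hours)]
    if output_dir is not None:
        cmd += ["--output-dir", output_dir]
    return cmd

# One week of lookback, written to a manual-run directory.
cmd = build_baseline_command(168, "./artifacts/evaluations/openclaw/manual-run")
if shutil.which("ee"):  # only run where the CLI is actually installed
    subprocess.run(cmd, check=True)
```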

Outputs

By default, ExperienceEngine writes local-only artifacts to:

artifacts/evaluations/openclaw/<timestamp>/

Each snapshot contains:

  • summary.json
  • summary.md

What The Snapshot Covers

  • input record totals and outcome distribution
  • injection coverage
  • candidate lifecycle distribution
  • distillation job status distribution
  • node state and feedback distribution
  • latest observed record / candidate / node pointers
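Because `summary.json` is machine-readable, the coverage areas above can be inspected programmatically. A hedged sketch: the file's schema is not documented here, so this only loads the file and lists its top-level keys rather than assuming specific field names.

```python
import json
from pathlib import Path

def load_summary(snapshot_dir):
    """Read summary.json from a snapshot directory.

    The schema is not documented here, so callers should inspect the
    returned dict rather than assume specific field names.
    """
    return json.loads((Path(snapshot_dir) / "summary.json").read_text())

def list_sections(summary):
    """Return the top-level keys of a loaded summary, sorted."""
    return sorted(summary)
```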

How To Use It In The Current WSL Baseline

  1. Run one or more real OpenClaw tasks in the current workspace.
  2. Run:

```shell
ee doctor openclaw
ee evaluate openclaw-baseline
```

  3. Record the generated snapshot path.
  4. Compare later snapshots after distiller/profile/gating changes.
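For the comparison step, a simple top-level diff of two loaded `summary.json` dicts is often enough to spot what moved. This helper is a sketch, not part of ExperienceEngine; because the summary schema is not documented here, it compares whole values per key.

```python
def diff_summaries(before, after):
    """Return {key: (old, new)} for every top-level value that changed."""
    keys = set(before) | set(after)
    return {k: (before.get(k), after.get(k))
            for k in keys
            if before.get(k) != after.get(k)}
```

Run it on the summaries from a pre-change and a post-change snapshot; an empty dict means the distiller/profile/gating change had no visible top-level effect.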

Interpretation Notes

  • This snapshot is a baseline, not a trend report.
  • High injection coverage is not automatically good.
  • A growing discarded candidate count usually means either the gate is too wide or the distiller profile needs work.
  • OpenClaw is the current baseline host. Claude Code and Codex remain regression or reuse hosts for this stage.