OpenCode Auto Research

OpenCode Auto Research is an engineering project that combines a governed autonomous experiment loop with a lightweight local innovation brain.

It is designed for research workflows where you want one outer orchestrator to:

run baseline and candidate experiments,
consult three read-only specialists,
ground proposals in a local paper vault,
learn from keep/discard outcomes,
redirect the search when results underperform,
and keep the whole process traceable through structured artifacts.

Core Capabilities

governed OpenCode experiment loop driven by Sisyphus
three-specialist proposal workflow with Apollo, Athena, and Hermes
Python controller for baseline, tick, resume, stop, and status
local research brain with paper indexing, retrieval, evidence packs, and feedback reweighting
session-level direction memory for multi-round pivot suggestions
deterministic artifacts under experiments/ for reproducibility

System Roles

Sisyphus: only outer-loop orchestrator
sisyphus-junior: only code executor
Prometheus: bootstrap and replanning only
Apollo: exploit-oriented research proposal specialist
Hermes: orthogonal divergence specialist
Athena: attribution and validity guard

Architecture

flowchart TD
    A[Goal config + session state] --> B[Sisyphus orchestrator]
    C[Research brain index + evidence pack] --> B
    B --> D[Apollo exploit proposal]
    B --> E[Hermes orthogonal proposal]
    B --> F[Athena guard + redirect]
    D --> G[Primary hypothesis selection]
    E --> G
    F --> G
    G --> H[sisyphus-junior execution]
    H --> I[Python controller]
    I --> J[Judge + keep/discard]
    J --> K[Feedback + direction memory]
    K --> C
    K --> B

High-level control flow

Sisyphus reads the goal, session state, and research context.
Apollo, Hermes, and Athena produce grounded proposal signals.
One primary hypothesis is selected.
sisyphus-junior is the only agent allowed to change code.
The Python controller executes and judges the round.
Research-brain feedback and direction memory are updated for the next round.

Repository Layout

opencode-auto-research/
├── .opencode/                  # OpenCode commands and local skills
├── configs/                    # Goal and research-brain config
├── experiments/                # Session truth-source artifacts
├── fixtures/                   # Test fixtures, including KB fixtures
├── scripts/                    # Python controller and research-brain scripts
├── src/                        # TypeScript plugin, agents, tools, orchestration
├── tests/                      # Unit, integration, and E2E tests
├── AGENTS.md                   # Project rules and routing guidance
├── GUIDE.md                    # Full setup and operator guide
├── README.md
├── package.json
├── requirements.txt
└── .env.example

Research Brain Workspace

One convenient local layout is:

~/workspace/opencode-auto-research/
├── opencode-auto-research/     # this repository
└── vault/                      # local paper vault or symlink

The default configs/research_brain.yaml points to ../vault, so the engineering repo stays publishable while the paper vault remains local.

Runtime Requirements

Required

Node.js 22+
npm 10+
Python 3.10+

Optional

Remote training server
GPU / ROCm / CUDA

Quick Start

1. Clone

git clone <your-repo-url>
cd opencode-auto-research

2. Install JavaScript dependencies

npm ci

3. Install Python dependencies

python3 -m pip install -r requirements.txt

4. Configure environment variables

cp .env.example .env

Set at least:

KIMI_CODING_API_KEY
KIMI_CODING_BASE_URL
optionally INNOVATION_LOOP_AGENT_MODEL

5. Build and verify

npm run build
npm test

Day-to-Day Usage

Run the controller in mock mode

python3 scripts/innovation_loop.py bootstrap --config configs/goal.yaml --workspace . --mode mock
python3 scripts/innovation_loop.py tick --config configs/goal.yaml --workspace . --mode mock
python3 scripts/innovation_loop.py status --config configs/goal.yaml --workspace . --mode mock

Run research-brain maintenance manually

python3 scripts/kb/organize_vault.py --vault-root ../vault
python3 scripts/kb/standardize_vault_format.py --vault-root ../vault
python3 scripts/kb/fill_figure_notes.py --vault-root ../vault
python3 scripts/kb/build_index.py --vault-root ../vault --workspace-root . --config configs/research_brain.yaml --output-dir experiments/research/index --scaffold-missing --extract-claims

Generate one evidence pack

python3 scripts/kb/retrieve_papers.py --goal configs/goal.yaml --session experiments/session.json --best experiments/best.json --attempts experiments/attempts.jsonl --workspace-root . --config configs/research_brain.yaml --round 1
python3 scripts/kb/make_evidence_pack.py --round 1 --retrieval experiments/research/retrieval-cache/retrieval-round-0001.json --workspace-root . --config configs/research_brain.yaml

OpenCode commands

/innovate-loop
/experiment-init
/experiment-run
/experiment-status
/experiment-bootstrap
/research-context

Scheduler

The current recommended daily maintenance job is daily-research-brain.

Its responsibilities are:

organize the local vault
standardize file naming
fill missing figure notes
rebuild the research-brain index
generate one evidence context when controller state exists

See GUIDE.md for the full operator flow.

Testing and Validation

Full suite

npm test

Build

npm run build

Example focused suites

npm test -- tests/kb/make-evidence-pack.test.ts
npm test -- tests/e2e/research-brain-direction-memory.test.ts

Publication Notes

Do not commit .env or real provider credentials.
Do not commit your full private paper vault unless you explicitly intend to publish it.
Review configs/research_brain.yaml before publishing if your local vault path differs.
Review experiments/ and exclude transient artifacts you do not want in source control.
For a public release, add CONTRIBUTING.md and SECURITY.md to clarify collaboration and disclosure expectations.

What Is Already Verified

The packaged engineering version currently passes:

npm test
npm run build
research-brain retrieval / evidence E2E flows
redirect memory and multi-round pivot tests

Next Reading

GUIDE.md — full environment and operator guide
AGENTS.md — project routing rules
configs/research_brain.yaml — research-brain maintenance configuration
CONTRIBUTING.md — contribution expectations
SECURITY.md — safe disclosure and publishing guidance

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.github		.github
.opencode		.opencode
configs		configs
experiments		experiments
fixtures		fixtures
scripts		scripts
src		src
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
Dockerfile.opencode		Dockerfile.opencode
GUIDE.md		GUIDE.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
docker-compose.yml		docker-compose.yml
dvc.yaml		dvc.yaml
evaluate.py		evaluate.py
opencode.json		opencode.json
package-lock.json		package-lock.json
package.json		package.json
params.yaml		params.yaml
requirements.txt		requirements.txt
tsconfig.json		tsconfig.json

Folders and files

Latest commit

History

Repository files navigation

OpenCode Auto Research

Core Capabilities

System Roles

Architecture

High-level control flow

Repository Layout

Research Brain Workspace

Runtime Requirements

Required

Recommended

Optional

Quick Start

1. Clone

2. Install JavaScript dependencies

3. Install Python dependencies

4. Configure environment variables

5. Build and verify

Day-to-Day Usage

Run the controller in mock mode

Run research-brain maintenance manually

Generate one evidence pack

OpenCode commands

Scheduler

Testing and Validation

Full suite

Build

Example focused suites

Publication Notes

What Is Already Verified

Next Reading

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages