Consolidate housekeeper runtime entrypoints by nxs9bg24js-tech · Pull Request #27 · srxly888-creator/autonomous-agent-stack

nxs9bg24js-tech · 2026-03-31T11:39:51Z

This cleanup consolidates the active housekeeper runtime path onto .

Changes:

tighten scope enforcement markers
keep as schema/documentation layer only
add guard coverage to ensure API/dependency wiring continues to use the mainline
document the cleanup and remaining schema follow-up

Validation:

..................... [100%]
21 passed in 4.92s

Non-blocking follow-up:

legacy naming in can be reduced or deprecated further later

…d CI

…strain usage

✅ 新增内容： 1. Self-Evolving Agent 自进化智能体（2,720 字） 2. DyTopo 动态拓扑网络革命（4,121 字） 3. 研究索引 README（1,973 字） 📊 核心发现： - 🚨 80亿参数"绞杀"1200亿参数 - ✅ 突破 Scaling Law 铁律 - 🔄 静态群聊 → 自由交易集市 🎯 技术价值： - 动态拓扑网络架构 - 上下文污染解决方案 - 自进化机制设计 📚 总字数：8,814 字 🔗 来源：YouTube 播放列表深度分析

…losure

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- task_contract: add bidirectional HousekeeperTaskStatus mapping - worker_registry: import worker_status_rank from unified worker_contract replacing inline status rank dict - Add 11 new HousekeeperTaskStatus roundtrip tests Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…nd 2) - Offline demo fixture with success/failure/review flows and demo runner script - Fake Linux and Windows/Yingdao worker adapters for offline simulation - Heartbeat/lease/timeout simulation and 10-category failure taxonomy - Exhaustive illegal state transition test matrix for TaskStatus and RunStatus - Five gate scenario tests and retry/fallback/needs_review rule tests - Enhanced console: task filtering, worker metrics, run detail with logs/artifacts, and approval actions (approve/reject/retry/fallback) - Add pytest-asyncio to requirements.lock for CI async test support - Update CI workflow paths to include all new test and source files Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Acceptance test script: 30 consecutive runs with 5 fault types (timeout, crash, overreach, missing_artifacts, permission_denied) - Weighted deterministic fault selection with ~25% success baseline - Per-fault-type breakdown in console report - JSON machine-readable report output (--json-report) - Pytest wrapper (test_acceptance_harness.py) for CI integration - Updated offline-demo.md with acceptance harness docs, console demo, run detail walkthrough, and review/retry/fallback decision logic - Updated CI to include new test paths Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Linux housekeeper supervisor script and systemd-compatible runner - dispatch_to_linux.sh for remote task dispatch - WORKER_CONTRACT_SPEC.md documenting unified worker interface - CLAUDE.md project instructions for AI-assisted development - .env.linux.example template for Linux worker configuration - Update Makefile with new targets - Update README and linux-remote-worker docs Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

… checklist - 5 acceptance gates (G1-G5): contract models, state machine legality, gate verdict correctness, 30-run harness, API availability - 5 fail-fast conditions (F1-F5): lint/format, import integrity, state machine invariants, gate core rules, acceptance harness - PR merge checklist: code quality, tests, contract sync, docs, architecture - Demo-ready minimum standard (DR1-DR5): offline demos, console interaction, contract tests, CI green, docs complete Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…integration migration plan Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…cklist - OFFLINE_CAPABILITY_AUDIT.md: 11 REAL / 8 OFFLINE-ONLY / 7 FAKE classification - test_acceptance_gaps.py: 8 blind-spot test classes (concurrent dispatch, heartbeat timeout, cancel flow, orphan recovery, artifact validation, approval pre-check, gate after DriverResult, max-concurrent enforcement) - test_worker_contract_compliance.py: 5 contract classes for worker certification (registration, heartbeat, lifecycle, gate, error classification) - WORKER_ONBOARDING_CHECKLIST.md: checklist for certifying new workers - CI paths updated with both new test files 475 tests passing (52 new). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…tput to unified contracts Six bridge functions translate LinuxSupervisorConclusion/TaskSummaryRead/HeartbeatRead into GateOutcome/GateCheck/RunStatus/WorkerHeartbeat/WorkerRegistration without modifying any production service or unified contract model. 43 tests cover all 7 conclusions through the full chain (summary → gate checks → verdict). CI paths and acceptance criteria updated. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…uction path After run_once() returns a LinuxSupervisorTaskSummaryRead, the control plane now calls bridge functions to produce GateOutcome, GateCheck[], RunStatus, and GateVerdict. These are stored in result_payload["gate_evaluation"]. Task status decision logic (summary.success → COMPLETED/FAILED) is unchanged — gate evaluation is observational, proving real Linux paths are constrained by unified contracts. 6 integration tests pass with real subprocess execution. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

8 integration tests proving that real Linux execution through ControlPlaneService produces run_record in result_payload with unified RunStatus/RunRecord compatible data. Tests cover SUCCEEDED, FAILED (timeout, infra_error), NEEDS_REVIEW (unknown conclusion), field consistency with summary, result_data bridge fields, and error_message propagation. CI paths and production path changelog updated. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…te() LINUX_SUPERVISOR branch After gate evaluation, call supervisor_summary_to_run_record() to produce a BridgeRunRecord with unified RunStatus. Store as result_payload["run_record"] with datetime fields serialized to ISO strings. Task status decision logic unchanged. 8 integration tests pass. queued_at/leased_at approximation: started_at from summary.started_at (completed_at from summary.finished_at). No separate lease mechanism exists in LinuxSupervisorService. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

asdict() on frozen dataclass preserves RunStatus enum in the 'status' field. Add explicit .value serialization so result_payload["run_record"]["status"] is always a plain string, consistent with gate_evaluation.run_status. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ACCEPTANCE_CRITERIA.md: Add G7 gate (9 items) for production path integration — gate evaluation, run lifecycle, field consistency. WORKER_GAP_ANALYSIS.md: Mark gate/run as wired, add wiring status table showing 5/7 bridge functions connected, 2 (heartbeat/registration) still unwired. Document queued_at/leased_at approximation limitations. No production code changed. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Update docs to reflect Codex's two-commit heartbeat wiring (86de697 + 6307136): both get_worker_heartbeat() and list_workers() now use the bridge function and return consistent unified WorkerStatus. - ACCEPTANCE_CRITERIA.md: add G8 gate (7 conditions, all implemented) - WORKER_GAP_ANALYSIS.md: add Section 6 heartbeat audit, mark WorkerHeartbeat as fully wired, clarify WorkerRegistration remains unwired - PRODUCTION_PATH_CHANGELOG.md: add heartbeat integration changelog with before/after, test inventory, and known gaps Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Prepare acceptance criteria and gap analysis for upcoming WorkerRegistration wiring. No production code changes. - ACCEPTANCE_CRITERIA.md: add G9 gate (8 conditions) for WorkerRegistration production-path integration - WORKER_GAP_ANALYSIS.md: add Section 7 audit covering legacy vs unified type gap, bridge function metadata loss, minimal wiring point, field mapping, and 5 suggested tests - PRODUCTION_PATH_CHANGELOG.md: no changes needed (already accurate for heartbeat wiring only) Key findings: - WorkerRegistrationRead (8 fields) is a strict subset of WorkerRegistration (14 fields) — 6 unified fields missing - Bridge function discards all metadata from process_status - Codex has started implementation (uncommitted) with 5 tests Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Install gstack as a project-local skill set and register available slash commands in CLAUDE.md. Use /browse for all web browsing tasks. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

T and others added 30 commits March 29, 2026 11:07

fix(runtime): align Python baseline across packaging, doctor, docs an…

53e244b

…d CI

chore(ci): widen quality coverage for gateway and core runtime paths

429aabc

Build manager, audit trail, and Telegram control workflows

578e159

test(gateway): lock Telegram mainline webhook guards as regression cases

b956174

refactor(gateway): mark legacy Telegram webhook as deprecated and con…

e2e7624

…strain usage

docs(devex): add PR review trial rubric and feedback loop

2c1c423

Add Telegram issue intake and manager task dispatch

8fb582c

Harden manager routing and OpenHands execution preflights

94779cc

Harden manager routing and OpenHands stall handling

dccdb1b

Route landing page work to isolated apps surfaces

53e70ad

Harden business-surface promotion validation

8ebffa1

Tighten landing-page scope and retry feedback

085dfd0

Treat adapter stdout as heartbeat signal

3c6efe9

Cover post-write stdout heartbeat stalling

ede643c

Close runner summaries and add Telegram task approval

beb6047

docs(memory): record Telegram --approve chaos run and validator failure

3ed0990

fix(runner): ignore spinner stdout heartbeats

8da012b

docs(memory): record live spinner hang rerun

4c6c604

Merge branch 'codex/openhands-worker-strict'

90957d0

docs(devex): add linux remote worker best practices

1818f9d

fix(runner): stop stdout noise from masking stalled runs

885d94b

docs(memory): verify watchdog process group termination and summary c…

2f30e2d

…losure

fix(telegram): route /start to help command

3143773

docs: clarify cc-switch as local control-plane workstation only

7d30b99

Harden OpenHands Linux fallback behavior

5c438e4

Harden Telegram notifier retries

27fae9b

Add personal OpenClaw housekeeper v0

171ac57

Harden housekeeper boundaries and scope policy

dcfe25c

fix: remove duplicate housekeeper dispatch route

289673b

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Yin and others added 30 commits April 1, 2026 02:06

Fix invalid noqa comment in console router

5608ec5

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Add worker gap analysis: Linux/Win contract gaps, fake vs real diff, …

e9363a0

…integration migration plan Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Normalize linux run_record payload

3f42daf

Wire linux worker heartbeat into registry

86de697

Align linux worker status with heartbeat

6307136

Wire unified worker registration into registry

fd9e720

Update registration wiring gap analysis

9a9fd17

Fix worker wiring status notes

015aa27

Move registration metadata into supervisor bridge

6d3fc6f

Add control-plane worker API integration coverage

f249e9d

Add gstack skills to project

cc37a29

Install gstack as a project-local skill set and register available slash commands in CLAUDE.md. Use /browse for all web browsing tasks. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Project run and gate metadata into task APIs

17045cf

Document control-plane positioning and investor brief

fc4f41f

Route manual gate verdicts into approval

f533c54

Handle post-run gated approvals explicitly

13d3442

Add gated rejection coverage for housekeeper approvals

5c354ea

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consolidate housekeeper runtime entrypoints#27

Consolidate housekeeper runtime entrypoints#27
nxs9bg24js-tech wants to merge 71 commits intomainfrom
codex/consolidate-housekeeper-entrypoints

nxs9bg24js-tech commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

nxs9bg24js-tech commented Mar 31, 2026

Validation:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant