From f914e082c17200cbf7e735497d62ba82122746f6 Mon Sep 17 00:00:00 2001
From: Copilot
Date: Fri, 3 Apr 2026 06:43:51 +0000
Subject: [PATCH] docs: sync smoke-claude toolset and max-turns after token optimization
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Update docs/test-analysis/ci-smoke.md to reflect changes from commit ac27bc7
(feat(smoke-claude): trim unused tools to reduce token spend ~8%):

- max-turns 15 → 8 (cost-runaway protection)
- Remove cache-memory tool (was removed from workflow)
- Remove edit tool (was removed from workflow)
- Remove MCP container row (no longer in workflow config)
- Update 'What it tests' description to drop MCP server mention

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
---
 docs/test-analysis/ci-smoke.md | 7 +++----
 1 file changed, 3 insertions(+), 4 deletions(-)

diff --git a/docs/test-analysis/ci-smoke.md b/docs/test-analysis/ci-smoke.md
index 9b266123..39d5799c 100644
--- a/docs/test-analysis/ci-smoke.md
+++ b/docs/test-analysis/ci-smoke.md
@@ -163,14 +163,13 @@ These are gh-aw agentic workflows compiled from `.md` source files into `.lock.y
 
 | Attribute | Value |
 |-----------|-------|
-| **What it tests** | Claude Code engine running inside AWF sandbox with MCP servers, Playwright, file I/O, and bash tools |
-| **Engine** | `claude` (max 15 turns) |
+| **What it tests** | Claude Code engine running inside AWF sandbox with GitHub API, Playwright, file I/O, and bash tools |
+| **Engine** | `claude` (max 8 turns) |
 | **Triggers** | Every 12h (schedule), PR (opened/synchronize/reopened), manual dispatch |
 | **Timeout** | 10 minutes |
 | **Network allowed** | defaults, github, playwright |
-| **Tools** | cache-memory, github (repos, pull_requests), playwright, edit, bash |
+| **Tools** | github (repos, pull_requests), playwright, bash |
 | **Safe outputs** | add-comment (hide older), add-labels (smoke-claude) |
-| **MCP** | ghcr.io/github/gh-aw-mcpg container |
 | **Real-world mapping** | Validates that Claude Code can operate within AWF's network sandbox: GitHub API access via MCP, browser automation via Playwright, local file operations — the core use case for agentic workflows |
 | **Gaps** | Non-deterministic (AI agent may behave differently). No HTTPS blocking verification. |
 | **Integration test relationship** | High-level end-to-end complement. Integration tests verify AWF mechanics (iptables, proxy); this verifies an actual AI agent works through the firewall. |