You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
test: E2E tests for plan review report and Codex offering (v0.11.15.0) (garrytan#449)
* chore: regen SKILL.md from template changes
Regenerated via `bun run gen:skill-docs` — was stale from prior
template updates (Codex paths, preamble resolver).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* test: add E2E tests for plan review report and codex offering
- plan-review-report: verifies plan-eng-review writes ## GSTACK REVIEW
REPORT to the bottom of the plan file
- codex-offered-{office-hours,ceo-review,design-review,eng-review}:
verifies each skill has Codex availability check, user prompt, and
fallback behavior (4 concurrent lightweight tests)
- Updated touchfiles and selection count assertion
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* docs: add touchfiles to global touchfile list in CLAUDE.md
The touchfiles.ts file itself is a global touchfile that triggers all
tests when changed, but was missing from the documented list.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* chore: bump version and changelog (v0.11.15.0)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Copy file name to clipboardExpand all lines: CHANGELOG.md
+13Lines changed: 13 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,5 +1,18 @@
1
1
# Changelog
2
2
3
+
## [0.11.15.0] - 2026-03-24 — E2E Test Coverage for Plan Reviews & Codex
4
+
5
+
### Added
6
+
7
+
-**E2E tests verify plan review reports appear at the bottom of plans.** The `/plan-eng-review` review report is now tested end-to-end — if it stops writing `## GSTACK REVIEW REPORT` to the plan file, the test catches it.
8
+
-**E2E tests verify Codex is offered in every plan skill.** Four new lightweight tests confirm that `/office-hours`, `/plan-ceo-review`, `/plan-design-review`, and `/plan-eng-review` all check for Codex availability, prompt the user, and handle the fallback when Codex is unavailable.
9
+
10
+
### For contributors
11
+
12
+
- New E2E tests in `test/skill-e2e-plan.test.ts`: `plan-review-report`, `codex-offered-eng-review`, `codex-offered-ceo-review`, `codex-offered-office-hours`, `codex-offered-design-review`
13
+
- Updated touchfile mappings and selection count assertions
14
+
- Added `touchfiles` to the documented global touchfile list in CLAUDE.md
test('/plan-eng-review writes GSTACK REVIEW REPORT to plan file',async()=>{
592
+
constresult=awaitrunSkillTest({
593
+
prompt: `Read plan-eng-review/SKILL.md for the review workflow.
594
+
595
+
Read plan.md — that's the plan to review. This is a standalone plan document, not a codebase — skip any codebase exploration steps.
596
+
597
+
Proceed directly to the full review. Skip any AskUserQuestion calls — this is non-interactive.
598
+
Skip the preamble bash block, lake intro, telemetry, and contributor mode sections.
599
+
600
+
CRITICAL REQUIREMENT: plan.md IS the plan file for this review session. After completing your review, you MUST write a "## GSTACK REVIEW REPORT" section to the END of plan.md, exactly as described in the "Plan File Review Report" section of SKILL.md. If gstack-review-read is not available or returns NO_REVIEWS, write the placeholder table with all four review rows (CEO, Codex, Eng, Design). Use the Edit tool to append to plan.md — do NOT overwrite the existing plan content.
601
+
602
+
This review report at the bottom of the plan is the MOST IMPORTANT deliverable of this test.`,
0 commit comments