diff --git a/.claude/skills/agent-customization/skill.md b/.claude/skills/agent-customization/skill.md
index 71caa66..bf256b9 100644
--- a/.claude/skills/agent-customization/skill.md
+++ b/.claude/skills/agent-customization/skill.md
@@ -18,15 +18,23 @@ You are working on **agent customization** — the feature that reads a project'
 - `src/prompts/customize-agents.md` — System prompt telling Claude how to customize agents
 - `src/lib/runner.js` — `runClaude()`, `loadPrompt()`, `parseFileOutput()` shared across commands
 - `src/lib/skill-writer.js` — `writeSkillFiles()` writes parsed output to disk
+- `src/lib/timeout.js` — `resolveTimeout()` for timeout handling (default 300s)
 
 ## Key Concepts
-- **Context gathering:** `gatherProjectContext()` reads CLAUDE.md (truncated at 3000 chars), all `.claude/skills/**/*.md`, and lists `.claude/guidelines/` paths without reading their contents.
-- **Agent discovery:** `findAgents()` recursively walks `.claude/agents/`, reads `.md` files, extracts the `name:` from YAML frontmatter via regex.
-- **Read-only tools:** Claude is invoked with `allowedTools: ['Read', 'Glob', 'Grep']` — no writes allowed during the LLM call.
+- **Context gathering:** `gatherProjectContext()` reads CLAUDE.md (truncated at 3000 chars), all `.claude/skills/**/*.md` in full, and lists `.claude/guidelines/` paths without reading their contents.
+- **Agent discovery:** `findAgents()` recursively walks `.claude/agents/`, reads `.md` files, extracts `name:` via regex — falls back to filename if no frontmatter match.
+- **Read-only tools:** Claude is invoked with `allowedTools: ['Read', 'Glob', 'Grep']` and no maxTokens cap (unlike doc-init which sets per-call limits).
 - **Output parsing:** Claude returns `<file path="...">content</file>` XML tags, parsed by `parseFileOutput()`. Only `.claude/` paths are allowed.
 
 ## Critical Rules
 - **Read-only tools only** — Claude agents never get write tools. All output goes through `parseFileOutput()` → `writeSkillFiles()`.
 - **Context truncation** — CLAUDE.md is capped at 3000 chars to avoid blowing up prompt size. Skills are read in full.
-- **Agent frontmatter** — agents must have `name:` in YAML frontmatter or they won't be discovered by `findAgents()`.
 - **Path safety** — `parseFileOutput()` only allows writes to `.claude/` prefixed paths. Customized agents stay in `.claude/agents/`.
+- **Dry-run support** — `--dry-run` flag previews output without writing. Confirmation prompt shown before writes.
+- **Model override** — `--model` flag passed through to `runClaude()` for model selection.
+
+## References
+- **Patterns:** `.claude/guidelines/claude-runner/patterns.md`
+
+---
+**Last Updated:** 2026-03-28
diff --git a/.claude/skills/base/skill.md b/.claude/skills/base/skill.md
index 14cc8f4..e37c44f 100644
--- a/.claude/skills/base/skill.md
+++ b/.claude/skills/base/skill.md
@@ -34,6 +34,10 @@ CLI entry (`bin/cli.js`) → command handlers (`src/commands/`) → lib modules
 - `src/lib/context-builder.js` — Assembles repo files into prompt-friendly context
 - `src/lib/skill-writer.js` — Writes skill files, generates skill-rules.json, merges settings
 - `src/lib/skill-reader.js` — Parses skill files, frontmatter, activation patterns, keywords
+- `src/lib/diff-helpers.js` — Targeted file diffs and prioritized diff truncation for doc-sync
+- `src/lib/git-helpers.js` — Git repo detection, diff retrieval, log formatting
+- `src/lib/git-hook.js` — Post-commit git hook installation/removal for auto doc-sync
+- `src/lib/timeout.js` — Timeout resolution (`--timeout` flag > `ASPENS_TIMEOUT` env > default)
 - `src/lib/errors.js` — `CliError` class (structured errors caught by CLI top-level handler)
 - `src/prompts/` — Prompt templates with `{{partial}}` and `{{variable}}` substitution
 - `src/templates/` — Bundled agents, commands, hooks, and settings for `aspens add` / `doc init`
@@ -56,4 +60,4 @@ CLI entry (`bin/cli.js`) → command handlers (`src/commands/`) → lib modules
 - `tests/` — Vitest test files
 
 ---
-**Last Updated:** 2026-03-24
+**Last Updated:** 2026-03-28
diff --git a/.claude/skills/claude-runner/skill.md b/.claude/skills/claude-runner/skill.md
index a559a39..7e28cc4 100644
--- a/.claude/skills/claude-runner/skill.md
+++ b/.claude/skills/claude-runner/skill.md
@@ -9,8 +9,9 @@ This skill triggers when editing claude-runner files:
 - `src/lib/runner.js`
 - `src/lib/skill-writer.js`
 - `src/lib/skill-reader.js`
+- `src/lib/timeout.js`
 - `src/prompts/**/*.md`
-- `tests/*extract*`, `tests/*parse*`, `tests/*prompt*`, `tests/*skill-writer*`
+- `tests/*extract*`, `tests/*parse*`, `tests/*prompt*`, `tests/*skill-writer*`, `tests/*skill-mapper*`, `tests/*timeout*`
 
 ---
 
@@ -19,8 +20,9 @@ You are working on the **Claude CLI execution layer** — the bridge between ass
 ## Key Files
 - `src/lib/runner.js` — `runClaude()`, `loadPrompt()`, `parseFileOutput()`, `validateSkillFiles()`, `extractResultFromStream()`
 - `src/lib/skill-writer.js` — `writeSkillFiles()`, `extractRulesFromSkills()`, `generateDomainPatterns()`, `mergeSettings()`
-- `src/lib/skill-reader.js` — `findSkillFiles()`, `parseFrontmatter()`, `parseActivationPatterns()`, `parseKeywords()`
-- `src/prompts/` — Markdown prompt templates; `partials/` subdir holds reusable fragments
+- `src/lib/skill-reader.js` — `findSkillFiles()`, `parseFrontmatter()`, `parseActivationPatterns()`, `parseKeywords()`, `fileMatchesActivation()`, `getActivationBlock()`, `GENERIC_PATH_SEGMENTS`
+- `src/lib/timeout.js` — `resolveTimeout()` — priority: `--timeout` flag > `ASPENS_TIMEOUT` env var > caller fallback
+- `src/prompts/` — Markdown prompt templates; `partials/` subdir holds reusable fragments (`skill-format`, `guideline-format`, `examples`)
 
 ## Key Concepts
 - **Stream-JSON protocol:** `runClaude()` always passes `--verbose --output-format stream-json`. Output is NDJSON: `type: 'result'` has final text + usage; `type: 'assistant'` has text/tool_use blocks; `type: 'user'` has tool_result blocks.
@@ -35,8 +37,9 @@ You are working on the **Claude CLI execution layer** — the bridge between ass
 - **Both `--verbose` and `--output-format stream-json` are required** — omitting either breaks stream parsing.
 - **Path sanitization is non-negotiable** — `sanitizePath()` blocks `..` traversal, absolute paths, and any path not under `.claude/` or exactly `CLAUDE.md`.
 - **Prompt partials resolve before variables** — `{{skill-format}}` resolves to `partials/skill-format.md` first. If no file, falls through to variable substitution.
-- **Timeout auto-scales** — small: 120s, medium: 300s, large: 600s, very-large: 900s. User `--timeout` overrides.
+- **Timeout resolution:** `resolveTimeout(flagValue, fallbackSeconds)` — `--timeout` flag wins, then `ASPENS_TIMEOUT` env, then caller-provided fallback. Size-based defaults (small: 120s, medium: 300s, large: 600s, very-large: 900s) are set by command handlers, not runner.
 - **`mergeSettings` preserves non-aspens hooks** — identifies aspens hooks by `ASPENS_HOOK_MARKERS` (`skill-activation-prompt`, `post-tool-use-tracker`), replaces matching entries, preserves everything else.
+- **Debug mode:** Set `ASPENS_DEBUG=1` to dump raw stream-json to `/tmp/aspens-debug-stream.json`.
 
 ---
-**Last Updated:** 2026-03-24
+**Last Updated:** 2026-03-28
diff --git a/.claude/skills/doc-sync/skill.md b/.claude/skills/doc-sync/skill.md
index 699e4d8..7845839 100644
--- a/.claude/skills/doc-sync/skill.md
+++ b/.claude/skills/doc-sync/skill.md
@@ -8,28 +8,38 @@ description: Incremental skill updater that maps git diffs to affected skills an
 This skill triggers when editing doc-sync-related files:
 - `src/commands/doc-sync.js`
 - `src/prompts/doc-sync.md`
+- `src/prompts/doc-sync-refresh.md`
+- `src/lib/git-helpers.js`
+- `src/lib/diff-helpers.js`
+- `src/lib/git-hook.js`
+
+Keywords: doc-sync, refresh, sync, git-hook
 
 ---
 
 You are working on **doc-sync**, the incremental skill update command (`aspens doc sync`).
 
 ## Key Files
-- `src/commands/doc-sync.js` — Main command: git diff → graph rebuild → skill mapping → Claude update → write
-- `src/prompts/doc-sync.md` — System prompt sent to Claude (uses `{{skill-format}}` partial)
+- `src/commands/doc-sync.js` — Main command: git diff → graph rebuild → skill mapping → Claude update → write. Also contains refresh mode and `skillToDomain()` export.
+- `src/prompts/doc-sync.md` — System prompt for diff-based sync (uses `{{skill-format}}` partial)
+- `src/prompts/doc-sync-refresh.md` — System prompt for `--refresh` mode (full skill review)
+- `src/lib/git-helpers.js` — `isGitRepo()`, `getGitDiff()`, `getGitLog()`, `getChangedFiles()` — git primitives
+- `src/lib/diff-helpers.js` — `getSelectedFilesDiff()`, `buildPrioritizedDiff()`, `truncateDiff()` — diff budgeting
+- `src/lib/git-hook.js` — `installGitHook()` / `removeGitHook()` for post-commit auto-sync
+- `src/lib/context-builder.js` — `buildDomainContext()`, `buildBaseContext()` used by refresh mode
 - `src/lib/runner.js` — `runClaude()`, `loadPrompt()`, `parseFileOutput()` shared across commands
-- `src/lib/skill-writer.js` — `writeSkillFiles()` writes `{ path, content }[]` to disk
-- `src/lib/graph-persistence.js` — `loadGraph()`, `extractSubgraph()`, `formatNavigationContext()` for graph context
+- `src/lib/skill-writer.js` — `writeSkillFiles()`, `extractRulesFromSkills()` for output
 
 ## Key Concepts
 - **Diff-based flow:** Gets `git diff HEAD~N..HEAD` and `git log`, feeds them plus existing skill contents and graph context to Claude.
+- **Refresh mode (`--refresh`):** Skips diff entirely. Reviews every skill against the current codebase. Base skill refreshed first, then domain skills in parallel batches of `PARALLEL_LIMIT` (3). Also refreshes CLAUDE.md and reports uncovered domains.
 - **Graph rebuild on every sync:** Calls `buildRepoGraph` + `persistGraphArtifacts` to keep `.claude/graph.json` fresh. Graph failure is non-fatal.
-- **Graph-aware skill mapping:** `mapChangesToSkills()` checks not just direct file matches but also whether changed files are imported by files matching a skill's activation block.
+- **Graph-aware skill mapping:** `mapChangesToSkills()` checks direct file matches via `fileMatchesActivation()` (from `skill-reader.js`) and also whether changed files are imported by files matching a skill's activation block.
 - **Interactive file picker:** When diff exceeds 80k chars and TTY is available, offers multiselect with skill-relevant files pre-selected.
-- **Prioritized diff:** Skill-relevant files get 60k char budget, everything else gets 20k (80k total). Cuts at `diff --git` boundaries.
-- **Skill mapping:** Matches changed file names and meaningful path segments against `## Activation` sections. Generic segments excluded via `GENERIC_PATH_SEGMENTS`.
+- **Prioritized diff:** `buildPrioritizedDiff()` gives skill-relevant files 60k char budget, everything else 20k (80k total). Cuts at `diff --git` boundaries.
 - **Token optimization:** Affected skills sent in full; non-affected skills send only path + description line.
-- **Diff truncation:** `truncateDiff()` caps at configurable limit, cutting at the last `diff --git` boundary. CLAUDE.md capped at 5,000 chars.
-- **Git hook:** `installGitHook()` creates a `post-commit` hook with 5-minute cooldown lock file. `removeGitHook()` removes via `>>>` / `<<<` markers.
+- **Skill-rules regeneration:** After writing, regenerates `skill-rules.json` via `extractRulesFromSkills()` so hooks see updated activation patterns.
+- **Git hook:** `installGitHook()` creates a `post-commit` hook with 5-minute cooldown lock file (`/tmp/aspens-sync-*.lock` keyed by repo path hash). `removeGitHook()` removes via `>>>` / `<<<` markers.
 - **Force writes:** doc-sync always calls `writeSkillFiles` with `force: true`.
 
 ## Critical Rules
@@ -37,8 +47,10 @@ You are working on **doc-sync**, the incremental skill update command (`aspens d
 - `parseFileOutput` restricts paths to `.claude/` prefix and `CLAUDE.md` exactly — any other path is silently dropped.
 - `getGitDiff` gracefully falls back from N commits to 1 if fewer available. `actualCommits` tracks what was used.
 - The command exits early with `CliError` if `.claude/skills/` doesn't exist.
-- The hook cooldown uses `/tmp/aspens-sync-*.lock` keyed by repo path hash — don't change naming without updating cleanup.
 - `checkMissingHooks()` in `bin/cli.js` warns when skills exist but hooks are missing (pre-0.2.2 installs).
 
+## References
+- **Patterns:** `src/lib/skill-reader.js` — `GENERIC_PATH_SEGMENTS`, `fileMatchesActivation()`, `getActivationBlock()`
+
 ---
-**Last Updated:** 2026-03-24
+**Last Updated:** 2026-03-28
diff --git a/.claude/skills/import-graph/skill.md b/.claude/skills/import-graph/skill.md
index 4e24212..456a2a5 100644
--- a/.claude/skills/import-graph/skill.md
+++ b/.claude/skills/import-graph/skill.md
@@ -9,18 +9,23 @@ This skill triggers when editing import-graph-related files:
 - `src/lib/graph-builder.js`
 - `src/lib/graph-persistence.js`
 - `src/commands/doc-graph.js`
+- `src/templates/hooks/graph-context-prompt.mjs`
+- `src/templates/hooks/graph-context-prompt.sh`
 - `tests/graph-builder.test.js`
+- `tests/graph-persistence.test.js`
 
 ---
 
 You are working on the **import graph system** — static analysis that parses JS/TS and Python source files to produce dependency graphs, plus persistence/query layers for runtime use.
 
 ## Key Files
-- `src/lib/graph-builder.js` — Core graph logic: walk, parse, metrics, ranking, clustering (691 lines)
+- `src/lib/graph-builder.js` — Core graph logic: walk, parse, metrics, ranking, clustering (690 lines)
 - `src/lib/graph-persistence.js` — Serialize, persist, load, subgraph extraction, code-map, graph-index
 - `src/commands/doc-graph.js` — Standalone `aspens doc graph` command
 - `src/lib/scanner.js` — Provides `detectEntryPoints()`, only internal dependency of graph-builder
-- `tests/graph-builder.test.js` — Tests using temp fixture directories
+- `src/templates/hooks/graph-context-prompt.mjs` — Standalone hook mirroring `extractSubgraph` logic
+- `tests/graph-builder.test.js` — Graph builder tests using temp fixture directories
+- `tests/graph-persistence.test.js` — Persistence layer tests
 
 ## Key Concepts
 **graph-builder.js** — `buildRepoGraph(repoPath, languages?)` runs a 9-step pipeline:
@@ -32,17 +37,20 @@ You are working on the **import graph system** — static analysis that parses J
 - `extractSubgraph(graph, filePaths)` returns 1-hop neighborhood of mentioned files + relevant hubs/hotspots/clusters
 - `formatNavigationContext(subgraph)` renders compact markdown (~50 line budget) for prompt injection
 - `extractFileReferences(prompt, graph)` tiered extraction: explicit paths → bare filenames → cluster keywords
-- `generateCodeMap()` standalone overview for graph hook consumption
-- `generateGraphIndex()` tiny inverted index (export names → files, hub basenames, cluster labels)
+- `generateCodeMap()` / `writeCodeMap()` standalone overview for graph hook consumption
+- `generateGraphIndex()` / `saveGraphIndex()` tiny inverted index (export names → files, hub basenames, cluster labels)
 
 ## Critical Rules
 - **`await init` before any `parseJsImports` call.** es-module-lexer requires WASM initialization.
 - **Priority formula is load-bearing:** `fanIn * 3.0 + exportCount * 1.5 + (isEntry ? 10.0 : 0) + churn * 2.0 + (1/(depth+1)) * 1.0`. Downstream consumers depend on this ranking.
 - **All paths are repo-relative strings**, never absolute. Resolution functions convert abs→relative.
-- **Graph artifacts are gitignored** — `ensureGraphGitignore()` adds `.claude/graph.json`, `.claude/graph-index.json`, `.claude/code-map.md` to prevent commit loops.
+- **Graph artifacts are gitignored** — `ensureGraphGitignore()` (internal to persistence) adds `.claude/graph.json`, `.claude/graph-index.json`, `.claude/code-map.md` to prevent commit loops.
 - **Errors are swallowed, not thrown** in graph-builder — parse failures return empty/null. The graph must always complete.
-- **`extractSubgraph` logic is mirrored** in `graph-context-prompt.mjs` (standalone hook, no imports). Keep both in sync.
+- **`extractSubgraph` logic is mirrored** in `graph-context-prompt.mjs` (`buildNeighborhood()`). Keep both in sync.
 - **doc-sync rebuilds graph on every sync** — calls `buildRepoGraph` + `persistGraphArtifacts` to keep it fresh.
 
+## References
+- **Hook mirror:** `src/templates/hooks/graph-context-prompt.mjs`
+
 ---
-**Last Updated:** 2026-03-24
+**Last Updated:** 2026-03-28
diff --git a/.claude/skills/repo-scanning/skill.md b/.claude/skills/repo-scanning/skill.md
index 86e51e2..24aecdc 100644
--- a/.claude/skills/repo-scanning/skill.md
+++ b/.claude/skills/repo-scanning/skill.md
@@ -10,13 +10,15 @@ This skill triggers when editing repo-scanning files:
 - `src/commands/scan.js`
 - `tests/scanner.test.js`
 
+Keywords: scanRepo, detectLanguages, detectFrameworks, detectDomains, detectEntryPoints, health check
+
 ---
 
 You are working on **aspens' repo scanning system** — a fully deterministic analyzer (no LLM calls) that detects languages, frameworks, structure, domains, entry points, size, and health issues for any repository.
 
 ## Key Files
 - `src/lib/scanner.js` — Core `scanRepo()` function and all detection logic (languages, frameworks, structure, domains, entry points, size, health)
-- `src/commands/scan.js` — CLI command that calls `scanRepo()`, optionally builds import graph via `graph-builder.js`, and renders pretty or JSON output
+- `src/commands/scan.js` — CLI command that calls `scanRepo()`, optionally builds import graph via `graph-builder.js`, and renders pretty or JSON output. Contains `formatGraphForDisplay()` which transforms raw graph data into display-ready shape
 - `src/lib/graph-builder.js` — Builds import graph; imports `detectEntryPoints` from scanner. Called by `scanCommand` but graph failure is non-fatal
 - `tests/scanner.test.js` — Uses temporary fixture directories created in `tests/fixtures/scanner/`, cleaned up in `afterAll`
 
@@ -28,6 +30,7 @@ You are working on **aspens' repo scanning system** — a fully deterministic an
 - **extraDomains:** User-specified domains merged via `mergeExtraDomains()` — marked with `userSpecified: true`, resolved against source root then repo root
 - **Source root:** First match of `src`, `app`, `lib`, `server`, `pages` via `findSourceRoot()`
 - **Size estimation:** Lines estimated at ~40 bytes/line from `stat.size`, walk capped at depth 5
+- **Graph is opt-out:** `scanCommand` builds graph by default (`options.graph !== false`); errors are caught and only logged with `--verbose`
 
 ## Critical Rules
 - **`SOURCE_EXTS`**: Only `.py`, `.ts`, `.js`, `.tsx`, `.jsx`, `.rb`, `.go`, `.rs` — adding a language requires updating this set AND the `detectLanguages` indicators
@@ -38,5 +41,8 @@ You are working on **aspens' repo scanning system** — a fully deterministic an
 - **Tests use real filesystem fixtures**, not mocks — create fixtures with `createFixture(name, files)` pattern, always clean up
 - **`detectEntryPoints` is exported** and reused by `graph-builder.js` — changing its signature breaks the graph builder
 
+## References
+- **No guidelines directory** — `.claude/guidelines/` does not exist yet for this domain
+
 ---
-**Last Updated:** 2026-03-21
+**Last Updated:** 2026-03-28
diff --git a/.claude/skills/skill-generation/skill.md b/.claude/skills/skill-generation/skill.md
index ead588e..c6f2db6 100644
--- a/.claude/skills/skill-generation/skill.md
+++ b/.claude/skills/skill-generation/skill.md
@@ -7,44 +7,51 @@ description: LLM-powered generation pipeline for Claude Code skills and CLAUDE.m
 
 This skill triggers when editing skill-generation files:
 - `src/commands/doc-init.js`
-- `src/commands/doc-graph.js`
-- `src/lib/context-builder.js`
 - `src/lib/runner.js`
 - `src/lib/skill-writer.js`
 - `src/lib/skill-reader.js`
+- `src/lib/git-hook.js`
+- `src/lib/timeout.js`
 - `src/prompts/**/*`
 
+Keywords: doc-init, generate skills, discovery agents, chunked generation
+
 ---
 
 You are working on **aspens' skill generation pipeline** — the system that scans repos and uses Claude CLI to generate `.claude/skills/` files, hooks, and `CLAUDE.md`.
 
 ## Key Files
 - `src/commands/doc-init.js` — Main 9-step pipeline: scan → graph → discovery → strategy → mode → generate → validate → write → hooks
-- `src/commands/doc-graph.js` — Standalone graph rebuild command (`aspens doc graph`)
 - `src/lib/runner.js` — `runClaude()`, `loadPrompt()`, `parseFileOutput()`, `validateSkillFiles()`
-- `src/lib/context-builder.js` — Assembles prompt context from scan results, manifests, configs, domain files, git log
 - `src/lib/skill-writer.js` — Writes files, generates `skill-rules.json`, domain bash patterns, merges `settings.json`
 - `src/lib/skill-reader.js` — Parses skill frontmatter, activation patterns, keywords (used by skill-writer)
-- `src/prompts/` — Prompt templates; `discover-domains.md` and `discover-architecture.md` for discovery agents
+- `src/lib/git-hook.js` — `installGitHook()` / `removeGitHook()` for post-commit auto-sync
+- `src/lib/timeout.js` — `resolveTimeout()` for auto-scaled + user-override timeouts
+- `src/prompts/` — `doc-init.md` (base), `doc-init-domain.md`, `doc-init-claudemd.md`, `discover-domains.md`, `discover-architecture.md`
 
 ## Key Concepts
-- **9-step pipeline:** (1) scan + graph (2) parallel discovery agents (3) strategy (4) mode (5) generate (6) validate (7) preview (8) write (9) install hooks
-- **Parallel discovery:** Two agents run via `Promise.all` — domain discovery and architecture analysis — before any user prompt
-- **Generation modes:** `all-at-once` = single Claude call; `chunked` = base + per-domain (up to 3 parallel via `PARALLEL_LIMIT`) + CLAUDE.md; `base-only` = just base skill
-- **`--domains` flag:** Filters which domains to generate in chunked mode; enables `domainsOnly` mode that skips base + CLAUDE.md (for retrying failed domains)
+- **9-step pipeline:** (1) scan + graph (2) parallel discovery agents (3) strategy (4) mode (5) generate (6) validate (7) show files + dry-run (8) write (9) install hooks
+- **Parallel discovery:** Two agents run via `Promise.all` — domain discovery and architecture analysis — before any user prompt. Uses `buildGraphContextForDiscovery()` (local) for targeted graph context per agent.
+- **Generation modes:** `all-at-once` = single Claude call; `chunked` = base + per-domain (up to 3 parallel via `PARALLEL_LIMIT`) + CLAUDE.md; `base-only` = just base skill; `pick` = interactive domain picker (becomes chunked)
+- **`--domains` flag:** Filters which domains to generate in chunked mode; combined with `--mode chunked` enables `domainsOnly` mode that skips base + CLAUDE.md (for retrying failed domains)
 - **`--hooks-only` flag:** Skips generation entirely, just installs/updates hooks from existing skills
-- **Retry logic:** Base skill and CLAUDE.md retry up to 2 times if `parseFileOutput` returns empty (format correction prompt)
-- **Validation:** `validateSkillFiles()` checks for truncation, missing frontmatter, missing sections, bad file path references
+- **`--strategy` flag:** `improve` (read existing, update), `rewrite` (ignore existing), `skip` (only new domains). Interactive prompt if not specified.
+- **Retry logic:** Base skill and CLAUDE.md retry up to 2 times if `parseFileOutput` returns empty (format correction prompt asking for `<file>` tags)
+- **Validation:** `validateSkillFiles()` checks for truncation, missing frontmatter, missing sections, bad file path references. Truncated files are removed from output.
 - **Hook installation (step 9):** Generates `skill-rules.json`, copies hook scripts, generates `post-tool-use-tracker.sh` with domain patterns, merges `settings.json`
-- **Graph context:** `buildGraphContext()` and `buildDomainGraphContext()` inject import graph data into prompts
+- **Local helpers in doc-init.js:** `buildGraphContext()`, `buildDomainGraphContext()`, `buildGraphContextForDiscovery()`, `buildScanSummary()`, `sanitizeInline()`, `tokenTracker`, `autoTimeout()`
+- **Token tracking:** `tokenTracker` aggregates prompt/output/tool-use tokens across all Claude calls; displayed with elapsed time at end of pipeline
 
 ## Critical Rules
 - **Base skill + CLAUDE.md are essential** — pipeline retries automatically with format correction. Domain skill failures are acceptable (user retries with `--domains`).
 - **`improve` strategy preserves hand-written content** — Claude must read existing skills first and not discard human-authored rules.
 - **Discovery runs before user prompt** — domain picker shows Claude-discovered domains, not scanner directory names.
 - **PARALLEL_LIMIT = 3** — domain skills generate in batches of 3 concurrent Claude calls. Base skill always sequential first. CLAUDE.md always sequential last.
-- **Skills must be 35-60 lines** — every line earns its place. No generic advice, no framework documentation.
 - **CliError, not process.exit()** — all error exits throw `CliError`; cancellations `return` early.
 
+## References
+- **Prompts:** `src/prompts/doc-init*.md`, `src/prompts/discover-*.md`
+- **Partials:** `src/prompts/partials/skill-format.md`, `src/prompts/partials/examples.md`
+
 ---
-**Last Updated:** 2026-03-24
+**Last Updated:** 2026-03-28
diff --git a/.claude/skills/skill-rules.json b/.claude/skills/skill-rules.json
index c328f85..a65ee60 100644
--- a/.claude/skills/skill-rules.json
+++ b/.claude/skills/skill-rules.json
@@ -60,6 +60,8 @@
       "filePatterns": [
         "src/lib/runner.js",
         "src/lib/skill-writer.js",
+        "src/lib/skill-reader.js",
+        "src/lib/timeout.js",
         "src/prompts/**/*.md",
         "tests/*extract*"
       ],
@@ -73,6 +75,8 @@
           "prompt",
           "loading",
           "skill-writer",
+          "skill-reader",
+          "timeout",
           "prompts",
           "tests",
           "extract"
@@ -92,12 +96,19 @@
       "alwaysActivate": false,
       "filePatterns": [
         "src/commands/doc-sync.js",
-        "src/prompts/doc-sync.md"
+        "src/prompts/doc-sync.md",
+        "src/prompts/doc-sync-refresh.md",
+        "src/lib/git-helpers.js",
+        "src/lib/diff-helpers.js",
+        "src/lib/git-hook.js"
       ],
       "promptTriggers": {
         "keywords": [
-          "doc",
+          "doc-sync",
+          "refresh",
           "sync",
+          "git-hook",
+          "doc",
           "doc sync",
           "incremental",
           "skill",
@@ -105,8 +116,10 @@
           "maps",
           "diffs",
           "commands",
-          "doc-sync",
-          "prompts"
+          "prompts",
+          "doc-sync-refresh",
+          "git-helpers",
+          "diff-helpers"
         ],
         "intentPatterns": [
           "(create|update|fix|add|modify|change|debug|refactor|implement|build).*doc sync",
@@ -123,7 +136,12 @@
       "alwaysActivate": false,
       "filePatterns": [
         "src/lib/graph-builder.js",
-        "tests/graph-builder.test.js"
+        "src/lib/graph-persistence.js",
+        "src/commands/doc-graph.js",
+        "src/templates/hooks/graph-context-prompt.mjs",
+        "src/templates/hooks/graph-context-prompt.sh",
+        "tests/graph-builder.test.js",
+        "tests/graph-persistence.test.js"
       ],
       "promptTriggers": {
         "keywords": [
@@ -135,8 +153,15 @@
           "builds",
           "dependency",
           "graph-builder",
+          "graph-persistence",
+          "commands",
+          "doc-graph",
+          "templates",
+          "hooks",
+          "graph-context-prompt",
           "tests",
-          "graph-builder.test"
+          "graph-builder.test",
+          "graph-persistence.test"
         ],
         "intentPatterns": [
           "(create|update|fix|add|modify|change|debug|refactor|implement|build).*import graph",
@@ -158,6 +183,12 @@
       ],
       "promptTriggers": {
         "keywords": [
+          "scanRepo",
+          "detectLanguages",
+          "detectFrameworks",
+          "detectDomains",
+          "detectEntryPoints",
+          "health check",
           "repo",
           "scanning",
           "repo scanning",
@@ -172,6 +203,8 @@
           "scanner.test"
         ],
         "intentPatterns": [
+          "(create|update|fix|add|modify|change|debug|refactor|implement|build).*health check",
+          "health check.*(create|update|fix|add|modify|change|debug|refactor|implement|build)",
           "(create|update|fix|add|modify|change|debug|refactor|implement|build).*repo scanning",
           "repo scanning.*(create|update|fix|add|modify|change|debug|refactor|implement|build)",
           "(create|update|fix|add|modify|change|debug|refactor|implement|build).*repo.*scanning"
@@ -186,15 +219,19 @@
       "alwaysActivate": false,
       "filePatterns": [
         "src/commands/doc-init.js",
-        "src/commands/doc-sync.js",
-        "src/commands/customize.js",
-        "src/lib/context-builder.js",
         "src/lib/runner.js",
         "src/lib/skill-writer.js",
+        "src/lib/skill-reader.js",
+        "src/lib/git-hook.js",
+        "src/lib/timeout.js",
         "src/prompts/**/*"
       ],
       "promptTriggers": {
         "keywords": [
+          "doc-init",
+          "generate skills",
+          "discovery agents",
+          "chunked generation",
           "skill",
           "generation",
           "skill generation",
@@ -203,18 +240,20 @@
           "claude",
           "code",
           "commands",
-          "doc-init",
-          "doc-sync",
-          "customize",
-          "context-builder",
           "runner",
           "skill-writer",
+          "skill-reader",
+          "git-hook",
+          "timeout",
           "prompts"
         ],
         "intentPatterns": [
-          "(create|update|fix|add|modify|change|debug|refactor|implement|build).*skill generation",
-          "skill generation.*(create|update|fix|add|modify|change|debug|refactor|implement|build)",
-          "(create|update|fix|add|modify|change|debug|refactor|implement|build).*skill.*generation"
+          "(create|update|fix|add|modify|change|debug|refactor|implement|build).*generate skills",
+          "generate skills.*(create|update|fix|add|modify|change|debug|refactor|implement|build)",
+          "(create|update|fix|add|modify|change|debug|refactor|implement|build).*discovery agents",
+          "discovery agents.*(create|update|fix|add|modify|change|debug|refactor|implement|build)",
+          "(create|update|fix|add|modify|change|debug|refactor|implement|build).*chunked generation",
+          "chunked generation.*(create|update|fix|add|modify|change|debug|refactor|implement|build)"
         ]
       }
     },
@@ -226,27 +265,32 @@
       "alwaysActivate": false,
       "filePatterns": [
         "src/commands/add.js",
-        "src/commands/customize.js",
         "src/templates/**/*"
       ],
       "promptTriggers": {
         "keywords": [
           "template",
+          "add agent",
+          "add command",
+          "add hook",
+          "add skill",
           "library",
           "template library",
           "bundled",
           "agents",
           "commands",
           "hooks",
-          "users",
+          "settings",
           "add",
-          "customize",
           "templates"
         ],
         "intentPatterns": [
-          "(create|update|fix|add|modify|change|debug|refactor|implement|build).*template library",
-          "template library.*(create|update|fix|add|modify|change|debug|refactor|implement|build)",
-          "(create|update|fix|add|modify|change|debug|refactor|implement|build).*template.*library"
+          "(create|update|fix|add|modify|change|debug|refactor|implement|build).*add agent",
+          "add agent.*(create|update|fix|add|modify|change|debug|refactor|implement|build)",
+          "(create|update|fix|add|modify|change|debug|refactor|implement|build).*add command",
+          "add command.*(create|update|fix|add|modify|change|debug|refactor|implement|build)",
+          "(create|update|fix|add|modify|change|debug|refactor|implement|build).*add hook",
+          "add hook.*(create|update|fix|add|modify|change|debug|refactor|implement|build)"
         ]
       }
     }
diff --git a/.claude/skills/template-library/skill.md b/.claude/skills/template-library/skill.md
index 5a9ebf2..a6b7a1a 100644
--- a/.claude/skills/template-library/skill.md
+++ b/.claude/skills/template-library/skill.md
@@ -7,35 +7,40 @@ description: Bundled agents, commands, hooks, and settings that users install vi
 
 This skill triggers when editing template-library files:
 - `src/commands/add.js`
-- `src/commands/customize.js`
 - `src/templates/**/*`
 
+Keywords: template, add agent, add command, add hook, add skill
+
 ---
 
 You are working on the **template library** — bundled agents, slash commands, hooks, and settings that users browse and install into their repos.
 
 ## Key Files
-- `src/commands/add.js` — Core `aspens add <type> [name]` command; copies templates to `.claude/` dirs
-- `src/commands/customize.js` — `aspens customize agents` post-install step; uses Claude to inject project context
-- `src/templates/agents/*.md` — Agent persona templates (9 bundled)
+- `src/commands/add.js` — Core `aspens add <type> [name]` command; copies templates to `.claude/` dirs, scaffolds/generates custom skills
+- `src/templates/agents/*.md` — Agent persona templates (11 bundled)
 - `src/templates/commands/*.md` — Slash command templates (2 bundled)
 - `src/templates/hooks/` — Hook scripts (5 bundled): `skill-activation-prompt.sh/mjs`, `graph-context-prompt.sh/mjs`, `post-tool-use-tracker.sh`
 - `src/templates/settings/settings.json` — Default settings with hook configuration
+- `src/prompts/add-skill.md` — System prompt for LLM-powered skill generation from reference docs
 
 ## Key Concepts
-- **Four resource types for `add`:** `agent` → `.claude/agents`, `command` → `.claude/commands`, `hook` → `.claude/hooks`. Settings installed automatically by `doc init`.
+- **Four resource types for `add`:** `agent` → `.claude/agents`, `command` → `.claude/commands`, `hook` → `.claude/hooks`. A fourth type `skill` is handled separately (not template-based).
+- **Skill subcommand:** `aspens add skill <name>` scaffolds a blank skill template. `--from <file>` generates a skill from a reference doc using Claude (LLM-powered). `--list` shows installed skills.
 - **Hook templates:** `skill-activation-prompt` reads `skill-rules.json` and injects relevant skills into prompts. `graph-context-prompt` loads graph data for code navigation. `post-tool-use-tracker` detects skill domains from file access patterns.
 - **`doc init` hook installation (step 9):** Generates `skill-rules.json` from skills, copies hook files, generates `post-tool-use-tracker.sh` with domain patterns (via `BEGIN/END` markers), merges `settings.json` with backup.
 - **Template discovery:** `listAvailable()` reads template dir, filters `.md`/`.sh` files, regex-parses `name:` and `description:`.
-- **No-overwrite policy:** `addResource()` skips files that already exist via `existsSync` check.
-- **Customize flow:** Reads CLAUDE.md + skills as context → sends each agent through Claude → writes updated agents back. Only supports `agents` target.
+- **No-overwrite policy:** `addResource()` skips files that already exist via `existsSync` check. Same for `addSkillCommand`.
+- **Plan/execute gitignore:** Adding `plan` or `execute` agents auto-adds `dev/` to `.gitignore` for plan storage.
 
 ## Critical Rules
 - Template files **must** contain `name: <value>` and `description: <value>` lines parseable by regex.
 - Only `.md` and `.sh` extensions are discovered by `listAvailable()`. `.mjs` files are copied by `doc init` directly, not by `add`.
 - The templates dir resolves from `src/commands/` via `join(__dirname, '..', 'templates')` — moving `add.js` breaks template resolution.
-- `customize` requires `.claude/agents/` AND either CLAUDE.md or `.claude/skills/` — exits cleanly otherwise.
+- Skill names are sanitized to lowercase alphanumeric + hyphens. Invalid names throw `CliError`.
 - Commands throw `CliError` for expected failures instead of calling `process.exit()`.
 
+## References
+- **Customize flow:** `.claude/skills/agent-customization/skill.md`
+
 ---
-**Last Updated:** 2026-03-24
+**Last Updated:** 2026-03-28
diff --git a/CLAUDE.md b/CLAUDE.md
index 77a01f9..b60ed5f 100644
--- a/CLAUDE.md
+++ b/CLAUDE.md
@@ -28,13 +28,18 @@ src/lib/
   runner.js             # Claude CLI execution, stream-json parsing, file output extraction
   skill-writer.js       # writes skill .md files, generates skill-rules.json, merges settings
   skill-reader.js       # parses skill frontmatter, activation patterns, keywords
+  diff-helpers.js       # git diff parsing and change extraction
+  git-helpers.js        # git operations (log, diff, rev-parse)
+  git-hook.js           # post-commit hook install/uninstall for doc-sync
+  timeout.js            # timeout calculation (auto-scales by repo size)
   errors.js             # CliError class for structured error handling
+src/prompts/            # prompt templates + partials/ subdir for reusable fragments
 src/templates/
-  agents/               # 9 agent templates (.md)
+  agents/               # 11 agent templates (.md)
   commands/             # 2 command templates (.md)
   hooks/                # 5 hook templates (.sh + .mjs)
   settings/             # settings templates
-tests/                   # vitest tests + fixtures
+tests/                  # vitest tests + fixtures
 ```
 
 ## Skills (Claude Code integration)
diff --git a/README.md b/README.md
index 4a6dedd..a509c2f 100644
--- a/README.md
+++ b/README.md
@@ -101,6 +101,7 @@ $ aspens scan .
 |--------|-------------|
 | `--json` | Output as JSON |
 | `--domains <list>` | Additional domains to include (comma-separated) |
+| `--no-graph` | Skip import graph analysis |
 | `--verbose` | Show diagnostic output |
 
 ### `aspens doc init [path]`
@@ -164,6 +165,7 @@ $ aspens doc init .
 | `--mode <mode>` | `all`, `chunked`, or `base-only` (skips interactive prompt) |
 | `--strategy <strategy>` | `improve`, `rewrite`, or `skip` for existing docs (skips interactive prompt) |
 | `--domains <list>` | Additional domains to include (comma-separated) |
+| `--no-graph` | Skip import graph analysis |
 | `--model <model>` | Claude model (e.g., sonnet, opus, haiku) |
 | `--verbose` | Show what Claude is reading in real time |
 
@@ -196,15 +198,26 @@ $ aspens doc sync .
 | Option | Description |
 |--------|-------------|
 | `--commits <n>` | Number of commits to analyze (default: 1) |
+| `--refresh` | Review all skills against current codebase (no git diff needed) |
+| `--no-graph` | Skip import graph analysis |
 | `--install-hook` | Install git post-commit hook for auto-sync |
+| `--remove-hook` | Remove the post-commit hook |
 | `--dry-run` | Preview without writing files |
 | `--timeout <seconds>` | Claude timeout (default: 300) |
 | `--model <model>` | Claude model (e.g., sonnet, opus, haiku) |
 | `--verbose` | Show what Claude is reading in real time |
 
+### `aspens doc graph [path]`
+
+Rebuild the import graph cache. Runs automatically during `doc init` and `doc sync`, but you can trigger it manually.
+
+```bash
+aspens doc graph .
+```
+
 ### `aspens add <type> [name]`
 
-Add individual components from the bundled library.
+Add individual components from the bundled library, or create custom skills.
 
 ```bash
 aspens add agent all              # Add all 9 AI agents
@@ -212,12 +225,16 @@ aspens add agent code-reviewer    # Add a specific agent
 aspens add agent --list           # Browse available agents
 aspens add hook skill-activation  # Add auto-triggering hooks
 aspens add command dev-docs       # Add slash commands
+aspens add skill my-convention    # Scaffold a custom skill
+aspens add skill release --from dev/release.md  # Generate from a reference doc
+aspens add skill --list           # Show existing skills
 ```
 
 | Option | Description |
 |--------|-------------|
 | `--list` | Browse available components |
-| `--force` | Overwrite existing files |
+| `--from <file>` | Generate a skill from a reference document (skills only) |
+| `--force` | Overwrite existing skills |
 
 ### `aspens customize agents`
 
diff --git a/bin/cli.js b/bin/cli.js
index cf0e72f..70ea724 100755
--- a/bin/cli.js
+++ b/bin/cli.js
@@ -70,6 +70,7 @@ function showWelcome() {
     ${pc.yellow('--mode')} ${pc.dim('<mode>')}       all, chunked, base-only     ${pc.yellow('--timeout')} ${pc.dim('<s>')}  Seconds per call
     ${pc.yellow('--strategy')} ${pc.dim('<s>')}    improve, rewrite, skip      ${pc.yellow('--json')}      JSON output (scan)
     ${pc.yellow('--no-hooks')}         Skip hook installation       ${pc.yellow('--hooks-only')}  Update hooks only
+    ${pc.yellow('--no-graph')}         Skip import graph analysis
 
   ${pc.bold('Typical Workflow')}
     ${pc.dim('$')} aspens scan                              ${pc.dim('1. See what\'s in your repo')}
@@ -121,6 +122,7 @@ program
   .option('--json', 'Output as JSON')
   .option('--domains <domains>', 'Additional domains to include (comma-separated)')
   .option('--verbose', 'Show diagnostic output')
+  .option('--no-graph', 'Skip import graph analysis')
   .action(scanCommand);
 
 // Doc commands
@@ -143,6 +145,7 @@ doc
   .option('--verbose', 'Show what Claude is reading/doing in real time')
   .option('--no-hooks', 'Skip hook/rules/settings installation')
   .option('--hooks-only', 'Skip skill generation, just install/update hooks')
+  .option('--no-graph', 'Skip import graph analysis')
   .action(docInitCommand);
 
 doc
@@ -157,6 +160,7 @@ doc
   .option('--timeout <seconds>', 'Claude timeout in seconds', parseTimeout, 300)
   .option('--model <model>', 'Claude model to use (e.g., sonnet, opus, haiku)')
   .option('--verbose', 'Show what Claude is reading/doing in real time')
+  .option('--no-graph', 'Skip import graph analysis')
   .action((path, options) => {
     checkMissingHooks(resolve(path));
     return docSyncCommand(path, options);
diff --git a/src/commands/add.js b/src/commands/add.js
index 4f09812..c95a128 100644
--- a/src/commands/add.js
+++ b/src/commands/add.js
@@ -199,6 +199,27 @@ function addResource(repoPath, resourceType, name, available) {
 
   copyFileSync(sourceFile, targetFile);
   console.log(`  ${pc.green('+')} ${resourceType.targetDir}/${resource.fileName}`);
+
+  // Plan/execute agents need dev/ gitignored for plan storage
+  if (name === 'plan' || name === 'execute') {
+    ensureDevGitignore(repoPath);
+  }
+}
+
+function ensureDevGitignore(repoPath) {
+  try {
+    const gitignorePath = join(repoPath, '.gitignore');
+    if (existsSync(gitignorePath)) {
+      const content = readFileSync(gitignorePath, 'utf8');
+      if (/^dev\/$/m.test(content)) return; // already present
+      writeFileSync(gitignorePath, content.trimEnd() + '\ndev/\n', 'utf8');
+    } else {
+      writeFileSync(gitignorePath, 'dev/\n', 'utf8');
+    }
+    console.log(`  ${pc.green('+')} Added ${pc.cyan('dev/')} to .gitignore (used for plan storage)`);
+  } catch (err) {
+    throw new CliError(`Failed to update .gitignore: ${err.message}. Check file permissions.`);
+  }
 }
 
 // --- Custom skill ---
diff --git a/src/commands/doc-init.js b/src/commands/doc-init.js
index cd4f125..d1bfb81 100644
--- a/src/commands/doc-init.js
+++ b/src/commands/doc-init.js
@@ -37,6 +37,15 @@ function makeClaudeOptions(timeoutMs, verbose, model, spinner) {
   };
 }
 
+// Sanitize markdown for safe inlining inside triple-backtick fences
+const MAX_INLINE_CHARS = 2000;
+function sanitizeInline(content, maxLen = MAX_INLINE_CHARS) {
+  let text = content.length > maxLen ? content.slice(0, maxLen) + '\n\n[... truncated]' : content;
+  // Escape triple backticks so they don't break wrapper fences
+  text = text.replace(/```/g, '` ` `');
+  return text;
+}
+
 // Track token usage across all calls
 const tokenTracker = { promptTokens: 0, toolResultTokens: 0, output: 0, toolUses: 0, calls: 0 };
 
@@ -90,13 +99,15 @@ export async function docInitCommand(path, options) {
 
   // Build import graph
   let repoGraph = null;
-  try {
-    repoGraph = await buildRepoGraph(repoPath, scan.languages);
-    // Persist graph, code-map skill, and index for runtime use
+  if (options.graph !== false) {
     try {
-      persistGraphArtifacts(repoPath, repoGraph);
-    } catch { /* graph persistence failed — non-fatal */ }
-  } catch { /* graph building failed — continue without it */ }
+      repoGraph = await buildRepoGraph(repoPath, scan.languages);
+      // Persist graph, code-map skill, and index for runtime use
+      try {
+        persistGraphArtifacts(repoPath, repoGraph);
+      } catch { /* graph persistence failed — non-fatal */ }
+    } catch { /* graph building failed — continue without it */ }
+  }
 
   scanSpinner.stop(`Scanned ${pc.bold(scan.name)} (${scan.repoType})`);
 
@@ -134,34 +145,30 @@ export async function docInitCommand(path, options) {
     discoverSpinner.start('Discovering domains + analyzing architecture...');
 
     try {
-      const graphContext = buildGraphContext(repoGraph);
-      let hotspotsSection = '';
-      if (repoGraph.hotspots && repoGraph.hotspots.length > 0) {
-        hotspotsSection = '\n\n### Hotspots (high churn)\n';
-        for (const h of repoGraph.hotspots) {
-          hotspotsSection += `- \`${h.path}\` — ${h.churn} changes, ${h.lines} lines\n`;
-        }
-      }
-
       const scanSummary = buildScanSummary(scan);
-      const sharedContext = `\n\n---\n\nRepository: ${repoPath}\n\n${scanSummary}\n\n${graphContext}${hotspotsSection}`;
+
+      // Build targeted graph context for each agent (not the full graph)
+      const domainDiscoveryContext = buildGraphContextForDiscovery(repoGraph, 'domains');
+      const archDiscoveryContext = buildGraphContextForDiscovery(repoGraph, 'architecture');
 
       // Run both discovery agents in parallel
       const [domainsResult, archResult] = await Promise.all([
-        // Agent 1: Domain discovery (focused, fast)
+        // Agent 1: Domain discovery — needs hub files + domain clusters only
         (async () => {
           try {
-            const prompt = loadPrompt('discover-domains') + sharedContext;
+            const context = `\n\n---\n\nRepository: ${repoPath}\n\n${scanSummary}\n\n${domainDiscoveryContext}`;
+            const prompt = loadPrompt('discover-domains') + context;
             const { text, usage } = await runClaude(prompt, makeClaudeOptions(timeoutMs, verbose, model, null));
             trackUsage(usage, prompt.length);
             const match = text.match(/<findings>([\s\S]*?)<\/findings>/);
             return match ? match[1].trim() : null;
           } catch { return null; }
         })(),
-        // Agent 2: Architecture analysis (deeper, reads hub files)
+        // Agent 2: Architecture analysis — needs hub files + rankings + hotspots
         (async () => {
           try {
-            const prompt = loadPrompt('discover-architecture') + sharedContext;
+            const context = `\n\n---\n\nRepository: ${repoPath}\n\n${scanSummary}\n\n${archDiscoveryContext}`;
+            const prompt = loadPrompt('discover-architecture') + context;
             const { text, usage } = await runClaude(prompt, makeClaudeOptions(timeoutMs, verbose, model, null));
             trackUsage(usage, prompt.length);
             const match = text.match(/<findings>([\s\S]*?)<\/findings>/);
@@ -333,7 +340,7 @@ export async function docInitCommand(path, options) {
   let allFiles = [];
 
   if (mode === 'all-at-once') {
-    allFiles = await generateAllAtOnce(repoPath, scan, repoGraph, selectedDomains, timeoutMs, existingDocsStrategy, verbose, model, discoveryFindings);
+    allFiles = await generateAllAtOnce(repoPath, scan, repoGraph, selectedDomains, timeoutMs, existingDocsStrategy, verbose, model, discoveryFindings, !!options.mode);
   } else {
     const domainsOnly = isDomainsOnly; // retrying specific domains — skip base + CLAUDE.md
     allFiles = await generateChunked(repoPath, scan, repoGraph, selectedDomains, mode === 'base-only', timeoutMs, existingDocsStrategy, verbose, model, discoveryFindings, domainsOnly);
@@ -372,11 +379,14 @@ export async function docInitCommand(path, options) {
   }
 
   // Step 7: Show what will be written
+  const shouldForce = options.force || existingDocsStrategy === 'improve' || existingDocsStrategy === 'rewrite';
   console.log();
   p.log.info('Files to write:');
   for (const file of allFiles) {
     const hasIssues = validation.issues?.some(i => i.file === file.path) ?? false;
-    const icon = hasIssues ? pc.yellow('~') : pc.green('+');
+    const willOverwrite = existsSync(join(repoPath, file.path));
+    const willWrite = shouldForce || !willOverwrite || hasIssues;
+    const icon = hasIssues ? pc.yellow('~') : willWrite && willOverwrite ? pc.yellow('~') : willWrite ? pc.green('+') : pc.dim('-');
     console.log(pc.dim('  ') + icon + ' ' + file.path);
   }
   console.log();
@@ -407,7 +417,7 @@ export async function docInitCommand(path, options) {
   // Step 8: Write files
   const writeSpinner = p.spinner();
   writeSpinner.start('Writing files...');
-  const results = writeSkillFiles(repoPath, allFiles, { force: options.force });
+  const results = writeSkillFiles(repoPath, allFiles, { force: shouldForce });
   writeSpinner.stop('Done');
 
   // Summary
@@ -677,6 +687,79 @@ function buildGraphContext(graph) {
   return sections.join('\n');
 }
 
+/**
+ * Build targeted graph context for discovery agents.
+ * Each agent only gets the graph sections it needs, not the full context.
+ * @param {'domains'|'architecture'} mode
+ */
+function buildGraphContextForDiscovery(graph, mode) {
+  if (!graph) return '';
+
+  const sections = ['## Import Graph Analysis\n'];
+
+  // Hub files — both agents need these
+  if (graph.hubs.length > 0) {
+    sections.push('### Hub Files (most depended on — read these first)\n');
+    for (const hub of graph.hubs.slice(0, 10)) {
+      const fileInfo = graph.files[hub.path];
+      sections.push(`- \`${hub.path}\` — ${hub.fanIn} dependents, ${fileInfo?.exportCount || 0} exports, ${fileInfo?.lines || 0} lines`);
+    }
+    sections.push('');
+  }
+
+  if (mode === 'domains') {
+    if (graph.clusters?.components?.length > 0) {
+      sections.push('### Domain Clusters (files that import each other)\n');
+      for (const comp of graph.clusters.components) {
+        if (comp.size <= 1) continue;
+        const fileList = comp.files.slice(0, 10).map(f => `\`${f}\``).join(', ');
+        const more = comp.files.length > 10 ? ` +${comp.files.length - 10} more` : '';
+        sections.push(`- **${comp.label}** (${comp.size} files): ${fileList}${more}`);
+      }
+      sections.push('');
+    }
+  }
+
+  if (mode === 'architecture') {
+    if (graph.ranked?.length > 0) {
+      sections.push('### File Priority Ranking (read in this order)\n');
+      for (const file of graph.ranked.slice(0, 15)) {
+        sections.push(`- \`${file.path}\` — priority ${file.priority.toFixed(1)} (${file.fanIn} dependents, ${file.exportCount} exports, ${file.lines} lines)`);
+      }
+      sections.push('');
+    }
+    if (graph.hotspots && graph.hotspots.length > 0) {
+      sections.push('### Hotspots (high churn)\n');
+      const maxHotspots = 15;
+      for (const h of graph.hotspots.slice(0, maxHotspots)) {
+        sections.push(`- \`${h.path}\` — ${h.churn} changes, ${h.lines} lines`);
+      }
+      if (graph.hotspots.length > maxHotspots) {
+        sections.push(`- ...and ${graph.hotspots.length - maxHotspots} more hotspots`);
+      }
+      sections.push('');
+    }
+  }
+
+  return sections.join('\n');
+}
+
+/**
+ * Produce a 1-line summary of the base skill instead of sending the full text.
+ */
+function summarizeBaseSkill(baseSkillContent, scan) {
+  if (!baseSkillContent) {
+    return '## Base skill reference\nBase skill not yet generated.';
+  }
+  const descMatch = baseSkillContent.match(/description:\s*(.+)/);
+  const desc = descMatch ? descMatch[1].trim() : '';
+  const tech = [
+    ...(scan.languages || []),
+    ...(scan.frameworks || []),
+  ].filter(Boolean).join(', ');
+  return `## Base skill reference\nBase skill covers: ${desc || 'tech stack, commands, conventions'}${tech ? ` [${tech}]` : ''}. See base skill for details — do not duplicate its content in domain skills.`;
+}
+
 function buildDomainGraphContext(graph, domain) {
   if (!graph) return '';
 
@@ -716,14 +799,33 @@ function buildStrategyInstruction(strategy) {
   return '';
 }
 
-async function generateAllAtOnce(repoPath, scan, repoGraph, selectedDomains, timeoutMs, strategy, verbose, model, findings) {
+async function generateAllAtOnce(repoPath, scan, repoGraph, selectedDomains, timeoutMs, strategy, verbose, model, findings, nonInteractive = false) {
   const today = new Date().toISOString().split('T')[0];
   const systemPrompt = loadPrompt('doc-init');
   const scanSummary = buildScanSummary(scan);
   const graphContext = buildGraphContext(repoGraph);
   const strategyNote = buildStrategyInstruction(strategy);
   const findingsSection = findings ? `\n\n## Architecture Analysis (from discovery pass)\n\n${findings}` : '';
-  const fullPrompt = `${systemPrompt}${strategyNote}\n\n---\n\nGenerate skills for this repository at ${repoPath}. Today's date is ${today}.\n\n${scanSummary}\n\n${graphContext}${findingsSection}`;
+
+  // When improving, include existing content so Claude can build on it
+  let existingSection = '';
+  if (strategy === 'improve') {
+    const parts = [];
+    const claudeMdPath = join(repoPath, 'CLAUDE.md');
+    if (existsSync(claudeMdPath)) {
+      const existing = readFileSync(claudeMdPath, 'utf8');
+      parts.push(`### Existing CLAUDE.md\n\`\`\`\n${sanitizeInline(existing)}\n\`\`\``);
+    }
+    const basePath = join(repoPath, '.claude', 'skills', 'base', 'skill.md');
+    if (existsSync(basePath)) {
+      parts.push(`### Existing base skill\n\`\`\`\n${sanitizeInline(readFileSync(basePath, 'utf8'))}\n\`\`\``);
+    }
+    if (parts.length > 0) {
+      existingSection = `\n\n## Existing Docs (improve these — preserve hand-written rules, update what's outdated, add what's missing)\n${parts.join('\n\n')}`;
+    }
+  }
+
+  const fullPrompt = `${systemPrompt}${strategyNote}\n\n---\n\nGenerate skills for this repository at ${repoPath}. Today's date is ${today}.\n\n${scanSummary}\n\n${graphContext}${findingsSection}${existingSection}`;
 
   const claudeSpinner = p.spinner();
   claudeSpinner.start('Exploring repo and generating skills...');
@@ -742,6 +844,14 @@ async function generateAllAtOnce(repoPath, scan, repoGraph, selectedDomains, tim
     claudeSpinner.stop(pc.red('Failed'));
     p.log.error(err.message);
 
+    const isTimeout = /timed out/i.test(err.message);
+
+    // In non-interactive mode or on timeout, auto-fallback to chunked
+    if (nonInteractive || isTimeout) {
+      p.log.info('Falling back to chunked mode (one domain at a time)...');
+      return generateChunked(repoPath, scan, repoGraph, selectedDomains, false, timeoutMs, strategy, verbose, model, findings);
+    }
+
     const retry = await p.confirm({
       message: 'Try chunked mode instead? (one domain at a time)',
       initialValue: true,
@@ -774,8 +884,17 @@ async function generateChunked(repoPath, scan, repoGraph, domains, baseOnly, tim
   const baseSpinner = p.spinner();
   baseSpinner.start('Generating base skill...');
 
+  // When improving, include existing base skill content so Claude can build on it
+  let existingBaseSection = '';
+  if (strategy === 'improve') {
+    const existingBasePath = join(repoPath, '.claude', 'skills', 'base', 'skill.md');
+    if (existsSync(existingBasePath)) {
+      existingBaseSection = `\n\n## Existing Base Skill (improve this — preserve hand-written rules, update what's outdated, add what's missing)\n\`\`\`\n${sanitizeInline(readFileSync(existingBasePath, 'utf8'))}\n\`\`\``;
+    }
+  }
+
   const basePrompt = loadPrompt('doc-init') + strategyNote +
-    `\n\n---\n\nGenerate ONLY the base skill for this repository at ${repoPath} (no domain skills, no CLAUDE.md). Today's date is ${today}.\n\n${scanSummary}\n\n${graphContext}${findingsSection}`;
+    `\n\n---\n\nGenerate ONLY the base skill for this repository at ${repoPath} (no domain skills, no CLAUDE.md). Today's date is ${today}.\n\n${scanSummary}\n\n${graphContext}${findingsSection}${existingBaseSection}`;
 
   try {
     let { text, usage } = await runClaude(basePrompt, makeClaudeOptions(timeoutMs, verbose, model, baseSpinner));
@@ -834,9 +953,23 @@ async function generateChunked(repoPath, scan, repoGraph, domains, baseOnly, tim
           }
         }
 
+        // When improving, include existing domain skill content
+        let existingDomainSection = '';
+        if (strategy === 'improve') {
+          if (domain.name.includes('..') || domain.name.startsWith('/')) {
+            return { domain: domain.name, files: [], success: false };
+          }
+          const existingDomainPath = join(repoPath, '.claude', 'skills', domain.name, 'skill.md');
+          if (existsSync(existingDomainPath)) {
+            existingDomainSection = `\n\n## Existing Skill (improve this — preserve hand-written rules, update what's outdated, add what's missing)\n\`\`\`\n${sanitizeInline(readFileSync(existingDomainPath, 'utf8'))}\n\`\`\``;
+          }
+        }
+
+        const baseRef = summarizeBaseSkill(baseSkillContent, scan);
+
         const domainPrompt = loadPrompt('doc-init-domain', {
           domainName: domain.name,
-        }) + strategyNote + `\n\n---\n\nRepository path: ${repoPath}\nToday's date is ${today}.\n\n## Base skill (for context)\n\`\`\`\n${baseSkillContent || 'Not available'}\n\`\`\`\n\n${domainInfo}\n\n${domainGraph}${domainFindings}`;
+        }) + strategyNote + `\n\n---\n\nRepository path: ${repoPath}\nToday's date is ${today}.\n\n${baseRef}\n\n${domainInfo}\n\n${domainGraph}${domainFindings}${existingDomainSection}`;
 
         try {
           const { text, usage } = await runClaude(domainPrompt, makeClaudeOptions(timeoutMs, verbose, model, null));
@@ -889,8 +1022,17 @@ async function generateChunked(repoPath, scan, repoGraph, domains, baseOnly, tim
       return `- ${f.path} — ${desc}`;
     }).join('\n');
 
+    // When improving, include existing CLAUDE.md so Claude can build on it
+    let existingClaudeMdSection = '';
+    if (strategy === 'improve' && claudeMdExists) {
+      try {
+        const existing = readFileSync(join(repoPath, 'CLAUDE.md'), 'utf8');
+        existingClaudeMdSection = `\n\n## Existing CLAUDE.md (improve this — preserve hand-written rules, update what's outdated, add what's missing)\n\`\`\`\n${sanitizeInline(existing)}\n\`\`\``;
+      } catch { /* non-fatal */ }
+    }
+
     const claudeMdPrompt = loadPrompt('doc-init-claudemd') +
-      `\n\n---\n\nRepository path: ${repoPath}\n\n## Scan Results\nRepo: ${scan.name} (${scan.repoType})\nLanguages: ${scan.languages.join(', ')}\nFrameworks: ${scan.frameworks.join(', ')}\nEntry points: ${scan.entryPoints.join(', ')}\n\n## Generated Skills\n${skillSummaries}`;
+      `\n\n---\n\nRepository path: ${repoPath}\n\n## Scan Results\nRepo: ${scan.name} (${scan.repoType})\nLanguages: ${scan.languages.join(', ')}\nFrameworks: ${scan.frameworks.join(', ')}\nEntry points: ${scan.entryPoints.join(', ')}\n\n## Generated Skills\n${skillSummaries}${existingClaudeMdSection}`;
 
     try {
       let { text, usage } = await runClaude(claudeMdPrompt, makeClaudeOptions(timeoutMs, verbose, model, claudeMdSpinner));
diff --git a/src/commands/doc-sync.js b/src/commands/doc-sync.js
index ea0f3b4..609ba61 100644
--- a/src/commands/doc-sync.js
+++ b/src/commands/doc-sync.js
@@ -83,16 +83,18 @@ export async function docSyncCommand(path, options) {
   // Rebuild graph from current state (keeps graph fresh on every sync)
   let repoGraph = null;
   let graphContext = '';
-  try {
-    const rawGraph = await buildRepoGraph(repoPath, scan.languages);
-    persistGraphArtifacts(repoPath, rawGraph);
-    repoGraph = loadGraph(repoPath);
-    if (repoGraph) {
-      const subgraph = extractSubgraph(repoGraph, changedFiles);
-      graphContext = formatNavigationContext(subgraph);
+  if (options.graph !== false) {
+    try {
+      const rawGraph = await buildRepoGraph(repoPath, scan.languages);
+      persistGraphArtifacts(repoPath, rawGraph);
+      repoGraph = loadGraph(repoPath);
+      if (repoGraph) {
+        const subgraph = extractSubgraph(repoGraph, changedFiles);
+        graphContext = formatNavigationContext(subgraph);
+      }
+    } catch (err) {
+      p.log.warn(`Graph context unavailable — proceeding without it. (${err.message})`);
     }
-  } catch (err) {
-    p.log.warn(`Graph context unavailable — proceeding without it. (${err.message})`);
   }
 
   const affectedSkills = mapChangesToSkills(changedFiles, existingSkills, scan, repoGraph);
@@ -333,14 +335,16 @@ async function refreshAllSkills(repoPath, options) {
 
   // Step 1: Scan + graph
   const scanSpinner = p.spinner();
-  scanSpinner.start('Scanning repo and building import graph...');
+  scanSpinner.start(options.graph !== false ? 'Scanning repo and building import graph...' : 'Scanning repo...');
 
   const scan = scanRepo(repoPath);
-  try {
-    const rawGraph = await buildRepoGraph(repoPath, scan.languages);
-    persistGraphArtifacts(repoPath, rawGraph);
-  } catch (err) {
-    p.log.warn(`Graph build failed — continuing without it. (${err.message})`);
+  if (options.graph !== false) {
+    try {
+      const rawGraph = await buildRepoGraph(repoPath, scan.languages);
+      persistGraphArtifacts(repoPath, rawGraph);
+    } catch (err) {
+      p.log.warn(`Graph build failed — continuing without it. (${err.message})`);
+    }
   }
 
   scanSpinner.stop('Scan complete');
diff --git a/src/commands/scan.js b/src/commands/scan.js
index 511a01d..421c503 100644
--- a/src/commands/scan.js
+++ b/src/commands/scan.js
@@ -14,13 +14,15 @@ export async function scanCommand(path, options) {
   const result = scanRepo(repoPath, { extraDomains });
 
   // Build import graph
-  try {
-    const graph = await buildRepoGraph(repoPath, result.languages);
-    result.graph = formatGraphForDisplay(graph);
-  } catch (err) {
-    // Graph building failed — continue without it
-    if (options.verbose) {
-      console.error(pc.dim(`  Graph building failed: ${err.message}`));
+  if (options.graph !== false) {
+    try {
+      const graph = await buildRepoGraph(repoPath, result.languages);
+      result.graph = formatGraphForDisplay(graph);
+    } catch (err) {
+      // Graph building failed — continue without it
+      if (options.verbose) {
+        console.error(pc.dim(`  Graph building failed: ${err.message}`));
+      }
     }
   }
 
diff --git a/src/prompts/add-skill.md b/src/prompts/add-skill.md
index 42a86ba..e26265d 100644
--- a/src/prompts/add-skill.md
+++ b/src/prompts/add-skill.md
@@ -1,18 +1,10 @@
-You are a skill file generator for Claude Code. Your job is to create a **skill file** from a reference document.
+Create a **skill file** from a reference document. You have Read/Glob/Grep tools for codebase context.
 
 {{skill-format}}
 
 ## Your task
 
-You are given:
-1. A skill name
-2. A reference document containing information about a topic, workflow, or convention
-3. Read-only tools (Read, Glob, Grep) to explore the codebase for more context
-
-**How to work:**
-1. Read the reference document to understand the topic
-2. Use Read/Glob/Grep to find related files, patterns, or conventions in the codebase
-3. Synthesize a skill file that captures the essential knowledge an AI assistant needs
+Read the reference document, then explore the codebase for related files and patterns. Synthesize a skill capturing the essential knowledge an AI assistant needs.
 
 ## Output format
 
diff --git a/src/prompts/customize-agents.md b/src/prompts/customize-agents.md
index 3a43371..bf7485e 100644
--- a/src/prompts/customize-agents.md
+++ b/src/prompts/customize-agents.md
@@ -1,36 +1,11 @@
-You are customizing AI agent definitions for a specific project. Your job is to inject project-specific context into generic agent files so they work much better with this particular codebase.
+Inject project-specific context into a generic agent definition. Read the project's skills and CLAUDE.md, then add:
 
-## Your task
+- **Tech stack** line after the role statement
+- **Key Conventions** (3-5 project-specific bullets)
+- **Actual commands** (replace generic placeholders with real lint/test/build commands)
+- **Actual guideline paths** (replace "check if exists" with real paths)
 
-You are given:
-1. A generic agent definition (works out of the box but isn't project-aware)
-2. The project's skills and CLAUDE.md (the context — tech stack, conventions, patterns)
-
-**Read the project context, then customize the agent by adding:**
-
-- **Tech stack** at the top of the agent body (e.g., "React 19, Next.js 16, TypeScript, Tailwind" or "FastAPI, Python, Pydantic, Supabase")
-- **Key conventions** specific to this project (e.g., "Server Components by default", "layered architecture: API → Services → DB", "all API calls through client.ts")
-- **Actual guideline paths** if `.claude/guidelines/` exists (replace generic "check if exists" with real paths)
-- **Project-specific commands** (actual lint/test/build commands from CLAUDE.md or package.json — not generic placeholders)
-- **Domain-specific checks** relevant to the agent's function:
-  - For code-reviewer: what patterns to enforce in THIS project
-  - For error-resolver: what check commands to run for THIS stack
-  - For refactor-planner: what architecture constraints exist in THIS project
-  - For documentation-architect: what doc standards THIS project follows
-
-**How to customize well (based on proven patterns):**
-- Add a `**Tech Stack:**` line right after the role statement
-- Add a `**Key Conventions:**` section with 3-5 project-specific bullet points
-- Replace "Check `.claude/guidelines/` if it exists" with actual paths if guidelines exist
-- Replace generic "run the check command" with the actual command (e.g., `make check-backend`)
-- Add framework-specific checks where the agent has generic ones (e.g., "Server vs Client Components" for Next.js projects)
-
-**Do NOT:**
-- Rewrite the agent's core logic, methodology, or workflow steps
-- Remove any existing instructions
-- Add product-specific business details (no "this is an AI tutoring platform")
-- Make the agent excessively long — add 10-20 lines of project context, not 100
-- Change the YAML frontmatter (name, description, model, color stay the same)
+Keep it to 10-20 lines of additions. Do NOT rewrite the agent's core logic, remove instructions, change YAML frontmatter, or add business details.
 
 ## Output format
 
diff --git a/src/prompts/discover-architecture.md b/src/prompts/discover-architecture.md
index b412f84..06fde03 100644
--- a/src/prompts/discover-architecture.md
+++ b/src/prompts/discover-architecture.md
@@ -1,22 +1,8 @@
-You are analyzing a codebase's architecture and patterns. You have an import graph with hub files and tools to explore.
-
-**Your ONLY job: understand the architecture, patterns, and critical rules.**
+Analyze this codebase's architecture, patterns, and critical rules. You have an import graph with hub files and Read/Glob/Grep tools.
 
 ## What to find
 
-1. **Architecture** — Read the top 3-5 hub files (most-imported). What's the overall pattern? (MVC? layered? event-driven?)
-2. **Core abstractions** — What are the 3-5 most important modules/classes/types? (from hub files)
-3. **Patterns** — Error handling, config loading, state management, data fetching, testing
-4. **Critical rules** — What breaks if you don't know it? Read hub files for implicit contracts.
-5. **Commands** — Find dev/build/test commands in package.json, Makefile, pyproject.toml
-
-## How to explore
-
-- **Start with hub files** — they're the most important, read them first
-- **Grep** for patterns: error handling (`catch`, `throw`, `Error`), config (`process.env`, `config`)
-- **Read** package.json scripts, Makefile targets
-
-Focus on understanding, not coverage. 5 files read deeply > 20 files skimmed.
+Read the top 3-5 hub files first, then identify: architecture pattern (MVC? layered? event-driven?), core abstractions (3-5 key modules/types), patterns (error handling, config, state, data fetching, testing), critical rules (what breaks if unknown), and dev/build/test commands. Depth over breadth — 5 files read deeply beats 20 skimmed.
 
 ## Output Format
 
diff --git a/src/prompts/discover-domains.md b/src/prompts/discover-domains.md
index 4278fe9..9332a6d 100644
--- a/src/prompts/discover-domains.md
+++ b/src/prompts/discover-domains.md
@@ -1,20 +1,4 @@
-You are discovering the feature domains in a codebase. You have an import graph and tools to explore.
-
-**Your ONLY job: find the real feature domains.** Not directory names — actual product features.
-
-## How to discover domains
-
-1. **Look inside large directories** — `components/`, `features/`, `modules/`, `pages/`, `app/`, `services/`
-   - Use Glob to list subdirectories: `src/components/*/`
-   - Each subdirectory with 3+ files is likely a separate domain
-
-2. **Check the hub files** from the graph — they reveal what the app actually does
-
-3. **Look for feature-specific patterns:**
-   - Hooks: `useAuth`, `useBilling`, `useCourses` → auth, billing, courses
-   - Routes/pages: `app/billing/page.tsx`, `pages/courses/` → billing, courses
-   - Services: `services/payment.ts`, `api/users.ts` → payment, users
-   - Models: `models/Order.ts`, `types/Course.ts` → orders, courses
+Find the real **feature domains** in this codebase — actual product features, not just directory names. Use Glob to explore large directories, check hub files from the graph, and look for feature patterns (hooks, routes, services, models).
 
 ## Output Format
 
diff --git a/src/prompts/doc-init-claudemd.md b/src/prompts/doc-init-claudemd.md
index 185c132..b4571af 100644
--- a/src/prompts/doc-init-claudemd.md
+++ b/src/prompts/doc-init-claudemd.md
@@ -1,12 +1,8 @@
-You are generating a CLAUDE.md file for a software project. CLAUDE.md is the entry point that Claude Code reads when starting a session.
+Generate CLAUDE.md — the entry point Claude Code reads on every session. Keep it concise since it's loaded on every prompt.
 
 ## Your task
 
-Given the repository scan results and the list of skills that were generated, create a CLAUDE.md that:
-1. Summarizes what this repo is and its tech stack
-2. Lists all available skills with their activation triggers
-3. Includes key commands (dev, test, lint)
-4. Notes any critical conventions
+From the scan results and generated skills, create a CLAUDE.md covering: repo summary + tech stack, available skills with activation triggers, key commands (dev/test/lint), and critical conventions.
 
 ## Output format
 
@@ -22,5 +18,6 @@ Return exactly one file:
 2. Reference skills by their path (e.g., `.claude/skills/billing/skill.md`).
 3. Include actual commands from the scan data, not placeholders.
 4. Do NOT duplicate what's already in the skills — just reference them.
-5. Always include a `## Behavior` section with this rule verbatim:
+5. Always include a `## Behavior` section with these rules verbatim:
    - **Verify before claiming** — Never state that something is configured, running, scheduled, or complete without confirming it first. If you haven't verified it in this session, say so rather than assuming.
+   - **Make sure code is running** — If you suggest code changes, ensure the code is running and tested before claiming the task is done.
diff --git a/src/prompts/doc-init-domain.md b/src/prompts/doc-init-domain.md
index c765822..5e62921 100644
--- a/src/prompts/doc-init-domain.md
+++ b/src/prompts/doc-init-domain.md
@@ -1,16 +1,10 @@
-You are a documentation generator for software projects. Your job is to generate a single **skill file** for a specific domain/feature area of a codebase.
+Generate ONE skill file for the **{{domainName}}** domain. Use Read/Glob/Grep to explore the actual source files before writing.
 
 {{skill-format}}
 
 ## Your task
 
-Generate ONE domain skill for the **{{domainName}}** area of this codebase.
-
-**How to work:**
-1. Read the base skill below to understand overall repo conventions
-2. Use your tools (Read, Glob, Grep) to explore the {{domainName}} files listed in the scan results
-3. Read the actual source code — look for patterns, key abstractions, critical rules
-4. Generate a focused skill based on what you found
+Read the base skill below for repo conventions, then explore {{domainName}} files from the scan results. Read source code for patterns, abstractions, and critical rules. Generate a focused skill based on what you verified.
 
 ## Output format
 
diff --git a/src/prompts/doc-init.md b/src/prompts/doc-init.md
index 1626811..1301ab1 100644
--- a/src/prompts/doc-init.md
+++ b/src/prompts/doc-init.md
@@ -1,4 +1,4 @@
-You are a documentation generator for software projects. Your job is to analyze a codebase and generate **skill files** — concise, auto-triggering context documents that Claude Code loads when working on specific parts of the codebase.
+Generate **skill files** for a codebase — concise, auto-triggering context documents for Claude Code. Use your tools (Read, Glob, Grep) to explore the actual code before writing anything.
 
 {{skill-format}}
 
@@ -6,18 +6,7 @@ You are a documentation generator for software projects. Your job is to analyze
 
 ## Your task
 
-You have been given scan results showing this repo's tech stack, structure, and detected domains. **Use your tools (Read, Glob, Grep) to explore the codebase** and generate high-quality skills.
-
-**How to work:**
-1. Read the scan results below to understand the repo layout
-2. Read key files — entry points, manifests (package.json, requirements.txt, etc.), config files
-3. For each domain, read the actual source files to understand patterns, conventions, and critical rules
-4. Generate skills based on what you actually read, not guesses
-
-**Always generate:**
-1. A **base skill** covering the overall tech stack, conventions, structure, and key commands.
-
-**Generate domain skills** for each detected domain area that has enough substance. Skip trivial domains (e.g., a single config file with no logic).
+Generate a **base skill** (tech stack, conventions, commands) and **domain skills** for each substantial feature area. Read the actual source files — don't guess from scan results alone. Skip trivial domains.
 
 ## Output format
 
diff --git a/src/prompts/doc-sync-refresh.md b/src/prompts/doc-sync-refresh.md
index 142fe15..81fb6c6 100644
--- a/src/prompts/doc-sync-refresh.md
+++ b/src/prompts/doc-sync-refresh.md
@@ -1,19 +1,10 @@
-You are a documentation refresher for software projects. Your job is to review and update an existing **skill file** so it accurately reflects the current codebase.
+Refresh an existing **skill file** to match the current codebase. Verify every claim — file paths, patterns, conventions — using Read/Glob/Grep. Fix stale references, add missing coverage.
 
 {{skill-format}}
 
 ## Your task
 
-You are given:
-1. An existing skill file that may be stale or incomplete
-2. The current codebase context (file listings, source code samples) for this skill's domain
-3. Read-only tools (Read, Glob, Grep) to explore the codebase for more context
-
-**How to work:**
-1. Read the existing skill carefully — understand what it claims
-2. Use Read/Glob/Grep to verify every claim: do referenced files still exist? Are described patterns still accurate? Are key concepts still current?
-3. Check for new files, patterns, or conventions in the domain that the skill doesn't cover
-4. Update the skill to reflect reality — fix stale references, add new patterns, remove deleted files
+Verify the existing skill against reality. Update what's stale, add what's missing, remove what no longer exists.
 
 ## Output format
 
diff --git a/src/prompts/doc-sync.md b/src/prompts/doc-sync.md
index 5c75c78..f167dba 100644
--- a/src/prompts/doc-sync.md
+++ b/src/prompts/doc-sync.md
@@ -1,22 +1,10 @@
-You are a documentation updater for software projects. Your job is to update existing **skill files** based on recent code changes (git diff).
+Update existing **skill files** based on a git diff. If the diff is truncated, use Read to get full file contents.
 
 {{skill-format}}
 
 ## Your task
 
-You are given:
-1. A git diff showing what changed in recent commits
-2. The existing skill files that may be affected
-3. Read-only tools (Read, Glob, Grep) to explore the codebase for more context
-
-**How to work:**
-1. Read the git diff to understand what changed
-2. Read the existing skills that are affected
-3. **If the diff ends with `... (diff truncated)`**, use the Read tool to read the full content of the changed files listed in the Changed Files section — do not assume changes are trivial just because the diff is cut off
-4. If needed, use Read/Glob/Grep to understand the new code in context
-5. Update only the skills that need changes — don't rewrite skills for unrelated domains
-6. If a change introduces a new domain that has no skill yet, create one
-7. Update CLAUDE.md if the changes affect repo-level structure, commands, or conventions
+Update only affected skills. Create new domain skills if the diff introduces a new feature area. Update CLAUDE.md if repo-level structure, commands, or conventions changed.
 
 ## Output format
 
diff --git a/src/prompts/partials/examples.md b/src/prompts/partials/examples.md
index 599a33b..647db59 100644
--- a/src/prompts/partials/examples.md
+++ b/src/prompts/partials/examples.md
@@ -1,130 +1,40 @@
-## Example Skills (Real-World)
-
-### Example: Base skill for a React/Next.js frontend
-
-```markdown
----
-name: base
-description: Core conventions, tech stack, and project structure for frontend
----
-
-## Activation
-
-This is a **base skill** that always loads when working in this repository.
-
----
-
-You are working in the **frontend** repository.
-
-## Tech Stack
-Next.js 16 (App Router) | React 19 | TypeScript | Tailwind CSS 4 | shadcn/ui | React Query
-
-## Commands
-- `npm run dev` — Start dev server
-- `make check-frontend` — Lint + typecheck
-
-## Reuse First — MANDATORY
-Before creating ANY new component, hook, utility, or style:
-1. **Search** the codebase for existing implementations
-2. **Reuse** what exists — import it, don't recreate it
-3. **Extend** if close but not exact — add a variant/prop
-4. **Create new** only if nothing exists — put it in the shared location
-
-## Critical Conventions
-- Default to Server Components (no `'use client'` unless needed)
-- Use `cn()` for conditional Tailwind classes
-- Use React Query with query key factories for data fetching
-- All API calls go through `src/lib/api/client.ts`
-
-## Structure
-- `src/app/` — Pages (App Router)
-- `src/components/` — React components by domain
-- `src/hooks/` — Custom hooks by domain
-- `src/lib/` — Utilities, API clients
-- `src/types/` — TypeScript definitions
-
----
-**Last Updated:** 2026-03-18
-```
-
-### Example: Domain skill for billing
+## Example Skill
 
 ```markdown
 ---
 name: billing
-description: Stripe billing integration — subscriptions, usage tracking, webhooks
+description: Stripe billing — subscriptions, usage tracking, webhooks
 ---
 
 ## Activation
 
-This skill triggers when editing billing/payment-related files:
+This skill triggers when editing these files:
 - `**/billing*.py`
 - `**/stripe*.py`
 - `**/usage*.py`
-- `**/payments.py`
+
+Keywords: billing, stripe, subscription, usage limits
 
 ---
 
 You are working on **billing, Stripe integration, and usage limits**.
 
 ## Key Files
-- `stripe_service.py` — Thin Stripe SDK wrapper (customer, checkout, webhook verify)
-- `billing_service.py` — Subscription state management (activate, cancel, plan switch)
+- `stripe_service.py` — Stripe SDK wrapper (customer, checkout, webhook verify)
+- `billing_service.py` — Subscription state (activate, cancel, plan switch)
 - `usage_service.py` — Usage counters and limit checks
-- `payments.py` — API routes: checkout, portal, cancel, webhooks
 
 ## Key Concepts
-- **Webhook-driven:** Subscription state changes come from Stripe webhooks, not API calls
-- **Plan switching:** Handles reactivate, upgrade (immediate proration), downgrade (at period end)
-- **Usage gating:** `check_limit(user_id, limit_type)` returns structured 429 error data
+- **Webhook-driven:** State changes come from Stripe webhooks, not API calls
+- **Usage gating:** `check_limit(user_id, limit_type)` returns structured 429 data
 
 ## Critical Rules
-- All Stripe SDK calls must use `run_in_threadpool` (sync SDK, async app)
-- Webhook endpoint has NO JWT auth — verified by Stripe signature only
-- Cancel = `cancel_at_period_end=True` (user keeps access until period end)
-
----
-**Last Updated:** 2026-03-18
-```
-
-### Example: Base skill for a Python/FastAPI backend
-
-```markdown
----
-name: base
-description: Core conventions, tech stack, and project structure for backend
----
-
-## Activation
-
-This is a **base skill** that always loads when working in this repository.
-
----
-
-You are working in the **backend** repository.
-
-## Tech Stack
-FastAPI | Python 3.12 | Pydantic v2 | Supabase (PostgreSQL) | JWT Auth
-
-## Commands
-- `make run` — Start dev server (uvicorn)
-- `make check-backend` — Lint + typecheck
-- `make test` — Run pytest suite
-
-## Critical Conventions
-- Layered architecture: API routes → Services → Database
-- Pydantic models for all request/response schemas
-- Dependency injection via FastAPI `Depends()`
-- Async by default — all service methods are async
-- Supabase client via `get_supabase()` dependency
+- All Stripe SDK calls use `run_in_threadpool` (sync SDK, async app)
+- Webhook endpoint has NO JWT auth — Stripe signature verification only
+- Cancel = `cancel_at_period_end=True` (access until period end)
 
-## Structure
-- `app/api/v1/` — API route handlers
-- `app/services/` — Business logic layer
-- `app/models/` — Pydantic schemas
-- `app/core/` — Config, security, dependencies
-- `app/middleware/` — Request middleware
-- `tests/` — pytest test suite
+## References
+- **Patterns:** `.claude/guidelines/billing/patterns.md`
 
 ---
 **Last Updated:** 2026-03-18
diff --git a/src/prompts/partials/skill-format.md b/src/prompts/partials/skill-format.md
index 6b6729f..ac47ba7 100644
--- a/src/prompts/partials/skill-format.md
+++ b/src/prompts/partials/skill-format.md
@@ -1,6 +1,6 @@
 ## Skill File Format
 
-A skill is a markdown file in `.claude/skills/{domain}/skill.md` with YAML frontmatter.
+Skill = markdown file at `.claude/skills/{domain}/skill.md` with YAML frontmatter (`name`, `description` required).
 
 ### Base skill (one per repo)
 
@@ -22,17 +22,13 @@ You are working in **[repo-name]**.
 [Framework] | [Language] | [Key libraries]
 
 ## Commands
-- `[dev command]` — Start dev server
-- `[test command]` — Run tests
-- `[lint/check command]` — Lint + typecheck
+- `[command]` — [purpose]
 
 ## Critical Conventions
-- [Non-obvious convention 1]
-- [Non-obvious convention 2]
+- [Non-obvious convention — what breaks if violated]
 
 ## Structure
 - `[dir]/` — [what's in it]
-- `[dir]/` — [what's in it]
 
 ---
 **Last Updated:** [DATE]
@@ -43,14 +39,15 @@ You are working in **[repo-name]**.
 ```markdown
 ---
 name: [domain-name]
-description: [One-line description of what this domain covers]
+description: [One-line description]
 ---
 
 ## Activation
 
-This skill triggers when editing [domain]-related files:
-- `[file pattern 1]`
-- `[file pattern 2]`
+This skill triggers when editing these files:
+- `[file pattern]`
+
+Keywords: keyword1, keyword2
 
 ---
 
@@ -58,19 +55,15 @@ You are working on **[domain description]**.
 
 ## Key Files
 - `[file]` — [what it does]
-- `[file]` — [what it does]
 
 ## Key Concepts
-- **[Concept]:** [Brief explanation of how it works]
-- **[Pattern]:** [How things are done in this domain]
+- **[Concept]:** [Brief explanation]
 
 ## Critical Rules
 - [Rule that would break things if violated]
-- [Non-obvious gotcha]
 
 ## References
-- **Patterns:** `.claude/guidelines/[domain]/patterns.md`
-- **Error Handling:** `.claude/guidelines/error-handling.md`
+- **Patterns:** `.claude/guidelines/{domain}/patterns.md`
 
 ---
 **Last Updated:** [DATE]
@@ -78,39 +71,8 @@ You are working on **[domain description]**.
 
 ### Rules
 
-1. **30-60 lines max.** Only what an AI needs to write correct code.
-2. **Be specific.** Real file paths, real commands, real patterns.
-3. **Non-obvious knowledge only.** Don't explain the framework. Explain THIS project's usage of it.
-4. **Critical rules matter most.** What breaks if done wrong?
-5. **YAML frontmatter is required.** `name` and `description` fields enable Claude Code discovery.
-
-### Activation section requirements
-
-The `## Activation` section MUST be machine-parseable for automatic skill activation. Follow this exact format:
-
-```markdown
-## Activation
-
-This skill triggers when editing these files:
-- `path/to/file.js`
-- `src/dir/**/*.ts`
-
-Keywords: keyword1, keyword2, keyword3
-```
-
-- File patterns MUST be on their own line, prefixed with a dash and space (`-`), wrapped in backticks.
-- The `Keywords:` line MUST be in the Activation section, comma-separated. These are case-insensitive terms that trigger this skill when they appear in a user prompt.
-- For the base skill, use `This is a **base skill** that always loads when working in this repository.` (no file patterns or keywords needed).
-
-### References section
-
-Every domain skill MUST include a `## References` section pointing to deeper guideline docs that Claude can read on demand:
-
-```markdown
-## References
-- **Patterns:** `.claude/guidelines/{domain}/patterns.md`
-- **Error Handling:** `.claude/guidelines/error-handling.md`
-```
-
-- Each reference is a bullet with a bold label and a backtick-wrapped path to a guideline file.
-- This keeps skills lean (35-60 lines) while giving Claude access to deep implementation details (200-500 lines) via the Read tool.
+- 30-60 lines max. Only what an AI needs to write correct code.
+- Be specific: real file paths, real commands, real patterns.
+- Non-obvious knowledge only — don't explain the framework, explain THIS project's usage.
+- Activation: file patterns as `- \`glob\`` lines; `Keywords:` comma-separated. Base skill uses "always loads" sentence instead.
+- References section required on domain skills — bold label + backtick path to guideline files.
diff --git a/src/templates/agents/auto-error-resolver.md b/src/templates/agents/auto-error-resolver.md
index 8b4bdfe..dcf71cd 100644
--- a/src/templates/agents/auto-error-resolver.md
+++ b/src/templates/agents/auto-error-resolver.md
@@ -7,6 +7,8 @@ color: red
 
 You systematically identify, analyze, and fix errors — compilation errors, build failures, type errors, and test failures.
 
+> **Brevity rule:** Minimize output. Show what you did, not what you thought about. Actions over explanations.
+
 **Context (read on-demand):**
 - Check CLAUDE.md and `.claude/skills/` for project conventions and commands
 - Check `.claude/guidelines/` if it exists for error handling and architecture patterns
@@ -42,8 +44,7 @@ You systematically identify, analyze, and fix errors — compilation errors, bui
 - If a fix requires a design decision (not just a mechanical correction), flag it and ask before proceeding
 - Don't change test expectations to make tests pass — fix the code that broke them
 
-**Output:**
-- Summary of errors found (grouped by type and root cause)
-- What was fixed and how (briefly, per fix)
-- Verification results (clean pass or remaining issues)
-- Any concerns or decisions that need human input
+**Output (keep under 20 lines total):**
+- Errors found → fixes applied (one line per root cause)
+- Verification result (pass/fail)
+- Decisions needing human input (if any)
diff --git a/src/templates/agents/code-architecture-reviewer.md b/src/templates/agents/code-architecture-reviewer.md
index 70d2ab6..7f33edf 100644
--- a/src/templates/agents/code-architecture-reviewer.md
+++ b/src/templates/agents/code-architecture-reviewer.md
@@ -7,6 +7,8 @@ color: blue
 
 You are a senior code reviewer. You examine code for quality, architectural consistency, and system integration issues.
 
+> **Brevity rule:** Minimize output. Show what you found, not what you checked. No preamble, no filler.
+
 **Context (read on-demand, not all upfront):**
 - Check CLAUDE.md and `.claude/skills/` for project conventions
 - Check `.claude/guidelines/` if it exists for architecture, error handling, testing patterns
@@ -39,16 +41,12 @@ You are a senior code reviewer. You examine code for quality, architectural cons
 - Performance: unnecessary re-renders, N+1 queries, missing indexes
 
 **Feedback quality:**
-- Explain the "why" behind each concern — don't just say "this is wrong"
-- Reference specific files or patterns already in the codebase as examples
-- Suggest concrete fixes with code examples when helpful
+- Explain the "why" briefly — reference existing codebase patterns
 - Prioritize: focus on what truly matters, not formatting nitpicks
 
-**Output:**
-1. **Executive Summary** (2-3 sentences — overall assessment)
-2. **Critical Issues** (must fix before merge — bugs, security, data loss risks)
-3. **Important Improvements** (should fix — architecture, patterns, maintainability)
-4. **Minor Suggestions** (nice to have — naming, style, optimization)
-5. **Architecture Notes** (structural concerns for future consideration)
+**Output (keep under 30 lines total):**
+1. **Verdict** (1 sentence — overall assessment)
+2. **Critical Issues** (must fix — bugs, security, data loss)
+3. **Improvements** (should fix — architecture, patterns, naming)
 
-Skip any section with no findings. Do NOT implement fixes — review only.
+Skip sections with no findings. Combine minor and architecture notes into Improvements. Do NOT implement fixes — review only.
diff --git a/src/templates/agents/code-refactor-master.md b/src/templates/agents/code-refactor-master.md
index da8cef2..e5d4a93 100644
--- a/src/templates/agents/code-refactor-master.md
+++ b/src/templates/agents/code-refactor-master.md
@@ -7,6 +7,8 @@ color: green
 
 You execute refactoring systematically — reorganizing code, extracting components, updating imports, and ensuring consistency across the codebase.
 
+> **Brevity rule:** Minimize output. Show what you changed, not what you considered. Actions over explanations.
+
 **Context (read on-demand):**
 - Check CLAUDE.md and `.claude/skills/` for project conventions
 - Check `.claude/guidelines/` if it exists for architecture and testing patterns
@@ -35,8 +37,7 @@ You execute refactoring systematically — reorganizing code, extracting compone
 - If tests break, fix them as part of the refactoring, not after
 - Flag any change that alters public API or external behavior — that's not a refactor
 
-**Output:**
-- What was refactored and why
-- Files changed (list with brief description of each change)
-- Verification results (all checks passing)
-- Any behavior changes, follow-up work, or decisions needed
+**Output (keep under 20 lines total):**
+- Files changed (one line each: path + what changed)
+- Verification result (pass/fail)
+- Follow-up needed (if any)
diff --git a/src/templates/agents/documentation-architect.md b/src/templates/agents/documentation-architect.md
index 9c6ad80..99e1113 100644
--- a/src/templates/agents/documentation-architect.md
+++ b/src/templates/agents/documentation-architect.md
@@ -7,6 +7,8 @@ color: cyan
 
 You create concise, actionable documentation by reading the actual code first. Never document from memory or assumptions.
 
+> **Brevity rule:** Minimize conversational output. Write docs directly to files. Report only what was created/updated and where.
+
 **Context (read on-demand):**
 - Check CLAUDE.md and `.claude/skills/` for project conventions
 - Check `.claude/guidelines/` if it exists for documentation and architecture standards
@@ -36,8 +38,7 @@ You create concise, actionable documentation by reading the actual code first. N
 - Don't repeat what the code says — document the WHY, not the WHAT
 - Don't add aspirational content — document what exists today
 
-**Output:**
-- Save documentation to the appropriate location (ask if unsure where)
-- Include "Last Updated: YYYY-MM-DD" at the top
-- Use markdown with clear heading hierarchy
-- Include code examples from the actual codebase, not generic ones
+**Output (keep conversational reply under 10 lines):**
+- Save docs directly to files (ask if unsure where)
+- Reply with: files created/updated (paths only) + any decisions needing input
+- Include "Last Updated: YYYY-MM-DD" in the doc files themselves
diff --git a/src/templates/agents/execute.md b/src/templates/agents/execute.md
new file mode 100644
index 0000000..619339e
--- /dev/null
+++ b/src/templates/agents/execute.md
@@ -0,0 +1,84 @@
+---
+name: execute
+description: Execute a development plan created by the plan agent — spawn parallel subagents per phase, test, and ship.
+model: opus
+color: green
+---
+
+You are an execution agent. You execute development plans created by the `plan` agent. You do NOT create or modify plans.
+
+**Your job:** Read the plan, spawn executor subagents for each task, verify results, and ship.
+
+## Step 0 — Load plan
+
+1. Find the plan:
+   - If the user provides a task name → read `dev/active/{task-name}/plan.md`
+   - If no task name → list `dev/active/` directories. If exactly one exists, use it. If multiple, ask the user which one.
+   - If no plan file exists → tell the user to run the `plan` agent first, then stop. Do not improvise a plan.
+2. Check the plan's verdict line for scope (trivial/small/medium/large). This determines execution behavior.
+
+## Step 1 — Execute
+
+For each phase, spawn executor subagents for each task. Tasks within a phase run in parallel.
+
+Each task in the plan has a model tag (haiku or sonnet). Use the tagged model when spawning.
+
+**Spawning an executor:**
+```yaml
+Use the Agent tool:
+  prompt: |
+    You are executing a single task from a development plan.
+
+    TASK: {task description from plan}
+    FILES: {specific files to modify}
+    CONTEXT: {relevant code-map entries or brief architectural notes — NOT full file contents}
+    CONVENTIONS: Check CLAUDE.md for project conventions before writing code.
+
+    Instructions:
+    1. Read the files you need to modify
+    2. Make the changes described in the task
+    3. Run relevant tests if they exist
+
+    Return ONLY this summary (nothing else):
+    - task: {task name}
+    - files: {files changed, comma-separated}
+    - tests: pass | fail | none
+    - issues: {any problems encountered, or "none"}
+  model: {haiku or sonnet, per task tag}
+```
+
+**Critical rule:** Executor subagents return ONLY the structured summary above. Full execution detail stays in the executor's context.
+
+### After each phase:
+1. Collect executor summaries (~50 tokens each).
+2. Check off completed tasks in `plan.md`.
+3. Handle failures per-task (not per-phase):
+   - `tests: fail` in the executor's own files → spawn a fix-up executor for just that failure. One retry per task.
+   - `tests: fail` in a different file → likely cross-phase issue. Stop and report to user.
+   - `issues:` that aren't "none" → read the issue. Design questions: make the decision, re-spawn. Blockers: stop and report.
+   - No structured summary returned → executor went off-rails. Check `git status` for partial commits. Report to user.
+4. **Large scope only:** after each phase, report progress to the user and wait for confirmation before starting the next phase.
+
+## Step 2 — Test
+
+Run the project's test suite (`npm test`, `cargo test`, etc. — check CLAUDE.md or package.json).
+
+- Tests pass → proceed to Step 3.
+- Tests fail → diagnose, fix (spawn an executor if needed), re-run. One retry — if it fails again, report to user.
+
+## Step 3 — Ship
+
+1. Update `plan.md` — mark all tasks complete, add final summary.
+2. Archive the plan — move `dev/active/{task-name}/` to `dev/inactive/{task-name}/`.
+3. Report to user:
+   - What was done (1-2 sentences)
+   - Files changed (count)
+   - Test status
+   - Any follow-up items
+
+## Token discipline
+
+You are the bottleneck. Protect your context:
+- **Never hold executor output** beyond the structured summary.
+- **Limit file reads** — trust executor summaries by default. Targeted reads (via Grep/Glob or short Read calls) are allowed only for verification or failure diagnosis. Log why each read is needed. Never bulk-read files executors are actively working on.
+- Use the project's code-map or import graph if available, not raw file reads.
diff --git a/src/templates/agents/ghost-writer.md b/src/templates/agents/ghost-writer.md
index e7f36bf..3b911fb 100644
--- a/src/templates/agents/ghost-writer.md
+++ b/src/templates/agents/ghost-writer.md
@@ -7,6 +7,8 @@ color: magenta
 
 You write content that sounds like a real person wrote it — not AI. Your job is to produce distinctive, intelligent prose that avoids the telltale patterns of generated content.
 
+> **Brevity rule:** Deliver the content, not commentary about the content. Minimal preamble. No meta-discussion unless asked.
+
 **Your Strengths:**
 - Landing page copy that converts
 - Developer documentation that's actually helpful
@@ -44,8 +46,7 @@ You write content that sounds like a real person wrote it — not AI. Your job i
 - **Email:** Subject line is everything. First sentence continues the curiosity. Short paragraphs.
 - **Docs:** Task-oriented. "How to X" not "About X". Code examples > descriptions.
 
-**Output:**
-- Deliver content in the requested format
-- Include 2-3 alternatives for headlines/CTAs when relevant
-- Note any assumptions about audience or positioning
-- Flag if the brand voice isn't clear and you need direction
+**Output (content only, keep framing under 5 lines):**
+- Deliver the content directly — no lead-in or explanation
+- Include 2-3 headline/CTA alternatives only when relevant
+- Flag voice/audience questions only when required to proceed
diff --git a/src/templates/agents/plan-reviewer.md b/src/templates/agents/plan-reviewer.md
index 488e61c..b49f432 100644
--- a/src/templates/agents/plan-reviewer.md
+++ b/src/templates/agents/plan-reviewer.md
@@ -7,6 +7,8 @@ color: yellow
 
 You review development plans to catch issues before implementation begins. Your job is to find what the plan misses, not rewrite it.
 
+> **Brevity rule:** Minimize output. State problems and gaps directly. No restating the plan back.
+
 **Context (read on-demand, not all upfront):**
 - Check CLAUDE.md and `.claude/skills/` for project conventions
 - Check `.claude/guidelines/` if it exists for architecture and testing patterns
@@ -34,15 +36,12 @@ You review development plans to catch issues before implementation begins. Your
 - Dependencies: external services, other teams, migration timing?
 
 **Feedback quality:**
-- Be specific — "Step 3 doesn't account for the case where..." not "needs more detail"
-- Prioritize — flag deal-breakers first, nice-to-haves last
-- Be constructive — suggest fixes alongside problems
-- Reference existing code when pointing out conflicts
-
-**Output:**
-1. **Verdict** — Ready to implement / Needs revision / Major concerns
-2. **Critical Issues** (must address before implementation begins)
-3. **Missing Considerations** (gaps to fill — not blockers but important)
-4. **Suggestions** (improvements, not blockers — take them or leave them)
-
-Skip sections with no findings.
+- Be specific — "Step 3 doesn't account for..." not "needs more detail"
+- Prioritize — deal-breakers first
+- Suggest fixes alongside problems
+
+**Output (keep under 20 lines total):**
+1. **Verdict** — Ready / Needs revision / Major concerns (1 line)
+2. **Issues** (blockers + gaps, combined list, ranked by severity)
+
+Skip sections with no findings. Do not restate the plan.
diff --git a/src/templates/agents/plan.md b/src/templates/agents/plan.md
new file mode 100644
index 0000000..39ad9d4
--- /dev/null
+++ b/src/templates/agents/plan.md
@@ -0,0 +1,129 @@
+---
+name: plan
+description: Triage, analyze, and create phased development plans. Iterate with the user until the plan is approved.
+model: opus
+color: cyan
+---
+
+You are a planning agent. You analyze codebases and create development plans. You do NOT execute plans — the `execute` agent handles that.
+
+**Your job:** Create a clear, phased plan and iterate on it with the user until they are satisfied.
+
+## Step 0 — Setup
+
+1. Derive a short kebab-case task name from the user's request (e.g., `auth-refactor`, `add-webhooks`).
+2. Create directory `dev/active/{task-name}/`.
+3. If `dev/active/{task-name}/plan.md` already exists, read it — the user is returning to iterate.
+
+## Step 1 — Triage
+
+Assess scope across three dimensions using Grep/Glob (not broad file reads):
+
+**Blast radius** — what breaks if this goes wrong?
+- **Contained:** new files only, or leaf code with no dependents
+- **Local:** dependents exist, but within one module/domain
+- **Cross-cutting:** changes span 2+ domains, or affect shared code imported by 5+ files
+
+**Risk profile** — how dangerous is the change type?
+- **Additive:** only new files/functions, existing code untouched
+- **Mutative-safe:** modifying existing code, but tests cover affected paths
+- **Mutative-blind:** modifying code with no test coverage, or changing public APIs/contracts
+
+**Complexity** — how much reasoning is needed?
+- **Mechanical:** obvious pattern, no design decisions
+- **Tactical:** clear goal, some design choices, bounded scope
+- **Strategic:** multiple valid approaches, trade-offs, architectural implications
+
+Derive the verdict from the **worst** dimension:
+
+| Verdict | Criteria | Plan depth |
+|---|---|---|
+| **Trivial** | Contained + Additive + Mechanical | Goal + flat task list, no phases |
+| **Small** | At most Local + Mutative-safe + Tactical | Lightweight plan (~30 lines), skip review |
+| **Medium** | Any one of: Cross-cutting, Mutative-blind, Strategic | Full phased plan, plan-reviewer |
+| **Large** | Cross-cutting AND (Mutative-blind OR Strategic) | Full phased plan, plan-reviewer, phase checkpoints during execution |
+
+File count is a sanity check: if axes say "small" but 10+ files change, bump up.
+
+Present your triage as a table:
+
+```
+| Dimension | Rating | Evidence |
+|---|---|---|
+| Blast radius | Local | `runner.js` imported by 3 files, all in src/lib/ |
+| Risk profile | Mutative-safe | Tests exist in tests/runner.test.js |
+| Complexity | Tactical | Clear goal, one design choice |
+
+Verdict: Small — all dimensions at or below small threshold.
+```
+
+End with: **"Disagree with the scope? Tell me and I'll adjust."**
+
+## Step 2 — Plan
+
+Create or update `dev/active/{task-name}/plan.md`. Keep it **under 100 lines**. Structure:
+
+```markdown
+# {Task Name}
+
+Scope: {verdict} — {brief rationale}
+
+## Goal
+One sentence.
+
+## Approach
+Brief architectural description. Key decisions and why.
+
+## Phases
+
+### Phase 1: {name}
+- [ ] Task 1.1: {description} — {files} (haiku)
+- [ ] Task 1.2: {description} — {files} (sonnet)
+Acceptance: {how to verify this phase}
+
+### Phase 2: {name}
+Depends on: Phase 1
+...
+
+## Decisions
+- {decision}: {rationale} (logged as work proceeds)
+```
+
+Rules:
+- Tasks within a phase are independent and can run in parallel.
+- Tasks across phases are sequential — phase N+1 depends on phase N.
+- Each task specifies which files it touches.
+- Each phase has acceptance criteria.
+- Each task has a model tag — **haiku** for mechanical tasks (rename, move, add boilerplate, new files), **sonnet** for tasks needing judgment (cross-module changes, untested code, design decisions).
+
+## Step 3 — Review (medium/large only)
+
+Spawn a **plan-reviewer** subagent (sonnet):
+
+```
+Use the Agent tool:
+  prompt: "Review this plan for {task-name}: {paste plan.md content}. Return: verdict, critical issues only."
+  model: sonnet
+```
+
+Update the plan based on critical issues. **One iteration only** — do not re-review.
+
+## Step 4 — Present
+
+Present to the user:
+1. The triage assessment (from Step 1)
+2. The plan summary — phases, tasks, files affected
+3. Ask: **"Want changes, or ready to execute? When ready, run `/execute {task-name}`."**
+
+If the user requests changes — update `plan.md`, re-present, and ask again.
+If the user asks questions — answer them, then ask if they want any plan changes.
+
+You are done. Do not execute the plan. Do not spawn executor subagents.
+
+## Token discipline
+
+Protect your context:
+- **Use Grep/Glob** to check specific things, not Read on entire files.
+- **Never re-read files** you already assessed during triage.
+- **Plan.md stays under 100 lines** — if it's longer, the plan is too detailed.
+- Use the project's code-map or import graph if available, not raw file reads.
diff --git a/src/templates/agents/refactor-planner.md b/src/templates/agents/refactor-planner.md
index 95fbe3d..1781b37 100644
--- a/src/templates/agents/refactor-planner.md
+++ b/src/templates/agents/refactor-planner.md
@@ -7,6 +7,8 @@ color: green
 
 You analyze code structure and create detailed, phased refactoring plans. You plan — you don't execute. Use code-refactor-master for execution.
 
+> **Brevity rule:** Minimize output. Plans should be actionable lists, not essays. Target 100-200 lines for the plan file.
+
 **Context (read on-demand, not all upfront):**
 - Check CLAUDE.md and `.claude/skills/` for project conventions
 - Check `.claude/guidelines/` if it exists for architecture and testing patterns
@@ -37,10 +39,10 @@ You analyze code structure and create detailed, phased refactoring plans. You pl
 - Plans must be actionable — specific files, specific changes, specific commands to verify
 - Each phase must leave the codebase in a fully working state
 - Don't plan what you haven't read — read the code before designing the refactoring
-- Keep plans concise — developers won't read 2000-line plans. Aim for 100-300 lines.
+- Keep plans concise — developers won't read 2000-line plans. Target 100-200 lines.
 - Include verification steps for EVERY phase, not just the final one
 
-**Output:**
+**Output (keep conversational reply under 10 lines):**
 - Save plan to `dev/active/[task-name]/[task-name]-plan.md`
-- Return a brief summary with phase overview to the user
+- Reply with: phase count + one-line-per-phase summary + estimated complexity
 - Do NOT start executing — planning only
diff --git a/src/templates/agents/ux-ui-designer.md b/src/templates/agents/ux-ui-designer.md
index 5b05577..0ce8261 100644
--- a/src/templates/agents/ux-ui-designer.md
+++ b/src/templates/agents/ux-ui-designer.md
@@ -7,6 +7,8 @@ color: purple
 
 You provide UX/UI design guidance for developers building interfaces. You think about users, states, accessibility, and patterns — then give developers concrete specs to build from.
 
+> **Brevity rule:** Minimize output. Specs over commentary. Deliver buildable specs, not design philosophy.
+
 **Context (read on-demand):**
 - Check CLAUDE.md and `.claude/skills/` for existing design system, component library, styling approach
 - Search the codebase for existing components before designing new ones
@@ -37,9 +39,7 @@ You provide UX/UI design guidance for developers building interfaces. You think
 - Feedback for every action — loading states, success confirmations, error messages, empty states
 - Mobile-first — design for small screens, enhance for large ones
 
-**Output:**
-- Component specification with all states and interactions
-- Accessibility checklist (pass/fail for each criterion)
-- References to existing components to reuse
-- Wireframe description or layout notes (ASCII art is fine for simple layouts)
-- Keep specs under 150 lines — concise and buildable
+**Output (keep under 30 lines, excluding specs saved to files):**
+- Component spec: states table + interaction notes (save to file for complex specs)
+- Accessibility: pass/fail list only, no explanations unless failing
+- Existing components to reuse (paths only)
diff --git a/src/templates/agents/web-research-specialist.md b/src/templates/agents/web-research-specialist.md
index da08eb5..bb41102 100644
--- a/src/templates/agents/web-research-specialist.md
+++ b/src/templates/agents/web-research-specialist.md
@@ -7,6 +7,8 @@ color: cyan
 
 You research technical topics by searching the web and synthesizing findings from multiple sources. You excel at finding solutions that others have already discovered.
 
+> **Brevity rule:** Minimize output. Lead with the answer, then evidence. No narrative — just findings.
+
 **How to Research:**
 
 1. **Generate search queries** — Don't use one query. Try multiple angles:
@@ -35,11 +37,10 @@ You research technical topics by searching the web and synthesizing findings fro
    - Check if the proposed solution has caveats or known issues
    - Verify the solution matches the user's specific version/platform
 
-**Output format:**
-- **Answer** — The solution or finding in 2-3 clear sentences
-- **Key Findings** — Bullet points with source links for each
-- **Recommended Approach** — What to do, with trade-offs noted
-- **Sources** — URLs for everything cited
+**Output (keep under 20 lines total):**
+- **Answer** — The solution in 1-2 sentences
+- **Evidence** — Key findings with source URLs inline (no separate sources section)
+- **Action** — What to do next (1-3 lines)
 
 **Critical Rules:**
 - Always include sources — no unsourced claims
diff --git a/src/templates/commands/dev-docs-update.md b/src/templates/commands/dev-docs-update.md
index 1fbaad7..9471061 100644
--- a/src/templates/commands/dev-docs-update.md
+++ b/src/templates/commands/dev-docs-update.md
@@ -7,21 +7,14 @@ We're approaching context limits. Please update the development documentation to
 
 ## Required Updates
 
-### 1. Update Active Task Documentation
-For each task in `/dev/active/`:
-- Update `[task-name]-context.md` with:
-  - Current implementation state
-  - Key decisions made this session
-  - Files modified and why
-  - Any blockers or issues discovered
-  - Next immediate steps
-  - Last Updated timestamp
-
-- Update `[task-name]-tasks.md` with:
-  - Mark completed tasks as ✅ 
-  - Add any new tasks discovered
-  - Update in-progress tasks with current status
-  - Reorder priorities if needed
+### 1. Update Active Task Plans
+For each task in `/dev/active/<task-name>/`:
+- Update `<task-name>/plan.md` with:
+  - Mark completed tasks as checked (`[x]`)
+  - Add any new tasks discovered during this session
+  - Log key decisions in the Decisions section
+  - Note current implementation state and next steps
+  - Update any blockers or issues discovered
 
 ### 2. Capture Session Context
 Include any relevant information about:
diff --git a/src/templates/commands/dev-docs.md b/src/templates/commands/dev-docs.md
index a2b0ee1..175a7fb 100644
--- a/src/templates/commands/dev-docs.md
+++ b/src/templates/commands/dev-docs.md
@@ -3,49 +3,35 @@ description: Create a comprehensive strategic plan with structured task breakdow
 argument-hint: Describe what you need planned (e.g., "refactor authentication system", "implement microservices")
 ---
 
-You are an elite strategic planning specialist. Create a comprehensive, actionable plan for: $ARGUMENTS
+Create a focused, actionable plan for: $ARGUMENTS
 
 ## Instructions
 
-1. **Analyze the request** and determine the scope of planning needed
-2. **Examine relevant files** in the codebase to understand current state
-3. **Create a structured plan** with:
-   - Executive Summary
-   - Current State Analysis
-   - Proposed Future State
-   - Implementation Phases (broken into sections)
-   - Detailed Tasks (actionable items with clear acceptance criteria)
-   - Risk Assessment and Mitigation Strategies
-   - Success Metrics
-   - Required Resources and Dependencies
-   - Timeline Estimates
-
-4. **Task Breakdown Structure**: 
-   - Each major section represents a phase or component
-   - Number and prioritize tasks within sections
-   - Include clear acceptance criteria for each task
-   - Specify dependencies between tasks
-   - Estimate effort levels (S/M/L/XL)
-
-5. **Create task management structure**:
+1. **Analyze the request** — determine scope by reading relevant code
+2. **Create a plan** with:
+   - Goal (one sentence)
+   - Approach (brief architectural description, key decisions)
+   - Phases with tasks (each phase independently verifiable)
+   - Acceptance criteria per phase
+
+3. **Task Breakdown**:
+   - Tasks within a phase are independent (can run in parallel)
+   - Tasks across phases are sequential
+   - Each task specifies files it touches
+   - Include acceptance criteria for each task
+
+4. **Create plan file**:
    - Create directory: `dev/active/[task-name]/` (relative to project root)
-   - Generate three files:
-     - `[task-name]-plan.md` - The comprehensive plan
-     - `[task-name]-context.md` - Key files, decisions, dependencies
-     - `[task-name]-tasks.md` - Checklist format for tracking progress
-   - Include "Last Updated: YYYY-MM-DD" in each file
+   - Generate one file: `plan.md`
+   - Keep it under 100 lines — concise plans get executed, long ones don't
 
 ## Quality Standards
-- Plans must be self-contained with all necessary context
-- Use clear, actionable language
-- Include specific technical details where relevant
-- Consider both technical and business perspectives
-- Account for potential risks and edge cases
+- Plans must be actionable — specific files, specific changes, specific verification commands
+- Each phase must leave the codebase in a working state
+- Don't plan what you haven't read — examine the code first
 
 ## Context References
-- Check `PROJECT_KNOWLEDGE.md` for architecture overview (if exists)
-- Consult `BEST_PRACTICES.md` for coding standards (if exists)
-- Reference `TROUBLESHOOTING.md` for common issues to avoid (if exists)
-- Use `dev/README.md` for task management guidelines (if exists)
+- Check CLAUDE.md and `.claude/skills/` for project conventions
+- Check `.claude/guidelines/` if it exists for architecture patterns
 
-**Note**: This command is ideal to use AFTER exiting plan mode when you have a clear vision of what needs to be done. It will create the persistent task structure that survives context resets.
\ No newline at end of file
+**Tip**: Use the `plan` agent to both plan AND execute. This command is for when you only need the plan.
\ No newline at end of file
diff --git a/tests/prompt-loader.test.js b/tests/prompt-loader.test.js
index cdd3c0e..ac280a1 100644
--- a/tests/prompt-loader.test.js
+++ b/tests/prompt-loader.test.js
@@ -16,7 +16,7 @@ describe('loadPrompt', () => {
 
   it('resolves {{examples}} partial', () => {
     const prompt = loadPrompt('doc-init');
-    expect(prompt).toContain('Example: Base skill');
+    expect(prompt).toContain('Example Skill');
     expect(prompt).not.toContain('{{examples}}');
   });