Skip to content

Commit cd94997

Browse files
committed
Docs: UI reskin, AGT-02, demos, and test updates
Update implementation plan, status, and testing guide to reflect the recent merge wave (2026-03-29). Adds AppShell premium reskin and board/card surface polish entries, documents AGT-02 tool registry/policy evaluator and InboxTriageAssistant bounded template, and records demo-director presets/assertions/reports/soak additions and incident rehearsal docs. Also updates verification dates and automated test totals (backend, frontend unit, and E2E) and adjusts CI/UX status notes to mark delivered items and remaining follow-up work.
1 parent c5d792e commit cd94997

3 files changed

Lines changed: 102 additions & 34 deletions

File tree

docs/IMPLEMENTATION_MASTERPLAN.md

Lines changed: 39 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -455,6 +455,33 @@ Delivered in the latest cycle:
455455
100. TST-08 manual validation slice B — authz policy, cross-user isolation, and API error contracts (`#131`):
456456
- added `docs/testing/manual-validation-b-authz-contracts.md` with 175 step-indexed checks (B-01 to B-175) covering all 28 controllers
457457
- two-user fixture setup with curl-based bootstrap script; covers unauthenticated denial, cross-user board isolation, error payload contract verification
458+
101. AppShell premium reskin delivery (PR `#499`):
459+
- shell sidebar, topbar, command palette, and keyboard help components reskinned from hardcoded Tailwind/rgba values to `--td-*` design token system
460+
- added focus-visible accessibility rings throughout shell layer and glass morphism effects for visual coherence
461+
- no behavior changes; purely CSS/token-based styling refactor
462+
102. Board/card surface polish delivery (PR `#501`):
463+
- board canvas, toolbar, action rail, column lanes, and card components reskinned to design token system
464+
- standardized card visual states (hover, focus, selected, disabled, dragging) with token-based styling
465+
- fixed combined selected+focus-visible keyboard nav specificity conflict; replaced hardcoded font sizes with token references
466+
103. AGT-02 tool registry, policy evaluator, and first bounded template delivery (`#337`, PR `#502`):
467+
- added domain primitives: `ToolScope`/`ToolRiskLevel` enums, `ITaskdeckTool`/`ITaskdeckToolRegistry` interfaces, `PolicyDecision` value object (AllowDirect/AllowWithReview/Deny factories)
468+
- added `TaskdeckToolRegistry` (thread-safe ConcurrentDictionary, duplicate rejection, scope filtering) and `AgentPolicyEvaluator` (allowlist enforcement, risk-level gating with review-first defaults)
469+
- added `InboxTriageAssistant` bounded template: gathers pending inbox items, routes through policy evaluator, creates proposals (never direct board mutations)
470+
- DI registration: singleton tool registry with `inbox.triage` pre-registered, scoped policy evaluator and triage assistant
471+
- 42 backend tests across registry, policy evaluation, and inbox triage assistant suites
472+
104. Demo director reporting, assertions, presets, and soak mode delivery (`#331`, PR `#500`):
473+
- added named preset system (`demo-director-presets.mjs`) for common demo modes with override merging and runtime registration
474+
- added trace assertion utilities (`demo-trace-assertions.mjs`) for exact/structural comparison plus step ordering validation
475+
- added HTML report generator (`demo-report-html.mjs`) with inline styles, trace tables, pass/fail badges, and embedded base64 screenshots
476+
- added soak mode (`demo-soak.mjs`) for long-run director scenario loops with configurable iteration counts, cooldown, and cumulative metrics
477+
- 63 frontend tests covering presets, assertions, reports, soak mode, and integration pipeline
478+
105. Incident rehearsal and recovery program delivery (`#150`, PR `#503`):
479+
- added `docs/ops/INCIDENT_REHEARSAL_CADENCE.md` with monthly lightweight + quarterly deep drill schedule and rotation model
480+
- added `docs/ops/EVIDENCE_TEMPLATE.md` for standardized rehearsal outcome format with ISO 8601 timeline and bidirectional issue linking
481+
- added `docs/ops/REHEARSAL_BACKOFF_RULES.md` with finding-to-issue workflow, severity labels (P1–P4), and SLA expectations
482+
- added 4 rehearsal scenario templates (degraded-api-health, missing-telemetry-signal, mcp-server-startup-regression, deployment-readiness-failure)
483+
- added first execution evidence at `docs/ops/rehearsals/2026-03-29_degraded-api-health.md`
484+
- cross-linked from `TESTING_GUIDE.md` and `MANUAL_TEST_CHECKLIST.md`
458485

459486
## Current Planning Pivot (2026-03-07)
460487

@@ -581,10 +608,15 @@ Exit Criteria:
581608

582609
Focus:
583610
- add `AgentProfile`, `AgentRun`, and `AgentRunEvent` as first-class runtime primitives
584-
- add a tool registry abstraction and policy evaluator
585-
- add inspectable run traces and a first bounded agent template
611+
- ~~add a tool registry abstraction and policy evaluator~~ (delivered in AGT-02, `#337`)
612+
- ~~add a first bounded agent template~~ (delivered: `InboxTriageAssistant` in AGT-02)
613+
- add inspectable run traces
586614
- expose agent mode views only after the substrate is real
587615

616+
Current status:
617+
- tool registry, policy evaluator, and first bounded template are now delivered (`#337`): `ITaskdeckTool`/`ITaskdeckToolRegistry` domain interfaces, `AgentPolicyEvaluator` with allowlist + risk-level gating, and `InboxTriageAssistant` bounded template (proposal-only, review-first default)
618+
- remaining work: `AgentProfile`/`AgentRun`/`AgentRunEvent` runtime primitives (`#336`), agent mode surfaces (`#338`), inspectable run detail
619+
588620
Exit Criteria:
589621
- runs are first-class and inspectable
590622
- agent behavior remains proposal-first and trace-first by default
@@ -608,7 +640,7 @@ Exit Criteria:
608640
These continue in parallel where they protect trust, performance, or operator posture, but they should not outrun Horizon A through C product legibility work:
609641

610642
- managed-key LLM control plane and abuse controls: `#235`, `#237` (pending), `#238` (operator tooling groundwork delivered; live-traffic wiring pending), `#239` (delivered), `#240` (delivered)
611-
- premium UI foundations and reskin wave: `#242` to `#250` (plus optional `#251`); foundations now delivered: `#243` UI-02 shared primitives, `#245` UI-03 stack spike, `#250` PERF-08 budgets
643+
- premium UI foundations and reskin wave: `#242` to `#250` (plus optional `#251`); foundations delivered: `#243` UI-02 shared primitives, `#245` UI-03 stack spike, `#250` PERF-08 budgets; appshell reskin (`#499`) and board/card polish (`#501`) now shipped with design-token-based styling
612644
- long-list responsiveness and related UX scale follow-through: `#213` (delivered — inbox + activity virtualized; board cards deferred due to drag-and-drop conflicts)
613645
- platform, ops, testing, and maturity backlog: `#84` to `#111`, `#87` to `#91`
614646
- deferred outreach CRM expansion: `#262` to `#268`
@@ -627,8 +659,8 @@ These continue in parallel where they protect trust, performance, or operator po
627659
- `AgentProfile`
628660
- `AgentRun`
629661
- `AgentRunEvent`
630-
- tool registry and policy evaluator
631-
- first bounded template
662+
- tool registry and policy evaluator (delivered in AGT-02)
663+
- first bounded template (delivered: `InboxTriageAssistant`)
632664
- inspectable run detail
633665
- `R3` knowledge/integrations alpha:
634666
- `KnowledgeDocument` / `KnowledgeChunk`
@@ -689,12 +721,12 @@ These continue in parallel where they protect trust, performance, or operator po
689721
- Seeded secondary MVP follow-through wave (lower priority than Wave P):
690722
- `#329` tracker
691723
- `#330` in-app demoability and live attention cues
692-
- `#331` demo director reporting/assertions/presets/soak
724+
- `#331` demo director reporting/assertions/presets/soak (delivered)
693725
- `#332` replay-from-trace and scenario-authoring follow-through
694726
- Seeded expanded-blueprint architecture wave (future agent/knowledge/release-gate follow-through):
695727
- `#335` tracker
696728
- `#336` agent profile/run/event foundation
697-
- `#337` tool registry, policy evaluator, and first bounded template
729+
- `#337` tool registry, policy evaluator, and first bounded template (delivered)
698730
- `#339` knowledge document + SQLite FTS foundation
699731
- Reuse-before-duplicate anchors for this later wave: `#75`, `#77`, `#98`, `#100`, `#216`, `#218`, `#219`, `#328`
700732

0 commit comments

Comments
 (0)