Skip to content

feat(skills): add prompt review skill (PromptSentinel v1.2)#1806

Merged
bmadcode merged 1 commit intobmad-code-org:mainfrom
alexeyv:feat/prompt-review-skill
Mar 4, 2026
Merged

feat(skills): add prompt review skill (PromptSentinel v1.2)#1806
bmadcode merged 1 commit intobmad-code-org:mainfrom
alexeyv:feat/prompt-review-skill

Conversation

@alexeyv
Copy link
Contributor

@alexeyv alexeyv commented Mar 2, 2026

Summary

  • Adds bmad-os-review-prompt skill that reviews LLM workflow step prompts for known failure modes
  • PromptSentinel v1.2: 17-item failure mode catalog (silent ignoring, negation fragility, scope creep, etc.)
  • Three parallel review tracks: adversarial sub-agent, catalog scan + execution simulation, and path tracing
  • Structured output format with severity scoring and concrete mitigations

Test plan

  • Run skill against a sample BMAD workflow prompt and verify output format
  • Confirm sub-agent spawning works for Track A and Track C
  • Validate all 17 failure modes are checked in Track B
  • Test INPUT_NOT_A_PROMPT guard with non-prompt input

PromptSentinel v1.2 - reviews LLM workflow step prompts for known
failure modes including silent ignoring, negation fragility, scope
creep, and 14 other catalog items. Uses parallel review tracks
(adversarial, catalog scan, path tracing) with structured output.
@alexeyv alexeyv marked this pull request as ready for review March 2, 2026 18:08
@github-actions
Copy link
Contributor

github-actions bot commented Mar 2, 2026

@coderabbitai review

@alexeyv alexeyv requested a review from bmadcode March 2, 2026 18:08
@augmentcode
Copy link

augmentcode bot commented Mar 2, 2026

🤖 Augment PR Summary

Summary: Adds a new Claude skill (bmad-os-review-prompt) to audit LLM workflow-step prompts using the PromptSentinel v1.2 failure-mode framework.

Changes:

  • Introduces .claude/skills/bmad-os-review-prompt/SKILL.md with a full, self-contained prompt-auditing procedure.
  • Defines a 4-step workflow: input validation, dependency/context inventory, three parallel review tracks, then merge + synthesis.
  • Implements three analysis tracks: adversarial sub-agent review, catalog scan + execution simulation, and a path-tracing sub-agent.
  • Adds a 17-item failure-mode catalog covering common prompt reliability risks (ignoring, negation fragility, scope creep, missing success criteria, etc.).
  • Provides a strict Markdown output template with severity counts, quoted locations, risk framing, and rewritten mitigations.

Technical Notes: The skill is aimed at production-scale prompt execution across heterogeneous frontier models and emphasizes deterministic, mergeable findings with concrete rewrites.

🤖 Was this summary useful? React with 👍 or 👎

Copy link

@augmentcode augmentcode bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Review completed. 5 suggestions posted.

Fix All in Augment

Comment augment review to trigger a new review at any time.

@bmadcode bmadcode merged commit 9536e1e into bmad-code-org:main Mar 4, 2026
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants