Add parallel adversarial review to catch GPD errors#89
Open
madeleinesong wants to merge 4 commits intomainfrom
Open
Add parallel adversarial review to catch GPD errors#89madeleinesong wants to merge 4 commits intomainfrom
madeleinesong wants to merge 4 commits intomainfrom
Conversation
Introduces gpd-parallel-reviewer, an independent review agent that runs alongside the primary six-stage peer-review panel to catch errors arising from single-agent self-consistency bias. Includes: - Agent spec (gpd-parallel-reviewer.md) with two-phase process: independent manuscript analysis followed by cross-panel divergence detection - Adversarial review protocol reference defining information barriers, verification strategies, and divergence classification - Standalone parallel-review workflow for direct invocation ENG-454
Updates the peer-review workflow to support --parallel flag: - Launches parallel reviewer concurrently with Stages 1-5 - Runs divergence analysis after Stages 1-5 complete - Feeds parallel review and divergence report to Stage 6 referee - Adds reconciliation guardrail to recommendation process - Updates summary output and success criteria Updates gpd-referee to reconcile material divergences from parallel review before issuing final recommendation. Updates peer-review-panel reference to document parallel review integration, additional artifacts, and usage guidance. ENG-454
Adds gpd-parallel-reviewer to MODEL_PROFILES, AGENT_DEFAULT_TIERS, and the model-profiles reference table. Uses tier-1 for deep-theory, numerical, review, and paper-writing profiles (adversarial review requires strongest reasoning). Updates agent count from 23 to 24. ENG-454
9c3c042 to
03e4741
Compare
The 3 new files added by this PR (gpd-parallel-reviewer agent, parallel-review workflow, adversarial-review-protocol reference) were not reflected in repo_graph_contract.json, the agent count assertion in test_config.py, or the commit matrix in agent-infrastructure.md. Regenerated the contract via sync script and fixed all stale counts.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Implements ENG-454: Adds a parallel adversarial review capability to create tension between the primary peer-review panel and an independent reviewer, preventing single-agent self-consistency bias from letting errors slip through.
gpd-parallel-reviewerruns independently alongside the 6-stage panel, using alternative derivation paths, assumption stress-testing, counter-narrative construction, and LLM error catalog cross-referencingparallel-review.mdfor standalone invocation; integrated intopeer-review.mdvia--parallelflagadversarial-review-protocol.mddefines information barriers, verification strategies, divergence classification, and reconciliation rulesgpd-refereenow reconciles material divergences from the parallel reviewer before issuing final recommendationsPARALLEL-REVIEW.json(independent checks) andDIVERGENCE-REPORT.json(cross-panel comparison)Key Design Decisions
--parallelflag) since it adds an extra agent passTest plan
/gpd:peer-review --parallelon a test manuscript to verify end-to-end flow🤖 Generated with Claude Code