Your code review has blind spots. Use more eyes.
Six specialists ask six different questions about your code, cross-examine each other's answers, and deliver one verdict. Inspired by Karpathy's LLM Council — same principle (independent perspectives, cross-examination, synthesis), adapted for specialist code review with cross-model diversity (Claude Opus + OpenAI Codex).
## Hydra Verdict: auth-middleware-refactor
**Solid refactor with one critical gap in token refresh handling.**
The middleware correctly centralizes auth checks, but the refresh token
flow has a race condition under concurrent requests — Cassandra and
Sentinel (cross-model consensus) both flagged this independently.
Mies identified two abstraction layers that can be collapsed.
**Top Actions:**
1. Add mutex around token refresh in auth/middleware.ts:47-62
2. Remove SessionValidatorFactory — inline the 3-line check (auth/validators.ts)
3. Add integration test for concurrent refresh scenario
**Key Tensions:**
- Navigator vs Mies on separating auth/authz modules (Stranger sided
with Mies — cross-model). Ruling: keep combined until second consumer exists.
Full report: .hydra/reports/hydra-20260331T144523-auth-middleware-refactor.md
**Best for:** Architecture decisions, security-critical code, refactoring tradeoffs, pre-merge deep reviews.

**Just ask Claude for:** Syntax fixes, factual lookups, code generation, style questions.
```shell
# Install
git clone https://github.com/Zandereins/hydra.git ~/.claude/skills/hydra

# Review
hydra this: [paste code or describe decision]

# Fix issues, then iterate
hydra iterate
```

Hydra asks for cost confirmation before running. It auto-detects Codex and falls back to Opus-only if unavailable. Iterations default to Lite mode (~$1.50-2) and show a delta: what's fixed, what remains, what's new.
Requirements: Claude Code (required) | Codex CLI plugin (optional — enables cross-model analysis)
Each advisor asks a fundamentally different question. Four run on Claude Opus, two on OpenAI Codex — different model, different blind spots. When Opus and Codex independently agree, that's the strongest signal. When they disagree, that's the highest-value finding.
| # | Name | Model | Core Question |
|---|---|---|---|
| 1 | Cassandra | Opus | "How does this break at 3am?" — compound failures, unguarded assumptions |
| 2 | Mies | Opus | "What can be deleted?" — dead code, over-engineering |
| 3 | Navigator | Opus | "What depends on what?" — coupling, boundary violations |
| 4 | The Stranger | Codex | "Can a stranger understand this in 15 min?" — naming, cognitive load |
| 5 | Volta | Opus | "What does this cost at 10x load?" — N+1 queries, invisible costs |
| 6 | Sentinel | Codex | "How do I break this on purpose?" — auth gaps, injection, race conditions |
Advisors run in parallel, then 5 peer reviewers cross-examine their work (3 Opus + 2 Codex), then a chairman (Opus) synthesizes the final verdict.
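The three-stage flow can be sketched in TypeScript. This is an illustrative orchestration sketch only, not Hydra's actual internals: the advisor names come from the table above, while the function names and reviewer labels are assumptions.

```typescript
// Illustrative sketch of Hydra's three-stage pipeline. Advisor names are from
// the docs; runAdvisor, hydraReview, and the reviewer labels are hypothetical.
type Finding = { advisor: string; notes: string };

// Stand-in for a real model call (Opus or Codex in Hydra).
async function runAdvisor(name: string, code: string): Promise<Finding> {
  return { advisor: name, notes: `${name} reviewed ${code.length} chars` };
}

async function hydraReview(code: string): Promise<string> {
  // Stage 1: all six advisors run in parallel.
  const advisors = ["Cassandra", "Mies", "Navigator", "The Stranger", "Volta", "Sentinel"];
  const findings = await Promise.all(advisors.map((a) => runAdvisor(a, code)));

  // Stage 2: five peer reviewers (3 Opus + 2 Codex) cross-examine the findings.
  const reviewers = ["reviewer-1", "reviewer-2", "reviewer-3", "reviewer-4", "reviewer-5"];
  const critiques = await Promise.all(
    reviewers.map(async (r) => `${r}: cross-examined ${findings.length} findings`)
  );

  // Stage 3: a single chairman synthesizes the final verdict.
  return `verdict from ${findings.length} findings and ${critiques.length} critiques`;
}
```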
| Mode | Agents | Est. Cost |
|---|---|---|
| Full (default) | 12 (6 advisors + 5 reviewers + chairman) | ~$3-5 |
| `--no-review` | 7 (6 advisors + chairman) | ~$2 |
| `--no-codex` | 10 (all Opus) | ~$4 |
| `--mode lite` | 4 (Cassandra + Mies + Navigator + chairman) | ~$1.50-2 |
Flags combine: `--no-review --no-codex` = 7 agents. `--mode lite` implies both.
`--transcript` saves raw agent outputs separately.
Costs are for API calls to Claude and Codex — charged to your own accounts. Hydra always shows the estimate and asks before running.
Hydra reviews aren't one-shot. Fix the issues, then run `hydra iterate` to verify:
## Hydra Delta: auth-middleware-refactor
**Progress: 2/3 previous actions addressed**
**Fixed:** Mutex added around token refresh. SessionValidatorFactory removed.
**Remaining:** Integration test for concurrent refresh not yet added.
**New Issues:** None.
**Next Step:** Add test in auth/__tests__/refresh.test.ts
Iterations auto-detect the last report, diff only what changed, and default to Lite mode. Run as many cycles as needed — each one costs ~$1.50-2.
Triggers: `hydra iterate`, `hydra re-review`, `hydra follow-up`, `check my fixes`.
In full mode, your code is sent to both Anthropic (Claude Opus) and OpenAI
(Codex GPT-5.4). Use --no-codex to keep everything Anthropic-only. Hydra shows
which providers receive your code and asks for confirmation before any agents run.
Without the Codex plugin, Hydra runs all 6 advisors on Opus (10 agents). You still get all perspectives — just without cross-model diversity.
MIT