feat: cross-run regression detection

## Problem
When a task that previously solved starts failing, there's no automatic alert. Regressions hide in the noise.

## Acceptance Criteria
- [ ] Compare current run results against bench_history.json
- [ ] Flag tasks that solved in previous run but failed in current run
- [ ] Add "Regressions" section to sentinel report with task names and what changed

## Hints
- `evolve/history.py:detect_trends()` already has improving/regressing logic
- Wire into `sentinel_analyze.py` after the per-task breakdown

---
*Generated by /forge cascade*

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: cross-run regression detection #3

Problem

Acceptance Criteria

Hints

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

feat: cross-run regression detection #3

Description

Problem

Acceptance Criteria

Hints

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions