Problem
Near-misses (50%+ tests passing) are the highest-leverage improvement targets, but the sentinel report only shows pass/fail counts. Need detailed per-assertion analysis for near-miss tasks.
Acceptance Criteria
Hints
- ctrf.json at
task_dir/verifier/ctrf.json has per-test results
- Near-miss tasks identified by
classify_task() returning "near_miss"
Generated by /forge cascade
Problem
Near-misses (50%+ tests passing) are the highest-leverage improvement targets, but the sentinel report only shows pass/fail counts. Need detailed per-assertion analysis for near-miss tasks.
Acceptance Criteria
Hints
task_dir/verifier/ctrf.jsonhas per-test resultsclassify_task()returning "near_miss"Generated by /forge cascade