Skip to content

fix(eval): ground LLM judge with command reference to prevent false negatives #342

fix(eval): ground LLM judge with command reference to prevent false negatives

fix(eval): ground LLM judge with command reference to prevent false negatives #342

Triggered via pull request April 10, 2026 11:14
@BYKBYK
synchronize #712
Status Success
Total duration 8s
Artifacts

eval-skill-fork.yml

on: pull_request_target
Reset eval labels
4s
Reset eval labels
Run skill eval
0s
Run skill eval
Fit to window
Zoom out
Zoom in