Skip to content

Commit 737cf41

Browse files
dgerogclaude
andcommitted
Update FSLF docs: 10 per judge call, budget-capped at 15, auto-trains per experiment
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent 1f52e42 commit 737cf41

1 file changed

Lines changed: 2 additions & 2 deletions

File tree

docs/docs/concepts/llm-as-judge.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -29,10 +29,10 @@ The judge considers multiple dimensions when evaluating a response:
2929

3030
## Few-Shot Learning (FSLF)
3131

32-
The judge improves over time through the **Few-Shot Learning Framework**. Human-annotated examples (quality rank 1--2) and high-confidence auto-labeled examples (quality rank 3, >=85% confidence) are used to calibrate the judge for your specific agent. Up to 15 few-shot examples are included per judge call.
32+
The judge improves over time through the **Few-Shot Learning Framework**. Human-annotated examples (quality rank 1--2) and high-confidence auto-labeled examples (quality rank 3, >=85% confidence) are used to calibrate the judge for your specific agent. Up to 10 few-shot examples are included per judge call, budget-capped at 15 per project with PASS/FAIL balanced selection.
3333

3434
!!! info "Enable FSLF"
35-
Set the `few_shot_framework_enabled` flag on your project via the platform dashboard or API. The framework auto-trains nightly at 02:00 UTC.
35+
Set the `few_shot_framework_enabled` flag on your project via the platform dashboard or API. The framework auto-trains after each adversarial experiment completes and runs a nightly sweep at 02:00 UTC.
3636

3737
## Choosing a Judge Provider
3838

0 commit comments

Comments
 (0)