Skip to content

Releases: thinkwright/agent-evals

v0.3.0

17 Feb 02:53

Choose a tag to compare

Changelog

  • bc25644 Add recursive directory scanning and content-hash deduplication

v0.2.0

16 Feb 21:53

Choose a tag to compare

Changelog

  • f0d39ae Add 429 retry with exponential backoff for LLM providers
  • 920aad3 Add Works With section listing supported coding agents
  • a5793d5 Add goroutine panic recovery in probe runner
  • 0364e3c Add pluggable domain definitions via config
  • 2280bf1 Add tests for panic recovery and domain summary formatting
  • 30a092b Configure GoReleaser to use HOMEBREW_TAP_GITHUB_TOKEN for formula push
  • 2428b15 Fix domain count in README (19 → 18 built-in)
  • 51be5e3 Log warnings to stderr when agent files are skipped
  • ffaa8da Merge quality fixes and pluggable domain definitions

v0.1.0

14 Feb 21:58

Choose a tag to compare

Changelog