Skip to content

Actions: anomalyco/opencode-bench

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
455 workflow runs
455 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

feat: add binary scoring for evals
.github/workflows/publish-benchmark.yml #24: Pull request #1 opened by tmickleydoyle
researching llm judges and binary scores
.github/workflows/publish-benchmark.yml #22: Commit a2f70eb pushed by Aslemammad
19m 52s main
Benchmark GLM-4.6
.github/workflows/publish-benchmark.yml #21: Commit 7393690 pushed by fwang
7m 45s frank
claude too
.github/workflows/publish-benchmark.yml #20: Commit b24f196 pushed by Aslemammad
16m 26s main
update
.github/workflows/publish-benchmark.yml #19: Commit 6128fca pushed by Aslemammad
15m 35s main
update
.github/workflows/publish-benchmark.yml #18: Commit e4df372 pushed by Aslemammad
38m 4s main
update
.github/workflows/publish-benchmark.yml #17: Commit 4cdc0ed pushed by Aslemammad
25m 2s main
discord messages
.github/workflows/publish-benchmark.yml #16: Commit 5ea2bbd pushed by Aslemammad
14m 58s main
update
.github/workflows/publish-benchmark.yml #15: Commit 6374afa pushed by Aslemammad
17m 55s main
handle list of commits properly
.github/workflows/publish-benchmark.yml #14: Commit 44e464b pushed by Aslemammad
1m 58s main
vague planning instead of exact and direct robotic instructions
.github/workflows/publish-benchmark.yml #13: Commit 329fcd1 pushed by Aslemammad
12m 52s main
better checks score
.github/workflows/publish-benchmark.yml #12: Commit cc6e19f pushed by Aslemammad
14s main
fix ci
.github/workflows/publish-benchmark.yml #11: Commit a9789c3 pushed by Aslemammad
9m 9s main
fix ci
.github/workflows/publish-benchmark.yml #10: Commit a3741c0 pushed by Aslemammad
4m 50s main
stable results
.github/workflows/publish-benchmark.yml #9: Commit f83c2cf pushed by Aslemammad
33s main
some serious changes
.github/workflows/publish-benchmark.yml #8: Commit fb9fc59 pushed by Aslemammad
25m 14s main
update
.github/workflows/publish-benchmark.yml #7: Commit 29ef094 pushed by Aslemammad
6m 13s main
log env vars to see what's going on
.github/workflows/publish-benchmark.yml #6: Commit aadacb0 pushed by Aslemammad
32s main
log env vars to see what's going on
.github/workflows/publish-benchmark.yml #5: Commit 7f1ca30 pushed by Aslemammad
1m 1s main
log the logs to debug the opencode error
.github/workflows/publish-benchmark.yml #4: Commit 3273061 pushed by Aslemammad
45s main
paralleization
.github/workflows/publish-benchmark.yml #3: Commit 9755229 pushed by Aslemammad
57s main
update ci
.github/workflows/publish-benchmark.yml #2: Commit 8b7e08c pushed by Aslemammad
56s main
update ci
.github/workflows/publish-benchmark.yml #1: Commit a6f6f08 pushed by Aslemammad
59s main
ProTip! You can narrow down the results and go further in time using created:<2025-10-11 or the other filters available.