add opus 4.5 to claude code and update the sdks to the latest version #173

Triggered via push November 24, 2025 19:40
Status Failure
Total duration 2h 26m 3s
Artifacts 74
Matrix: run-benchmarks / benchmark
run-benchmarks / Prepare Judge Analysis Matrix (3s)
Matrix: run-benchmarks / eval-analysis
run-benchmarks / notify (13s)
Annotations

1 error, 202 warnings, and 1 notice
run-benchmarks / notify
Process completed with exit code 1.
publish
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0tbQkaQAAAACwskhN+VdlT5SFgqU56aRhUEFPRURHRTA2MDcARWRnZQ==
publish
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-sonnet-4-5 / HelixDB/helix-db@ac6d036..651aef3
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>077UkaQAAAABTIhHjlm6oSbLK8BL7cqIRUEhMMzBFREdFMDIxMABFZGdl
run-benchmarks / Benchmark claude-code / claude-sonnet-4-5 / HelixDB/helix-db@ac6d036..651aef3
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / HelixDB/helix-db@ac6d036..651aef3
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0FrYkaQAAAABVZC0G0MZ7RZ8l3OVqhzNIUEFPRURHRTA2MTcARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / HelixDB/helix-db@ac6d036..651aef3
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/claude-sonnet-4-5 / HelixDB/helix-db@ac6d036..651aef3
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0FrYkaQAAAADd6eIHQdtFQqx5x+qgd6qpUEFPRURHRTA2MTgARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/claude-sonnet-4-5 / HelixDB/helix-db@ac6d036..651aef3
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / HelixDB/helix-db@ac6d036..651aef3
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0QLYkaQAAAAA5+mO+rbibT6yUOW035aKsRE0yRURHRTAxMDgARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / HelixDB/helix-db@ac6d036..651aef3
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark codex / gpt-5.1-codex / HelixDB/helix-db@35a9e73..12ebac2
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0I7ckaQAAAACNbMAuWhFsSauKkCyghd1HREVOMzAxMDAwMTAyMDI3AEVkZ2U=
run-benchmarks / Benchmark codex / gpt-5.1-codex / HelixDB/helix-db@35a9e73..12ebac2
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark codex / gpt-5-codex / HelixDB/helix-db@ac6d036..651aef3
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0N7ckaQAAAABxt28n06OpTbaPWu7Xm0FAREVOMzAxMDAwMTA1MDUzAEVkZ2U=
run-benchmarks / Benchmark codex / gpt-5-codex / HelixDB/helix-db@ac6d036..651aef3
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / HelixDB/helix-db@35a9e73..12ebac2
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0YrckaQAAAABVCoOvnYG2Sq8vE/NX3m3uUEFPRURHRTA2MTgARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / HelixDB/helix-db@35a9e73..12ebac2
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0abckaQAAAAAeSMQfdmCsRYuRX7uC2uwFUEhMMzBFREdFMDIxOQBFZGdl
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-sonnet-4-5 / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0arckaQAAAAAm7yYwavyWRbkbDjnk0oFwUEhYMzFFREdFMDUxOABFZGdl
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0hrckaQAAAAAwLet1oXlhSZNnZKf2pUELUEFPRURHRTA2MjAARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/claude-sonnet-4-5 / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0wLckaQAAAACOmIfeddSBRrpLhUIVgsPjQ0hHRURHRTE3MjIARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/gpt-5-codex / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0LLgkaQAAAAAyYE6KzJv8Q5O3Lsc8r962RE0yRURHRTAxMDkARWRnZQ==
run-benchmarks / Benchmark codex / gpt-5-codex / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0TbgkaQAAAAAudt6rKEfETYrMfgNelDrlUEFPRURHRTA2MTcARWRnZQ==
run-benchmarks / Benchmark codex / gpt-5-codex / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gpt-5.1-codex / getsentry/sentry@e7968da..6ceee3b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0kLgkaQAAAAAVJBKcV1D0TLDLlUbZEfZaUEhMMzBFREdFMDQxNwBFZGdl
run-benchmarks / Benchmark opencode / opencode/gpt-5.1-codex / getsentry/sentry@e7968da..6ceee3b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark codex / gpt-5.1-codex / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0mLgkaQAAAAAwT1E6mE5kQYVrNtJKflLSUEFPRURHRTA1MDkARWRnZQ==
run-benchmarks / Benchmark codex / gpt-5.1-codex / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark codex / gpt-5.1-codex / HelixDB/helix-db@ac6d036..651aef3
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0D7kkaQAAAAApsTijSiloR7Xh7cL7MU9wUEhYMzFFREdFMDUyMABFZGdl
run-benchmarks / Benchmark codex / gpt-5.1-codex / HelixDB/helix-db@ac6d036..651aef3
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gpt-5.1-codex / HelixDB/helix-db@ac6d036..651aef3
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>04bkkaQAAAADXCd0TI+azR5chQOZ4UumFUEhMMzBFREdFMDExMQBFZGdl
run-benchmarks / Benchmark opencode / opencode/gpt-5.1-codex / HelixDB/helix-db@ac6d036..651aef3
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/claude-sonnet-4-5 / getsentry/sentry@62c4c65..b950f76
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>08bkkaQAAAABd1b+TYjN4SKBq1DYQkFHPQ0hHRURHRTE5MDgARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/claude-sonnet-4-5 / getsentry/sentry@62c4c65..b950f76
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gpt-5.1-codex / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0hLokaQAAAABeGZY8K+ROTa8GjQmFoK8aUEFPRURHRTA1MTAARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/gpt-5-codex / HelixDB/helix-db@ac6d036..651aef3
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0mLokaQAAAADnaaWOUJYoQZYT3bm029P4Q0hHRURHRTE3MjAARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/gpt-5-codex / HelixDB/helix-db@ac6d036..651aef3
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / HelixDB/helix-db@35a9e73..12ebac2
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0urokaQAAAABiGSweNC5PRqApi5jODftDUEhMMzBFREdFMDExMQBFZGdl
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / HelixDB/helix-db@35a9e73..12ebac2
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gpt-5-codex / HelixDB/helix-db@35a9e73..12ebac2
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0yLokaQAAAADedcAaufeCS62yAk+wJivyUEhYMzFFREdFMDIwOQBFZGdl
run-benchmarks / Benchmark opencode / opencode/gpt-5-codex / HelixDB/helix-db@35a9e73..12ebac2
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gpt-5.1-codex / getsentry/sentry@62c4c65..b950f76
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>05bokaQAAAAA/2aqdGerBQJlVd1gSPTa6UEhMMzBFREdFMDExNwBFZGdl
run-benchmarks / Benchmark opencode / opencode/gpt-5.1-codex / getsentry/sentry@62c4c65..b950f76
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / sst/opencode@090d27d..b3c6d0b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0LrskaQAAAACPXDpMBNewT7rE6tpPtob5RE0yRURHRTA0MjEARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / sst/opencode@090d27d..b3c6d0b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / HelixDB/helix-db@ac6d036..651aef3
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0VrskaQAAAACE+HNLkNoxQo/67y902e8NUEhMMzBFREdFMDEyMgBFZGdl
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / HelixDB/helix-db@ac6d036..651aef3
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark codex / gpt-5-codex / getsentry/sentry@62c4c65..b950f76
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0XrskaQAAAACDRus4eV/MTJ3YnQdW+WhyRE0yRURHRTAxMTIARWRnZQ==
run-benchmarks / Benchmark codex / gpt-5-codex / getsentry/sentry@62c4c65..b950f76
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / HelixDB/helix-db@35a9e73..12ebac2
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0cLskaQAAAAAZCvXU+fgNTLiTY3XTej76RE0yRURHRTA3MTAARWRnZQ==
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / HelixDB/helix-db@35a9e73..12ebac2
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/claude-sonnet-4-5 / HelixDB/helix-db@35a9e73..12ebac2
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0v7skaQAAAACaNYOb/ujnS6ymRMqxRvwEREVOMzAxMDAwMTAyMDA5AEVkZ2U=
run-benchmarks / Benchmark opencode / opencode/claude-sonnet-4-5 / HelixDB/helix-db@35a9e73..12ebac2
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / getsentry/sentry@e7968da..6ceee3b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0LrwkaQAAAADTrUCQAKjTSZBfK8zHWBpdUEhMMzBFREdFMDIwNwBFZGdl
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / getsentry/sentry@e7968da..6ceee3b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-sonnet-4-5 / sst/opencode@5f7e1e0..a96365f
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0qbwkaQAAAADHcXBkcXRTQb6veWm2w6LPUEhYMzFFREdFMDIyMQBFZGdl
run-benchmarks / Benchmark claude-code / claude-sonnet-4-5 / sst/opencode@5f7e1e0..a96365f
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / getsentry/sentry@62c4c65..b950f76
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0rLwkaQAAAABS5uClMGb/TI/Jfh+PQrC8UEhMMzBFREdFMDIxMQBFZGdl
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / getsentry/sentry@62c4c65..b950f76
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / getsentry/sentry@e7968da..6ceee3b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0vLwkaQAAAAACKnoQQH1jRrk8BgiBg0vAUEhMMzBFREdFMDIxOABFZGdl
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / getsentry/sentry@e7968da..6ceee3b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / sst/opencode@5f7e1e0..a96365f
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0Q70kaQAAAAD/ZBiIIHJRQ6Np81ucnn3UUEhMMzBFREdFMDEwOABFZGdl
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / sst/opencode@5f7e1e0..a96365f
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / getsentry/sentry@e7968da..6ceee3b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0Rb0kaQAAAAC0b2vsODFaTb7MUjQqX6NOUEFPRURHRTA1MTcARWRnZQ==
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / getsentry/sentry@e7968da..6ceee3b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark codex / gpt-5-codex / getsentry/sentry@e7968da..6ceee3b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0eb0kaQAAAAASgbv7S3vfTJG334TQkIXbQ0hHRURHRTE5MTIARWRnZQ==
run-benchmarks / Benchmark codex / gpt-5-codex / getsentry/sentry@e7968da..6ceee3b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gpt-5-codex / getsentry/sentry@62c4c65..b950f76
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0y70kaQAAAABBReSLpSquTIkBNFP3Vd71UEhMMzBFREdFMDQyMABFZGdl
run-benchmarks / Benchmark opencode / opencode/gpt-5-codex / getsentry/sentry@62c4c65..b950f76
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0LL4kaQAAAABOlZ98HqkSRqiW2vY7AuwSUEhMMzBFREdFMDEwOABFZGdl
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / DataDog/datadog-lambda-python@93d4a07..d776378
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/grok-code / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0ML4kaQAAAABe8Jr+19x0Q5K1/qJfMY3kUEhMMzBFREdFMDIxOABFZGdl
run-benchmarks / Benchmark opencode / opencode/grok-code / DataDog/datadog-lambda-python@93d4a07..d776378
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/grok-code / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / getsentry/sentry@62c4c65..b950f76
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0OL4kaQAAAABzUZJN3/pqQJpbqG8Ts6/pQ0hHRURHRTE4MDUARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / getsentry/sentry@62c4c65..b950f76
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / getsentry/sentry@e7968da..6ceee3b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0Ob4kaQAAAAAJJrhZEwwtR6cU1fgNZ4cmUEFPRURHRTA1MjAARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / getsentry/sentry@e7968da..6ceee3b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / HelixDB/helix-db@ac6d036..651aef3
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0QL4kaQAAAADE+YB8R1SsTLVvUlD0qSYgUEhYMzFFREdFMDIyMQBFZGdl
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / HelixDB/helix-db@ac6d036..651aef3
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / HelixDB/helix-db@ac6d036..651aef3
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gpt-5-codex / sst/opencode@5f7e1e0..a96365f
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0Rb4kaQAAAABJ39qAVmIKQo8DQdwWm1qmUEhYMzFFREdFMDUwNwBFZGdl
run-benchmarks / Benchmark opencode / opencode/gpt-5-codex / sst/opencode@5f7e1e0..a96365f
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0hr4kaQAAAAA2hn+n9JroT48u9HV7bH/SREVOMzAxMDAwMTAyMDI3AEVkZ2U=
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / DataDog/datadog-lambda-python@93d4a07..d776378
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/claude-sonnet-4-5 / sst/opencode@090d27d..b3c6d0b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0nr4kaQAAAAAm0J96PoVAQZ8Mg+P2Vk03UEhYMzFFREdFMDYxNgBFZGdl
run-benchmarks / Benchmark opencode / opencode/claude-sonnet-4-5 / sst/opencode@090d27d..b3c6d0b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark codex / gpt-5.1-codex / sst/opencode@5f7e1e0..a96365f
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0xr4kaQAAAAABJgOQdbdMS4PLEMX5OcOmUEhMMzBFREdFMDIxMQBFZGdl
run-benchmarks / Benchmark codex / gpt-5.1-codex / sst/opencode@5f7e1e0..a96365f
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark codex / gpt-5-codex / sst/opencode@090d27d..b3c6d0b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>06r4kaQAAAADrpv48cjW9QJ+zzscZIP6KUEFPRURHRTA2MTgARWRnZQ==
run-benchmarks / Benchmark codex / gpt-5-codex / sst/opencode@090d27d..b3c6d0b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark codex / gpt-5-codex / sst/opencode@5f7e1e0..a96365f
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0U78kaQAAAAD/5pq7elkOSoRToNPyzofTUEhYMzFFREdFMDYxOQBFZGdl
run-benchmarks / Benchmark codex / gpt-5-codex / sst/opencode@5f7e1e0..a96365f
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / sst/opencode@090d27d..b3c6d0b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0Yr8kaQAAAABUr1YEt8BiTKzfPXmGJCJzQ0hJMzBFREdFMDEyMQBFZGdl
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / sst/opencode@090d27d..b3c6d0b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-sonnet-4-5 / HelixDB/helix-db@35a9e73..12ebac2
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0ar8kaQAAAADDUEsIw6bQQKNrOaoP5sApUEhMMzBFREdFMDIyMgBFZGdl
run-benchmarks / Benchmark claude-code / claude-sonnet-4-5 / HelixDB/helix-db@35a9e73..12ebac2
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / getsentry/sentry@62c4c65..b950f76
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0d78kaQAAAAAku/anpgjwSpEHnO6DMabeUEFPRURHRTA1MjAARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / getsentry/sentry@62c4c65..b950f76
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / getsentry/sentry@62c4c65..b950f76
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark codex / gpt-5-codex / HelixDB/helix-db@35a9e73..12ebac2
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0f78kaQAAAACQJwfCjlFEQ72e1iCV6aFlQ0hHRURHRTE4MTMARWRnZQ==
run-benchmarks / Benchmark codex / gpt-5-codex / HelixDB/helix-db@35a9e73..12ebac2
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-sonnet-4-5 / sst/opencode@090d27d..b3c6d0b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0hL8kaQAAAABNQr4WW9nvQ70YLFpvrFi4UEhMMzBFREdFMDIyMQBFZGdl
run-benchmarks / Benchmark claude-code / claude-sonnet-4-5 / sst/opencode@090d27d..b3c6d0b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / getsentry/sentry@62c4c65..b950f76
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0nr8kaQAAAACJWOPFifr0TqQdNsUD1jOCREVOMzAxMDAwMTAyMDE3AEVkZ2U=
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / getsentry/sentry@62c4c65..b950f76
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / getsentry/sentry@62c4c65..b950f76
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / sst/opencode@5f7e1e0..a96365f
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0uL8kaQAAAABOMjyng3flRINzJTAyJQINUEFPRURHRTA2MTAARWRnZQ==
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / sst/opencode@5f7e1e0..a96365f
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / HelixDB/helix-db@ac6d036..651aef3
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0x78kaQAAAADqd/pzXSyKR61fJRQYjhZnUEhYMzFFREdFMDYxMQBFZGdl
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / HelixDB/helix-db@ac6d036..651aef3
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / HelixDB/helix-db@ac6d036..651aef3
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark codex / gpt-5.1-codex / getsentry/sentry@e7968da..6ceee3b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0O8AkaQAAAADhM9amZnH8RZQ2OeA+OagpUEhMMzBFREdFMDIwNwBFZGdl
run-benchmarks / Benchmark codex / gpt-5.1-codex / getsentry/sentry@e7968da..6ceee3b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/grok-code / HelixDB/helix-db@ac6d036..651aef3
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0mcAkaQAAAABELFl91o4gSbqEvZOaW0fHUEhMMzBFREdFMDIyMgBFZGdl
run-benchmarks / Benchmark opencode / opencode/grok-code / HelixDB/helix-db@ac6d036..651aef3
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/grok-code / HelixDB/helix-db@ac6d036..651aef3
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gpt-5-codex / sst/opencode@090d27d..b3c6d0b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0zMAkaQAAAAAWvzfR1Un4QKlbZLjTMvgXUEFPRURHRTA2MjAARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/gpt-5-codex / sst/opencode@090d27d..b3c6d0b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / sst/opencode@5f7e1e0..a96365f
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0AsEkaQAAAADu9BAZuryNQb+60e1VAK5EUEhMMzBFREdFMDIyMgBFZGdl
run-benchmarks / Benchmark opencode / opencode/kimi-k2 / sst/opencode@5f7e1e0..a96365f
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark codex / gpt-5.1-codex / sst/opencode@090d27d..b3c6d0b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0BMEkaQAAAACMCKV1rdzPRYpUhJ5ZN8b0UEhMMzBFREdFMDQxOQBFZGdl
run-benchmarks / Benchmark codex / gpt-5.1-codex / sst/opencode@090d27d..b3c6d0b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-sonnet-4-5 / getsentry/sentry@62c4c65..b950f76
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0IMEkaQAAAAAXbIKH+TzvSqxrFKfaSF3cUEhYMzFFREdFMDYwOABFZGdl
run-benchmarks / Benchmark claude-code / claude-sonnet-4-5 / getsentry/sentry@62c4c65..b950f76
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gpt-5-codex / getsentry/sentry@e7968da..6ceee3b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0M8EkaQAAAACRX9pSDTR5RahW7nZIqcykUEhMMzBFREdFMDEwOQBFZGdl
run-benchmarks / Benchmark opencode / opencode/gpt-5-codex / getsentry/sentry@e7968da..6ceee3b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / getsentry/sentry@e7968da..6ceee3b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0TcEkaQAAAAARBJSmsZR3RIU9Bw3d3ECfUEhMMzBFREdFMDQxOQBFZGdl
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / getsentry/sentry@e7968da..6ceee3b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / getsentry/sentry@62c4c65..b950f76
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0XMEkaQAAAADjGsentKKWSadbwnfCkz2UQ0hJMzBFREdFMDIwOABFZGdl
run-benchmarks / Benchmark claude-code / claude-opus-4-5 / getsentry/sentry@62c4c65..b950f76
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gpt-5.1-codex / sst/opencode@5f7e1e0..a96365f
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0acEkaQAAAAB7R3ZFiKhuRLdIVURVfmF/UEhYMzFFREdFMDIwNgBFZGdl
run-benchmarks / Benchmark opencode / opencode/gpt-5.1-codex / sst/opencode@5f7e1e0..a96365f
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / DataDog/datadog-lambda-python@93d4a07..d776378
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0xsEkaQAAAADe2JeanPpaTaM1VOOLb6SwRE0yRURHRTA3MjEARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / DataDog/datadog-lambda-python@93d4a07..d776378
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / HelixDB/helix-db@35a9e73..12ebac2
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>058EkaQAAAACBoI5yEn+tTacBbWT+tn/DUEhMMzBFREdFMDIxNwBFZGdl
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / HelixDB/helix-db@35a9e73..12ebac2
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / HelixDB/helix-db@35a9e73..12ebac2
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / sst/opencode@090d27d..b3c6d0b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0HsIkaQAAAADYdMlY1U0sSbEVq3nMAOh9Q0hHRURHRTE5MjEARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / sst/opencode@090d27d..b3c6d0b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gpt-5.1-codex / sst/opencode@090d27d..b3c6d0b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0S8IkaQAAAADXACHajcWZQL7Cd+6Ol+LlQ0hHRURHRTE5MTgARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/gpt-5.1-codex / sst/opencode@090d27d..b3c6d0b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/claude-sonnet-4-5 / sst/opencode@5f7e1e0..a96365f
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0msIkaQAAAADFB2WLQ6wFRoIxhJbbYJUyUEhMMzBFREdFMDExOABFZGdl
run-benchmarks / Benchmark opencode / opencode/claude-sonnet-4-5 / sst/opencode@5f7e1e0..a96365f
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/claude-sonnet-4-5 / getsentry/sentry@e7968da..6ceee3b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0rcIkaQAAAACiThk+GCTIR4n05Qtmu9gpQ0hJMzBFREdFMDQxNwBFZGdl
run-benchmarks / Benchmark opencode / opencode/claude-sonnet-4-5 / getsentry/sentry@e7968da..6ceee3b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / sst/opencode@5f7e1e0..a96365f
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0vMIkaQAAAADMaYO9LMnWSIULN6YoBE6MUEhMMzBFREdFMDEwOQBFZGdl
run-benchmarks / Benchmark opencode / opencode/glm-4.6 / sst/opencode@5f7e1e0..a96365f
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark claude-code / claude-sonnet-4-5 / getsentry/sentry@e7968da..6ceee3b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0w8IkaQAAAADrFOpZriiNTIxAiXHsQI42UEhMMzBFREdFMDQwNwBFZGdl
run-benchmarks / Benchmark claude-code / claude-sonnet-4-5 / getsentry/sentry@e7968da..6ceee3b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark codex / gpt-5.1-codex / getsentry/sentry@62c4c65..b950f76
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>02MIkaQAAAADtgaItOoWvQ6M02y8q/kHzQ0hJMzBFREdFMDQyMQBFZGdl
run-benchmarks / Benchmark codex / gpt-5.1-codex / getsentry/sentry@62c4c65..b950f76
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / sst/opencode@090d27d..b3c6d0b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0PMMkaQAAAAAnkix3TF5CRaZXfjvB8jOzUEFPRURHRTA1MDcARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / sst/opencode@090d27d..b3c6d0b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gpt-5.1-codex / HelixDB/helix-db@35a9e73..12ebac2
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0XcMkaQAAAABR0nzQ5IhlTrKzLu2tMGtXUEhMMzBFREdFMDIyMQBFZGdl
run-benchmarks / Benchmark opencode / opencode/gpt-5.1-codex / HelixDB/helix-db@35a9e73..12ebac2
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / sst/opencode@5f7e1e0..a96365f
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0TMQkaQAAAADK6Ggj53HJSL5qdR7ns4ZlRE0yRURHRTA1MTIARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/gemini-3-pro / sst/opencode@5f7e1e0..a96365f
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/grok-code / HelixDB/helix-db@35a9e73..12ebac2
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0acQkaQAAAADlpcZjybF0TbumaSSQ3c4EUEFPRURHRTA1MTcARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/grok-code / HelixDB/helix-db@35a9e73..12ebac2
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/grok-code / HelixDB/helix-db@35a9e73..12ebac2
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/grok-code / getsentry/sentry@62c4c65..b950f76
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0OcYkaQAAAABcHGxYdzrrSZrzq8m6sE7cUEhMMzBFREdFMDIyMgBFZGdl
run-benchmarks / Benchmark opencode / opencode/grok-code / getsentry/sentry@62c4c65..b950f76
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/grok-code / getsentry/sentry@62c4c65..b950f76
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / sst/opencode@090d27d..b3c6d0b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0tMckaQAAAADK86GHopK8Sb+xcMXapeBNUEFPRURHRTA1MDgARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / sst/opencode@090d27d..b3c6d0b
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / sst/opencode@090d27d..b3c6d0b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/grok-code / sst/opencode@5f7e1e0..a96365f
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0acgkaQAAAADQ6GEygrsLTYL1jkzdboBlUEhMMzBFREdFMDQxMgBFZGdl
run-benchmarks / Benchmark opencode / opencode/grok-code / sst/opencode@5f7e1e0..a96365f
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/grok-code / sst/opencode@5f7e1e0..a96365f
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / HelixDB/helix-db@35a9e73..12ebac2
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>02sgkaQAAAAApB40doNU3TL00mmsDJX2KQ0hHRURHRTE5MTkARWRnZQ==
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / HelixDB/helix-db@35a9e73..12ebac2
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/qwen3-coder / HelixDB/helix-db@35a9e73..12ebac2
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/grok-code / sst/opencode@090d27d..b3c6d0b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0uMkkaQAAAACQhIrGLZsySIzZuZXIZsnDUEhYMzFFREdFMDYwNwBFZGdl
run-benchmarks / Benchmark opencode / opencode/grok-code / sst/opencode@090d27d..b3c6d0b
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/grok-code / sst/opencode@090d27d..b3c6d0b
Failed to restore: Cache service responded with 400
run-benchmarks / Benchmark opencode / opencode/grok-code / getsentry/sentry@e7968da..6ceee3b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>05MkkaQAAAABLi/ct/IzyTLWP9kXtFb7aUEhMMzBFREdFMDQwOQBFZGdl
run-benchmarks / Benchmark opencode / opencode/grok-code / getsentry/sentry@e7968da..6ceee3b
No files were found with the provided path: benchmark.json. No artifacts will be uploaded.
run-benchmarks / Benchmark opencode / opencode/grok-code / getsentry/sentry@e7968da..6ceee3b
Failed to restore: Cache service responded with 400
run-benchmarks / Judge Analysis - sst/opencode@090d27d..b3c6d0b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0ctYkaQAAAACuGxx/y7ukSpGxHhVAZQ5MUEFPRURHRTA2MTcARWRnZQ==
run-benchmarks / Judge Analysis - sst/opencode@090d27d..b3c6d0b
Failed to restore: Cache service responded with 400
run-benchmarks / Judge Analysis - HelixDB/helix-db@35a9e73..12ebac2
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0fdYkaQAAAABEGCjtsXj9TLRs1BeJLyQcUEFPRURHRTA1MjIARWRnZQ==
run-benchmarks / Judge Analysis - HelixDB/helix-db@35a9e73..12ebac2
Failed to restore: Cache service responded with 400
run-benchmarks / Judge Analysis - sst/opencode@5f7e1e0..a96365f
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0hNYkaQAAAAD6oVqyuEKvTYCA0BTrhvLWQ0hJMzBFREdFMDEyMABFZGdl
run-benchmarks / Judge Analysis - sst/opencode@5f7e1e0..a96365f
Failed to restore: Cache service responded with 400
run-benchmarks / Judge Analysis - HelixDB/helix-db@ac6d036..651aef3
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0s9YkaQAAAAAzeuI9NkdkTKdfMFGmkfAyUEFPRURHRTA2MDcARWRnZQ==
run-benchmarks / Judge Analysis - HelixDB/helix-db@ac6d036..651aef3
Failed to restore: Cache service responded with 400
run-benchmarks / Judge Analysis - getsentry/sentry@62c4c65..b950f76
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0t9YkaQAAAAAZOp9brqF/TIIYFvxuGxHjUEhMMzBFREdFMDIxMgBFZGdl
run-benchmarks / Judge Analysis - getsentry/sentry@62c4c65..b950f76
Failed to restore: Cache service responded with 400
run-benchmarks / Judge Analysis - getsentry/sentry@e7968da..6ceee3b
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0vNYkaQAAAADO8Tvx9tTQSYOS7a7O7a3+UEhMMzBFREdFMDIwNwBFZGdl
run-benchmarks / Judge Analysis - getsentry/sentry@e7968da..6ceee3b
Failed to restore: Cache service responded with 400
run-benchmarks / Judge Analysis - DataDog/datadog-lambda-python@93d4a07..d776378
Failed to save: <h2>Our services aren't available right now</h2><p>We're working to restore all services as soon as possible. Please check back soon.</p>0w9YkaQAAAACy0ozA8zHJSKpsQWEcJ1QkUEFPRURHRTA2MjAARWRnZQ==
run-benchmarks / Judge Analysis - DataDog/datadog-lambda-python@93d4a07..d776378
Failed to restore: Cache service responded with 400
run-benchmarks / notify
Failed to restore: Cache service responded with 400
publish
{ "workflowData": { "owner": "sst", "repo": "opencode-bench", "sha": "bfbf907770954ce6d3a28c1c090185bfc66e819c", "ref": "main" }, "key": "dHHsXmiZ5x", "runId": 19646986944, "webhookDebug": { "action": "requested", "head_branch": "main", "head_repository_full_name": "sst/opencode-bench", "full_name": "sst/opencode-bench", "isPullRequest": false, "prNumber": null, "prNumberType": "object", "isNewPullRequest": false, "isOldPullRequest": false, "prKey": "sst/opencode-bench:main", "oldPrDataHash": "lPirWwto44", "lookupKey": "lPirWwto44", "data": { "owner": "sst", "repo": "opencode-bench", "sha": "bfbf907770954ce6d3a28c1c090185bfc66e819c", "ref": "main" } } }

Artifacts

Produced during runtime
Name | Size | Digest
analysis-DataDog-datadog-lambda-python@93d4a07..d776378 | 3.22 KB | sha256:a49aed12a7d874e1caa739c87b3085a3d19eb36555f7557726d41bf618c93eb2
analysis-HelixDB-helix-db@35a9e73..12ebac2 | 3.5 KB | sha256:dd366bbb7f2c467fd26bbcd45c2430a5a00ae83c8beed6280ebc528340e465e1
analysis-HelixDB-helix-db@ac6d036..651aef3 | 3.29 KB | sha256:8068eb0f8e9c5682f63495dac03e299fb0fd42299783d7157ee0ef0e9667cad2
analysis-getsentry-sentry@62c4c65..b950f76 | 3 KB | sha256:e1b593376499e0de37715c23cdf48ac1ed5ee0efc105c962540f2c86b95014fb
analysis-getsentry-sentry@e7968da..6ceee3b | 3.76 KB | sha256:f2054b39a83684e551f5e1f582a6647c20e29132183db07dbc3f7105b9aad440
analysis-sst-opencode@090d27d..b3c6d0b | 3.83 KB | sha256:f6784ba9390238945b73bbd245cab4e9d65d73bc9478bac4d4956723b44e157f
analysis-sst-opencode@5f7e1e0..a96365f | 3.33 KB | sha256:b89f70a8e486b1790a965a9d95fa7442f92164c98fffaaa71797bf0f6da39718
benchmark-claude-code-claude-opus-4-5-DataDog-datadog-lambda-python@93d4a07..d776378 | 26.7 KB | sha256:da795f02c8bd70a68ff8a326f8c682562296a051c47ab561ac4ca4d33b69fdb1
benchmark-claude-code-claude-opus-4-5-HelixDB-helix-db@35a9e73..12ebac2 | 24.7 KB | sha256:36ce21b29ff4c2014c3267c24279bac8577ae1161df24f393b68ffb208eb1ab2
benchmark-claude-code-claude-opus-4-5-HelixDB-helix-db@ac6d036..651aef3 | 18.1 KB | sha256:d10fb8186e08b4c486502cc20daa39066be41f3ed2f947f03a413be087f7d9e5
benchmark-claude-code-claude-opus-4-5-getsentry-sentry@62c4c65..b950f76 | 27.6 KB | sha256:28255353448c91c2d844b2204c7f28be0b061cdf59a403f62b762275e28ac738
benchmark-claude-code-claude-opus-4-5-getsentry-sentry@e7968da..6ceee3b | 19.7 KB | sha256:d8f6c82421f66243694fd45e0f66884a262108412eed933c80dc487108870b46
benchmark-claude-code-claude-opus-4-5-sst-opencode@090d27d..b3c6d0b | 10.2 KB | sha256:2e149f27d0de7f4fc54984f697e8b8a0b845f4bc875314399f5d9224fad533a6
benchmark-claude-code-claude-opus-4-5-sst-opencode@5f7e1e0..a96365f | 9.74 KB | sha256:02beac057c2ccab407d4dc763285a33415d48b4750ea3fec6a56b3c8a6c9cbff
benchmark-claude-code-claude-sonnet-4-5-DataDog-datadog-lambda-python@93d4a07..d776378 | 29.4 KB | sha256:3ba8ee1bb47df4c97d75fe3b952c79cf0d73e6aa78f6203ca6873c2d96d95725
benchmark-claude-code-claude-sonnet-4-5-HelixDB-helix-db@35a9e73..12ebac2 | 24 KB | sha256:895c21c1110fe41320d754fae70646c79bab722bf862a1b9a3a8a04f3b57e065
benchmark-claude-code-claude-sonnet-4-5-HelixDB-helix-db@ac6d036..651aef3 | 22.8 KB | sha256:a38bd9498e1416d40144c9f7394f12acadce4aa3f31c054527d69cb1fefb3d5e
benchmark-claude-code-claude-sonnet-4-5-getsentry-sentry@62c4c65..b950f76 | 24.2 KB | sha256:38a8ecc58960c19844f2d6ee999cc790f4549c7ccc972d76b11615c58a869e44
benchmark-claude-code-claude-sonnet-4-5-getsentry-sentry@e7968da..6ceee3b | 22.1 KB | sha256:541eabc1fe8995e09d5d8865edd6741af01eb9d8b08155cc8b54e0378a56e840
benchmark-claude-code-claude-sonnet-4-5-sst-opencode@090d27d..b3c6d0b | 1.87 KB | sha256:7a48f3d0255398451b2c530f35c9ab4ff0bc658fa004d86cf34b1cd23eb2daf0
benchmark-claude-code-claude-sonnet-4-5-sst-opencode@5f7e1e0..a96365f | 16.1 KB | sha256:b557b9c80a783471c927473fb12ecd224a04cf45e8040f4d5ada1e0cf5d30352
benchmark-codex-gpt-5-codex-DataDog-datadog-lambda-python@93d4a07..d776378 | 30.8 KB | sha256:a2ae21b6b9b5092f785f93743de3c10b8e09a9283ab634cbb53d35ad4f038d19
benchmark-codex-gpt-5-codex-HelixDB-helix-db@35a9e73..12ebac2 | 26.6 KB | sha256:7ec152fafc0b90b27faf60bc5b4880952138c3def4b1b7f57af8dec0a6df9196
benchmark-codex-gpt-5-codex-HelixDB-helix-db@ac6d036..651aef3 | 26.5 KB | sha256:9281afd9c143876d997a0ce9b4789254a9570a54c7f06ab34edc01eb07ded5b6
benchmark-codex-gpt-5-codex-getsentry-sentry@62c4c65..b950f76 | 25.7 KB | sha256:582ed0a924ed454f25dde4bf1ac69b39d2a99c81c02f61e9b77573950513e13c
benchmark-codex-gpt-5-codex-getsentry-sentry@e7968da..6ceee3b | 20.4 KB | sha256:d9fd255fa15be3c2a15e1215701020b4f665b2fc6ca3f5f6e82c7b7ddb6418d9
benchmark-codex-gpt-5-codex-sst-opencode@090d27d..b3c6d0b | 9.31 KB | sha256:575e89c4a110128418b39154b9606febb3fbad3576f44468f959218cd6b23abd
benchmark-codex-gpt-5-codex-sst-opencode@5f7e1e0..a96365f | 9.23 KB | sha256:274e3a389dff51d7a7963dd1e9e4399ead5809c65af9711d0c16073d209008ff
benchmark-codex-gpt-5.1-codex-DataDog-datadog-lambda-python@93d4a07..d776378 | 29.7 KB | sha256:63a1eb9be1aa6cb2e32e7241cf3f6ca60a56378772f7818b22fc62c60fd9030a
benchmark-codex-gpt-5.1-codex-HelixDB-helix-db@35a9e73..12ebac2 | 19.7 KB | sha256:14cb328ec47cb518c112f16ad3c6b40b91692308ac40a0ff777e069ba7a76f52
benchmark-codex-gpt-5.1-codex-HelixDB-helix-db@ac6d036..651aef3 | 28.2 KB | sha256:09153338a8fe007c39b8a2baa12ce77d7604bd05009be90bb6dc199fc82dcd2f
benchmark-codex-gpt-5.1-codex-getsentry-sentry@62c4c65..b950f76 | 26.2 KB | sha256:1e0e7b91864b90c3ef0aac3fb64d1105477db260890f42884d06f618b532bff9
benchmark-codex-gpt-5.1-codex-getsentry-sentry@e7968da..6ceee3b | 22.2 KB | sha256:8e2cd0d5ec6704517332ae3ecf2551657ffa4edd7b5cc317d7675199d203201a
benchmark-codex-gpt-5.1-codex-sst-opencode@090d27d..b3c6d0b | 5.94 KB | sha256:e6a0a5c153d2ffcc01e476336f592d767ebea84b0c83837c7b8a85ea84712e7e
benchmark-codex-gpt-5.1-codex-sst-opencode@5f7e1e0..a96365f | 13 KB | sha256:4fb6e3c89a8e32d6415b693555323851c21fb6b656d8f29357d0bd51c1605614
benchmark-opencode-opencode-claude-sonnet-4-5-DataDog-datadog-lambda-python@93d4a07..d776378 | 31.3 KB | sha256:fe82017589f6a9649311cd4949b50339e9a0c9db0eaa6fa6150865dafa96ff83
benchmark-opencode-opencode-claude-sonnet-4-5-HelixDB-helix-db@35a9e73..12ebac2 | 18.5 KB | sha256:e2579b8f457fd94a17934b468cbee25c467f0fd3ce81fdd8cafb7829913eccd2
benchmark-opencode-opencode-claude-sonnet-4-5-HelixDB-helix-db@ac6d036..651aef3 | 24.5 KB | sha256:b6ea995850cb1adcd0f13bb88e70d144fccaa9370731d34b012aad2c6a9f16cd
benchmark-opencode-opencode-claude-sonnet-4-5-getsentry-sentry@62c4c65..b950f76 | 24.5 KB | sha256:db8578d5e5c57142338f77fe32136cd08cb313d5473516ae03531af55488246f
benchmark-opencode-opencode-claude-sonnet-4-5-getsentry-sentry@e7968da..6ceee3b | 22.2 KB | sha256:44111a139112b6f36ea32acdde0b967d87a3eaabc830b4529f65272b6cfb2a32
benchmark-opencode-opencode-claude-sonnet-4-5-sst-opencode@090d27d..b3c6d0b | 11 KB | sha256:a4fbc2a564e9fb95b01b8149bf08847e1cf892f96a246b73bda226f2009d87ba
benchmark-opencode-opencode-claude-sonnet-4-5-sst-opencode@5f7e1e0..a96365f | 13 KB | sha256:c9ae927dc2968157fa3e0efa62dc7f25a918e87fdb7d2c0c33b8ffd46b3e8702
benchmark-opencode-opencode-gemini-3-pro-getsentry-sentry@e7968da..6ceee3b | 21.2 KB | sha256:61df023ca54fac186e1b6e4defc691dc0f73013f42eee2cf2ccec469332969d2
benchmark-opencode-opencode-gemini-3-pro-sst-opencode@090d27d..b3c6d0b | 10.8 KB | sha256:b64211eaefcffb4efba3eb9778589365df739840e30239a9b4b3d03cdd2461d2
benchmark-opencode-opencode-gemini-3-pro-sst-opencode@5f7e1e0..a96365f | 13.8 KB | sha256:62845e88f813ca28b29486afb4cb094c73f76d6e791e9e667abc6e2f4200556a
benchmark-opencode-opencode-glm-4.6-HelixDB-helix-db@35a9e73..12ebac2 | 23.9 KB | sha256:16840806f1a6258ea20057dd224e306d130961cc436e8a3f8a74102e51a8c3bd
benchmark-opencode-opencode-glm-4.6-HelixDB-helix-db@ac6d036..651aef3 | 24.2 KB | sha256:7ed41f7d6fe703025c4d710047bc5f4d104ef4424b1827525f7115d02181df8a
benchmark-opencode-opencode-glm-4.6-getsentry-sentry@62c4c65..b950f76 | 29.9 KB | sha256:8c4f89d006c26ce83a55bb955be1e849cd841e29e5dfcccd2abd3a642abec99b
benchmark-opencode-opencode-glm-4.6-getsentry-sentry@e7968da..6ceee3b | 23.9 KB | sha256:3b8324455c57df393bef8554859a708916d4cd986de0e1fd34b9eca6b310ec12
benchmark-opencode-opencode-glm-4.6-sst-opencode@090d27d..b3c6d0b | 6.83 KB | sha256:786ccd40a11f8c4b982bb6b843d6e8a7107c62b3d102e0ce1c8749e519a741ac
benchmark-opencode-opencode-glm-4.6-sst-opencode@5f7e1e0..a96365f | 18.5 KB | sha256:7c041a038f17b74c5720eccca81a7816458e336aff66592f010c9977a54cbe31
benchmark-opencode-opencode-gpt-5-codex-DataDog-datadog-lambda-python@93d4a07..d776378 | 31.9 KB | sha256:47938a781cdc03c711ce1e40ee85145acc828b107109726531d54b02e8f2faa7
benchmark-opencode-opencode-gpt-5-codex-HelixDB-helix-db@35a9e73..12ebac2 | 24.3 KB | sha256:29d252363bb104773fb25a569d00d9dbc4ac0c2c44d79502c55748c3aa6674c8
benchmark-opencode-opencode-gpt-5-codex-HelixDB-helix-db@ac6d036..651aef3 | 21.4 KB | sha256:9ba0dfa3681315d5943852732716db753fdeaf4a015cc865e577ee9a1a70c818
benchmark-opencode-opencode-gpt-5-codex-getsentry-sentry@62c4c65..b950f76 | 24.3 KB | sha256:11c46d13a0306c177592122ad5afe9b880bbb1874cea8f6a999fb446de630d4c
benchmark-opencode-opencode-gpt-5-codex-getsentry-sentry@e7968da..6ceee3b | 23.2 KB | sha256:21a0b285dbc75ed8b5b69c45e39b31bf82ac7a75585c2ba2a80dccffc1f1c0a4
benchmark-opencode-opencode-gpt-5-codex-sst-opencode@090d27d..b3c6d0b | 8.28 KB | sha256:ba2fb10fb746dcf89ab82faec0a5960717343f8fb68fc288bcc1b282ae16e946
benchmark-opencode-opencode-gpt-5-codex-sst-opencode@5f7e1e0..a96365f | 15.6 KB | sha256:dcc1c02f249775c8b64422c4a5f498a01f41ac637360b6ff9560c6beb1657c4b
benchmark-opencode-opencode-gpt-5.1-codex-DataDog-datadog-lambda-python@93d4a07..d776378 | 29.1 KB | sha256:2fa51a28a02837b367948f41c03bb2c9ce26f90cc92147d863b28cbce6cae2e8
benchmark-opencode-opencode-gpt-5.1-codex-HelixDB-helix-db@35a9e73..12ebac2 | 18.7 KB | sha256:4ed204e7468a7b553f47657bc5f72b90cb72a712f089c1a43850a2c20cb8f11b
benchmark-opencode-opencode-gpt-5.1-codex-HelixDB-helix-db@ac6d036..651aef3 | 24.6 KB | sha256:352d6bdbd05cf1ae25a60cbb676c9d5db9323b18f3b2b18188e1c77d9bb65b16
benchmark-opencode-opencode-gpt-5.1-codex-getsentry-sentry@62c4c65..b950f76 | 26.4 KB | sha256:826198ccb4f60fcf3f5caafaac0314995ca24fba543fe77c690a90c21081c8f6
benchmark-opencode-opencode-gpt-5.1-codex-getsentry-sentry@e7968da..6ceee3b | 22 KB | sha256:19763aae0267d8ce4f03dcc2468ff77d9661a52f98d8b7dcc0a956163dae91b7
benchmark-opencode-opencode-gpt-5.1-codex-sst-opencode@090d27d..b3c6d0b | 8.9 KB | sha256:570b76af31b80d22c7a2c043972dc1032480f87033a49db5eef39be298c19438
benchmark-opencode-opencode-gpt-5.1-codex-sst-opencode@5f7e1e0..a96365f | 12.3 KB | sha256:018e975a13ed51dd0998e4ee09be6f627af8e90401cb8514bd656b9c698b7611
benchmark-opencode-opencode-kimi-k2-HelixDB-helix-db@35a9e73..12ebac2 | 30.7 KB | sha256:d55df849c6e7afbefcf137e4a4ae40623cf66b24b2931bf5151a28eb259e8379
benchmark-opencode-opencode-kimi-k2-getsentry-sentry@e7968da..6ceee3b | 22.3 KB | sha256:914d2b6655899f9952eb0cb1fedea6c0a568b83c17bc212c8eb2b71e897df902
benchmark-opencode-opencode-kimi-k2-sst-opencode@090d27d..b3c6d0b | 1.95 KB | sha256:c799ed1ceb6743ec7702499d18f0a8bff40ab862d08250d35e5abce166d6aefb
benchmark-opencode-opencode-kimi-k2-sst-opencode@5f7e1e0..a96365f | 17.3 KB | sha256:9eee0bcf5be4df640494239ae3ee93652eb56e215e629c7590484fd27854b182
benchmark-opencode-opencode-qwen3-coder-DataDog-datadog-lambda-python@93d4a07..d776378 | 33.2 KB | sha256:1d3e1577536c3749b571a400371fb1e1d2fc1e8008199c6f0bddd38fb5dba491
benchmark-opencode-opencode-qwen3-coder-HelixDB-helix-db@ac6d036..651aef3 | 27.9 KB | sha256:135d8c2901821708a770177af985771bf70fe00d85f5594cf04eb92d22b03292
benchmark-opencode-opencode-qwen3-coder-getsentry-sentry@62c4c65..b950f76 | 26 KB | sha256:bb54d0ab91769bb37adbcfc923fc77016a6d4d20c1f00a736ddbcd7b1b03ed2d
benchmark-opencode-opencode-qwen3-coder-getsentry-sentry@e7968da..6ceee3b | 23.5 KB | sha256:08023816e1f9a7943186f3e76c51402686a2ea794e3f9d2947d2118d85da8849
benchmark-opencode-opencode-qwen3-coder-sst-opencode@5f7e1e0..a96365f | 16 KB | sha256:6cf149663bef736c81c32b2249176ab5d7d8c6d42f03a5e6ecac61fc9d1ca549