RUN_PERCENTAGE sets the percentage of tests to run during the benchmark. A value of 100 runs all tests.
<!-- CONFIG_START -->
RUN_PERCENTAGE: 100
SHARED_PROMPT: "Provide production-ready and maintainable JavaScript code. Apply code golfing practices but don't put everything in a single line. No comments. Your code will execute in the browser."
<!-- CONFIG_END -->
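As a rough illustration of how a block like the one above could be read, here is a minimal Python sketch. It is not the benchmark's actual loader: the helper names (`marker`, `extract_block`, `parse_config`, `select_tests`) are hypothetical, and the rule that RUN_PERCENTAGE keeps the first N tests, rounded up, is an assumption. The marker strings are built at runtime so that this example never contains a literal start or end marker.

```python
import math
import re


def marker(name: str, edge: str) -> str:
    """Build a marker comment such as the CONFIG start marker (hypothetical helper)."""
    return f"<!-- {name}_{edge} -->"


def extract_block(text: str, name: str) -> list[str]:
    """Return the non-empty lines between the START and END markers for `name`."""
    pattern = re.escape(marker(name, "START")) + r"(.*?)" + re.escape(marker(name, "END"))
    match = re.search(pattern, text, re.DOTALL)
    if match is None:
        raise ValueError(f"no {name} block found")
    return [line.strip() for line in match.group(1).splitlines() if line.strip()]


def parse_config(text: str) -> dict[str, str]:
    """Parse KEY: value lines; surrounding double quotes on a value are dropped."""
    config = {}
    for line in extract_block(text, "CONFIG"):
        key, sep, value = line.partition(":")  # split on the first colon only
        if not sep:
            continue  # skip lines that are not KEY: value pairs
        value = value.strip()
        if len(value) >= 2 and value[0] == value[-1] == '"':
            value = value[1:-1]
        config[key.strip()] = value
    return config


def select_tests(tests: list, run_percentage: int) -> list:
    """Assumption: keep the first N tests, with N rounded up."""
    count = math.ceil(len(tests) * run_percentage / 100)
    return tests[:count]


# A small stand-in for this file's config block.
sample = "\n".join([
    marker("CONFIG", "START"),
    "RUN_PERCENTAGE: 50",
    'SHARED_PROMPT: "No comments."',
    marker("CONFIG", "END"),
])
config = parse_config(sample)
subset = select_tests(list(range(10)), int(config["RUN_PERCENTAGE"]))
```

Values are kept as strings and converted by the caller, so unknown keys pass through untouched.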
The following models are included in the benchmark run.
<!-- MODELS_START -->
x-ai/grok-4.20-beta
openai/gpt-5.4
google/gemini-3.1-flash-lite-preview
openai/gpt-5.3-codex
google/gemini-3.1-pro-preview
anthropic/claude-sonnet-4.6
minimax/minimax-m2.5
z-ai/glm-5
anthropic/claude-opus-4.6
moonshotai/kimi-k2.5
z-ai/glm-4.7
google/gemini-3-flash-preview TEMP:0.35
deepseek/deepseek-v3.2
google/gemini-3-pro-preview TEMP:0.35
anthropic/claude-opus-4.5 TEMP:0.7
x-ai/grok-4
anthropic/claude-sonnet-4.5 TEMP:0.7
<!-- MODELS_END -->
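The model list above could be parsed along the following lines. This is a sketch under stated assumptions, not the benchmark's own code: `ModelEntry` and `parse_model_line` are hypothetical names, and reading the `TEMP:` suffix as a per-model sampling temperature (with no suffix meaning "use the provider default") is an interpretation of the entries, not something this file states.

```python
from typing import NamedTuple, Optional


class ModelEntry(NamedTuple):
    """One line of the model list (hypothetical structure)."""
    model_id: str
    temperature: Optional[float]  # None when the line has no TEMP: option


def parse_model_line(line: str) -> ModelEntry:
    """Split 'provider/model TEMP:0.35' into an id and an optional temperature."""
    parts = line.split()
    if not parts:
        raise ValueError("empty model line")
    temperature = None
    for option in parts[1:]:
        key, _, value = option.partition(":")
        if key == "TEMP":  # assumed meaning: sampling temperature
            temperature = float(value)
    return ModelEntry(parts[0], temperature)


def parse_model_lines(lines: list[str]) -> list[ModelEntry]:
    """Parse every non-blank line, preserving the order in which models are listed."""
    return [parse_model_line(line) for line in lines if line.strip()]


entries = parse_model_lines([
    "x-ai/grok-4",
    "anthropic/claude-opus-4.5 TEMP:0.7",
    "google/gemini-3-flash-preview TEMP:0.35",
])
```

Options other than `TEMP` are ignored here rather than rejected, on the assumption that the list format might grow.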