Skip to content

Commit cf7d5aa

Browse files
authored
Fix dark mode text visibility and remove inaccurate blog section (#53)
Fix nearly invisible muted text in dark mode by changing --ao-text-muted from #3a3a37 to #9a9a95. This affects the homepage eyebrow label, stat labels, and decorative rules. Also remove the "Budget Alternatives" paragraph which incorrectly claimed every benchmark has a 10-100x cheaper alternative (GPQA does not).
1 parent c4ea457 commit cf7d5aa

2 files changed

Lines changed: 1 addition & 5 deletions

File tree

docs/blog/posts/technical-deep-dive.md

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -181,10 +181,6 @@ We validated AgentOpt across four diverse benchmarks using 9 models on Amazon Be
181181

182182
Arm Elimination is the consistent winner: it achieves near-brute-force accuracy across all four benchmarks while using 40-60% less budget. LM Proposal (asking GPT-4.1 to predict the best combo) matches brute force on GPQA (where the answer is intuitive) but collapses to 34% on HotpotQA and 45% on BFCL. It can't predict that Ministral outperforms Opus as a planner.
183183

184-
### Budget Alternatives
185-
186-
As shown in the cost-savings table above, for every benchmark there exists a combination within 3-5% of the best accuracy that costs 10-100x less. You don't need the most expensive model to get near-optimal results.
187-
188184
## Get Started
189185

190186
```bash

docs/stylesheets/extra.css

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@
1010
--ao-border-hover: rgba(255, 255, 255, 0.13);
1111
--ao-text: #edede9;
1212
--ao-text-secondary: #9a9a95;
13-
--ao-text-muted: #3a3a37;
13+
--ao-text-muted: #9a9a95;
1414
--ao-accent: #9f98f0;
1515
--ao-accent-dim: rgba(159, 152, 240, 0.15);
1616
--ao-teal: #5dcaa5;

0 commit comments

Comments
 (0)