Update blog post (#45)

Wenyueh · Wenyueh · web-flow · commit dde0146641d9 · 2026-03-23T11:32:05.000-04:00
* Update blog post:

* Correct model versions in technical deep dive

Updated model references in the discussion on LLM routing systems.

---------

Co-authored-by: Wenyueh &lt;norahua1996@outlook.com&gt;
diff --git a/docs/blog/posts/technical-deep-dive.md b/docs/blog/posts/technical-deep-dive.md
@@ -19,7 +19,7 @@ categories:
 
 *\* Equal contribution*
 
-Most teams pick a model, usually the latest frontier release, and run every step of their agent on it. Planner? GPT-4o. Solver? GPT-4o. Critic? GPT-4o. It works, so nobody questions it.
+Most teams pick a model, usually the latest frontier release, and run every step of their agent on it. Planner? GPT-5.4. Solver? GPT-5.4. Critic? GPT-5.4. It works, so nobody questions it.
 
 But "it works" is not "it's optimal." What if the same accuracy costs 20x less with a different combination? What if a *weaker* model actually performs *better* at one of those steps? These aren't hypotheticals. We ran the experiments.
 
@@ -67,7 +67,7 @@ These are real numbers from real benchmarks. Same accuracy band, 20-100x cost di
 
 ## Agent Routing Is Not LLM Routing
 
-If you've seen LLM routing systems (the ones that pick GPT-4 for hard questions and GPT-3.5 for easy ones), you might think: "Can't I just do that for each step of my agent?"
+If you've seen LLM routing systems (the ones that pick GPT-5.4 for hard questions and GPT-4o for easy ones), you might think: "Can't I just do that for each step of my agent?"
 
 No. And here's why.