Update README and docs for HN launch#557
Merged
waleedkadous merged 1 commit intomainfrom Feb 25, 2026
Merged
Conversation
- Replace R1 comparison data (92-95 vs 12-15) with R4 results (7.0 vs 5.8 on 1-10 scale, consistent +1.2 across 4 rounds) - Add production metrics table from Feb 2026 development analysis: 106 PRs in 14 days, 57min median, 85% autonomous, 20 bugs caught - Fix inaccurate VIBE description (claimed "working code" when R1 produced 0% functionality) - Fix duplicate Quick Start entry in table of contents - Link hero blockquote to both case study and production data sections
Contributor
Author
Architect ReviewLow risk — README only, +43/-35. Clean update:
Approved. Architect review |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Closes #556
Summary
Linked docs reviewed (no changes needed)
All 5 linked docs were reviewed for currency:
docs/faq.md— current, accurate comparison of Codev concepts vs Claude Code conceptsdocs/tips.md— current, all CLI examples and tips are accuratedocs/why.md— historical article about Codev's origin; model references (Opus 4.1) are accurate for when the experiment was runcodev/resources/cheatsheet.md— current, protocol/role/tool tables accuratecodev/resources/commands/overview.md— current, all command summaries accurateKey decisions
R4 over R1 for primary comparison: R1 was dramatic (VIBE scored 0-15 on everything) but R4 is more credible for an HN audience since both approaches produced working code. R4's +1.2 delta is more defensible than R1's catastrophic VIBE failure.
Production data in hero: The development analysis numbers (106 PRs, 57min median) are more impressive to a technical audience than controlled experiment scores. Led with these.
Kept linked docs unchanged: All linked docs are current.
docs/why.mdis a historical article — updating model names would misrepresent the original experiment.Test plan
codev/resources/development-analysis-2026-02-17.md