RAG Experiments

Controlled experiments answering: When do RAG optimizations actually help?

Experiments

#	Question	Status
1	Does chunk SIZE matter more than chunk STRATEGY?	Done
2	Do complex queries need complex retrieval?	Planned
3	Can embedding + chunking be optimized independently?	Planned
4	Does scale change optimal strategy?	Planned

Finding: Chunk SIZE matters more than chunk STRATEGY.

Strategy	Requested size	Actual size	Recall
token	1024	934	75%
recursive	1024	667	68%
sentence	1024	3677	98%

Sentence chunking ignores the configured chunk size: it produces chunks about 3.6x larger than requested (3677 vs. 1024).
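
The gap between requested and actual size can be checked with simple arithmetic. The following is a minimal sketch in Python that uses only the figures from the results table; the names `results` and `size_ratio` are illustrative and are not taken from the code in chunking/.

```python
# Requested size, actual size, and recall, copied from the results table.
# The dictionary layout and names are illustrative, not the repository's own.
results = {
    "token": (1024, 934, 0.75),
    "recursive": (1024, 667, 0.68),
    "sentence": (1024, 3677, 0.98),
}


def size_ratio(requested: int, actual: int) -> float:
    """Return the actual chunk size as a multiple of the requested size."""
    return actual / requested


# Ratio per strategy: above 1 means larger than requested, below 1 means smaller.
ratios = {name: size_ratio(req, act) for name, (req, act, _) in results.items()}

for name, (req, act, recall) in results.items():
    print(
        f"{name:<10} requested={req} actual={act} "
        f"ratio={ratios[name]:.2f}x recall={recall:.0%}"
    )
```

In this data the strategy with the highest recall is also the one with the largest actual chunks, which is consistent with the finding that size matters more than strategy.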

See chunking/ for code and raw data.

uv sync
export OPENAI_API_KEY=sk-...

./run.sh chunking --list        # See experiments
./run.sh chunking --analyze     # Run analysis

License: MIT
