docs: Add comprehensive documentation and semantic router example by yurekami · Pull Request #229 · ovg-project/kvcached

yurekami · 2025-12-27T17:04:44Z

Summary

This PR addresses multiple documentation and feature requests to improve the kvcached user experience:

Documentation (docs/)

API.md - Comprehensive API reference including:
- Environment variables documentation
- CLI tools reference (kvctl commands)
- Python API with code examples for vLLM and SGLang
- Integration APIs and manual integration guide
- Controller REST API endpoints
COMPATIBILITY.md - Version compatibility guide:
- PyTorch version support matrix (2.4.x - 2.8.x)
- vLLM version compatibility (0.8.4 - 0.11.x)
- SGLang version support
- Known issues (PyTorch 2.8.0 undefined symbol error)
- GPU architecture support table
- Container/Kubernetes compatibility notes
ARCHITECTURE.md - System architecture documentation:
- System overview with ASCII diagrams
- Engine decoupling explanation
- IPC mechanism details
- Memory lifecycle (allocation/deallocation flow)
- Configuration guidance for multi-engine setups

Examples (examples/)

09_semantic_router/ - Content-based request routing:
- FastAPI-based semantic router
- Sentence-transformers integration for query classification
- Fallback keyword matching when embeddings unavailable
- Statistics and monitoring endpoints

Bug Fixes

Improved error messages in ElasticBlockPool.get_new_blocks():
- Shows available vs requested blocks
- Displays current memory usage percentage
- Provides actionable suggestions

Issues Addressed

Closes Any documentation? #48 (API documentation)
Closes Question About kvcached Ability to Dynamically Recognize and Utilize Kubernetes Elastic Scaled GPU Memory Resources #87 (Kubernetes integration notes)
Closes [TODO] Add an example of vLLM semantics router? #91 (Semantic router example)
Closes Question about decoupling the engines and models #117 (Engine decoupling documentation)
Closes ValueError: Cannot get 31 free blocks from the pool #197 (Better error messages for block allocation)
Closes undefined symbol: _ZNK2at10TensorBase4nameEv when using torch2.8.0 #222 (PyTorch version compatibility docs)

Test Plan

Verify docs render correctly on GitHub
Run semantic router example with test models
Verify error message improvements in block allocation

🤖 Generated with Claude Code

This PR addresses multiple documentation and feature requests: - Add docs/API.md with comprehensive API reference (ovg-project#48) - Environment variables documentation - CLI tools reference (kvctl) - Python API with code examples - Integration APIs for vLLM and SGLang - Controller REST API endpoints - Add docs/COMPATIBILITY.md for version compatibility (ovg-project#222) - PyTorch version support (2.4.x - 2.8.x) - vLLM version matrix (0.8.4 - 0.11.x) - SGLang version support - Known issues including PyTorch 2.8.0 undefined symbol error - GPU architecture support table - Container/Kubernetes compatibility notes (ovg-project#87) - Add docs/ARCHITECTURE.md for system architecture (ovg-project#117) - System overview diagram - Engine decoupling explanation - IPC mechanism details - Memory lifecycle documentation - Configuration guidance for decoupled operation - Add examples/09_semantic_router/ for content-based routing (ovg-project#91) - FastAPI-based semantic router - Sentence-transformers integration for query classification - Fallback keyword matching - Statistics and monitoring endpoints - Improve error messages in ElasticBlockPool (ovg-project#197) - Show available vs requested blocks - Display current usage percentage - Provide actionable suggestions Closes ovg-project#48, ovg-project#87, ovg-project#91, ovg-project#117, ovg-project#197, ovg-project#222 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: Add comprehensive documentation and semantic router example#229

docs: Add comprehensive documentation and semantic router example#229
yurekami wants to merge 1 commit intoovg-project:mainfrom
yurekami:feat/multiple-improvements-batch

yurekami commented Dec 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

yurekami commented Dec 27, 2025

Summary

Documentation (docs/)

Examples (examples/)

Bug Fixes

Issues Addressed

Test Plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant