
Concurrent prefix caching support for vLLM #274

Open
qinganrice wants to merge 1 commit into ovg-project:main from qinganrice:apc

Conversation

@qinganrice (Contributor)

This PR is step 1 of vLLM Prefix Caching support: prefix caching for concurrent requests.
All of the implementation lives in a patch file; KVCached and the vLLM core design are left untouched, which keeps the change compatible with other serving engines.
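For context, here is a rough sketch of the idea behind block-level prefix caching. This is not the PR's patch code: it is a minimal illustration, and every name in it (`PrefixBlockCache`, `BLOCK_SIZE`, `map_prompt`) is hypothetical rather than taken from vLLM or KVCached.

```python
import hashlib

BLOCK_SIZE = 16  # tokens per KV-cache block (assumed value, for illustration)


class PrefixBlockCache:
    """Map a hash of a token prefix to an allocated KV block ID, so that
    concurrent requests sharing a prompt prefix reuse the same blocks."""

    def __init__(self) -> None:
        self._blocks: dict[str, int] = {}    # prefix hash -> block id
        self._refcount: dict[int, int] = {}  # block id -> active readers
        self._next_id = 0

    @staticmethod
    def _hash(prefix_tokens: tuple) -> str:
        # Hash the *entire* prefix up to and including this block, so two
        # blocks only match when everything before them matches as well.
        return hashlib.sha256(repr(prefix_tokens).encode()).hexdigest()

    def get_or_allocate(self, prefix_tokens: tuple) -> tuple:
        """Return (block_id, cache_hit) for one full block of tokens."""
        key = self._hash(prefix_tokens)
        if key in self._blocks:
            block_id = self._blocks[key]
            self._refcount[block_id] += 1
            return block_id, True   # hit: skip recomputing this block's KV
        block_id = self._next_id
        self._next_id += 1
        self._blocks[key] = block_id
        self._refcount[block_id] = 1
        return block_id, False      # miss: this block's KV must be computed

    def map_prompt(self, prompt: list) -> list:
        """Map every full block of a prompt to a (block_id, hit) pair."""
        return [
            self.get_or_allocate(tuple(prompt[:end]))
            for end in range(BLOCK_SIZE, len(prompt) + 1, BLOCK_SIZE)
        ]


cache = PrefixBlockCache()
shared = list(range(32))  # a 32-token system prompt shared by both requests
req_a = cache.map_prompt(shared + [100, 101] * 8)
req_b = cache.map_prompt(shared + [200, 201] * 8)
# Both requests hit on the two shared prefix blocks, then diverge.
assert [hit for _, hit in req_b] == [True, True, False]
```

Because the change is confined to a patch file, the real implementation presumably hooks or wraps the relevant vLLM allocator paths at import time rather than editing them in place; the sketch above only illustrates the caching behavior itself.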

@qinganrice changed the title from "Concurrent prefix caching" to "Concurrent prefix caching support for vLLM" on Mar 18, 2026
@cui36 self-requested a review on March 18, 2026 at 19:24