Compared to the KVGroup approach proposed in #263, this PR replaces the internal KVGroup struct with a multiton pattern: one FTensorAllocator instance per group_id, lazily created via global_allocator(group_id).
What changed
allocator.hpp: Replaced single g_allocator_ with g_allocators_ map.
allocator.cpp: global_allocator(group_id) lazily creates per-group allocators.
torch_bindings.cpp: Routes group_id to the correct allocator at the binding layer.
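The multiton described above can be sketched roughly as follows. This is a minimal illustration, not the PR's actual code: the FTensorAllocator internals are assumptions, and only the global_allocator(group_id) lookup-or-create behavior is taken from the description.

```cpp
#include <cstdint>
#include <memory>
#include <mutex>
#include <unordered_map>

// Placeholder for the real allocator; internals are assumed.
class FTensorAllocator {
 public:
  explicit FTensorAllocator(int64_t group_id) : group_id_(group_id) {}
  int64_t group_id() const { return group_id_; }

 private:
  int64_t group_id_;
};

// One allocator per group_id, created lazily on first access.
// Replaces the former single g_allocator_ with a g_allocators_ map.
FTensorAllocator& global_allocator(int64_t group_id) {
  static std::mutex mu;
  static std::unordered_map<int64_t, std::unique_ptr<FTensorAllocator>>
      g_allocators_;
  std::lock_guard<std::mutex> lock(mu);
  auto it = g_allocators_.find(group_id);
  if (it == g_allocators_.end()) {
    it = g_allocators_
             .emplace(group_id, std::make_unique<FTensorAllocator>(group_id))
             .first;
  }
  return *it->second;
}
```

Because lookup and creation happen in one place, the binding layer only needs to forward group_id; repeated calls with the same group_id return the same instance.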
Thanks for the PR! This one does look simpler. QQ: how do we set the kv cache config, such as tensor size, number of layers, etc., into the cpp extension?
The kv cache config is set when the Python side calls create_kv_tensors(size, dtype_size, dev_str, num_layers, num_kv_buffers, group_id), which is the same entry point as before. torch_bindings.cpp receives all config + group_id, and calls global_allocator(group_id) to get or lazily create the right allocator instance.
Then in allocator.cpp, create_kv_tensors() stores the config into its own members, creates the zero page, and builds the FTensors.
So each allocator instance is configured the first time create_kv_tensors is called with its group_id.
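The configure-on-first-call flow described above might look like the sketch below. The parameter names follow the Python-side call in the thread; the member fields, the configured_ flag, and the zero-page/FTensor construction are placeholders, not the PR's actual implementation.

```cpp
#include <cstdint>
#include <string>

// Assumed shape of the allocator's config storage; illustrative only.
class FTensorAllocator {
 public:
  // Stores the KV-cache config from the Python side on first call,
  // then creates the zero page and builds the FTensors.
  void create_kv_tensors(int64_t size, int64_t dtype_size,
                         const std::string& dev_str, int64_t num_layers,
                         int64_t num_kv_buffers) {
    if (configured_) return;  // this group's allocator is already set up
    size_ = size;
    dtype_size_ = dtype_size;
    dev_str_ = dev_str;
    num_layers_ = num_layers;
    num_kv_buffers_ = num_kv_buffers;
    // ... create the zero page and build the FTensors here ...
    configured_ = true;
  }

  bool configured() const { return configured_; }

 private:
  bool configured_ = false;
  int64_t size_ = 0;
  int64_t dtype_size_ = 0;
  int64_t num_layers_ = 0;
  int64_t num_kv_buffers_ = 0;
  std::string dev_str_;
};
```

In this sketch the binding layer would call global_allocator(group_id).create_kv_tensors(...), so each group's allocator picks up its own config independently.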
Tested gpt-oss-20b on sglang-0.5.9.