
feat: support gpt-oss in vllm #259

Open
ztang2370 wants to merge 5 commits into ovg-project:main from ztang2370:feat/support-gpt-oss-in-vllm

Conversation

@ztang2370
Contributor

No description provided.

@jiarong0907
Collaborator

/gemini review

Contributor

@gemini-code-assist bot left a comment

Code Review

This pull request introduces support for multiple KV cache groups in the vLLM integration, which is a key step for supporting hybrid models like gpt-oss. The changes correctly remove the single-group limitation and add validation for geometric compatibility across groups. Support for SlidingWindowSpec is also added. My review includes a couple of critical fixes to prevent potential IndexError exceptions when handling the list of KV cache groups, and a suggestion to improve code consistency.

@ztang2370 force-pushed the feat/support-gpt-oss-in-vllm branch from 067fe0a to 2e748d7 on March 17, 2026 at 15:10
@ztang2370
Contributor Author

/gemini review

Contributor

@gemini-code-assist bot left a comment

Code Review

This pull request adds support for gpt-oss models in vLLM by enabling handling of multiple KV cache groups. The changes correctly generalize the logic for initialization, allocation, and management of KV caches to work with multiple groups. This includes updating how the number of layers is calculated, validating that all cache groups have compatible geometry, and correctly iterating over all groups and layers when setting up caches. I've found a couple of places where removing the restriction to a single KV cache group has also removed an implicit check for an empty list of groups. This could lead to an IndexError if the configuration provides an empty list of kv_cache_groups. I've added comments with suggestions to add explicit checks for this case to provide better error handling.

Comment on lines +258 to +268
for grp in kv_groups[1:]:
    grp_spec = grp.kv_cache_spec
    grp_block_size = grp_spec.block_size
    grp_cell_size = grp_spec.page_size_bytes // grp_block_size // 2
    if grp_block_size != block_size or grp_cell_size != cell_size:
        raise ValueError(
            "kvcached requires all KV cache groups to have the "
            f"same block geometry. Group 0: block_size={block_size},"
            f" cell_size={cell_size}; another group: "
            f"block_size={grp_block_size}, cell_size={grp_cell_size}"
        )
Contributor

high

While this new validation for multiple groups is great, a potential issue has been introduced by removing the if len(kv_groups) != 1: check. There is no longer a guard against kv_groups being empty. This can lead to an IndexError on line 251 (kv_cache_group = kv_groups[0]).

Please add a check for empty kv_groups after line 249, for example:

if not kv_groups:
    raise ValueError("kv_cache_config.kv_cache_groups cannot be empty when kvcached is enabled.")
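The check suggested above can be exercised in isolation. The sketch below combines the reviewer's empty-list guard with the geometry loop quoted from the PR; the KVCacheSpec and KVCacheGroup dataclasses are minimal hypothetical stand-ins for vLLM's real spec classes, not their actual definitions:

```python
from dataclasses import dataclass


@dataclass
class KVCacheSpec:  # hypothetical stand-in for vLLM's spec classes
    block_size: int
    page_size_bytes: int


@dataclass
class KVCacheGroup:  # hypothetical stand-in for a KV cache group
    kv_cache_spec: KVCacheSpec


def validate_group_geometry(kv_groups) -> None:
    # Reviewer's suggestion: fail with a descriptive ValueError instead
    # of an IndexError on kv_groups[0] when the list is empty.
    if not kv_groups:
        raise ValueError(
            "kv_cache_config.kv_cache_groups cannot be empty "
            "when kvcached is enabled.")
    first = kv_groups[0].kv_cache_spec
    block_size = first.block_size
    cell_size = first.page_size_bytes // block_size // 2
    # Geometry loop as written in the PR: every group must match group 0.
    for grp in kv_groups[1:]:
        grp_spec = grp.kv_cache_spec
        grp_block_size = grp_spec.block_size
        grp_cell_size = grp_spec.page_size_bytes // grp_block_size // 2
        if grp_block_size != block_size or grp_cell_size != cell_size:
            raise ValueError(
                "kvcached requires all KV cache groups to have the "
                f"same block geometry. Group 0: block_size={block_size},"
                f" cell_size={cell_size}; another group: "
                f"block_size={grp_block_size}, cell_size={grp_cell_size}")


# Two groups with identical geometry pass; a mismatch raises ValueError.
ok = [KVCacheGroup(KVCacheSpec(16, 4096)), KVCacheGroup(KVCacheSpec(16, 4096))]
validate_group_geometry(ok)
```

A full-attention group and a sliding-window group can share geometry (same block_size and per-cell byte count) even though their eviction behavior differs, which is why the check compares sizes rather than spec types.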

Contributor Author

Pushing back: this seems overly defensive. If kv_cache_groups were somehow empty, that would indicate a fundamental bug in vLLM's model initialization, and it should surface loudly as an IndexError with a full traceback.

Comment on lines +597 to +603
for grp in kv_cache_config.kv_cache_groups:
    grp_spec = grp.kv_cache_spec
    if not isinstance(grp_spec, (FullAttentionSpec, SlidingWindowSpec)):
        raise ValueError(
            f"kvcached only supports FullAttentionSpec and SlidingWindowSpec layers, "
            f"got {type(grp_spec).__name__}"
        )
Contributor

high

Similar to another part of this PR, removing the check for multiple kv_cache_groups also removes the implicit check for an empty list. If kv_cache_config.kv_cache_groups is empty, the access on line 605 (first_kv_cache_group = kv_cache_config.kv_cache_groups[0]) will raise an IndexError. It would be safer to add a check for an empty list before proceeding, for example by raising a ValueError.

Contributor Author

Same as above.
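The spec-type check quoted in this thread can also be tried standalone. In this sketch FullAttentionSpec and SlidingWindowSpec are empty placeholder classes (not vLLM's real implementations), and SimpleNamespace stands in for a KV cache group object:

```python
from types import SimpleNamespace


class FullAttentionSpec:  # placeholder for vLLM's FullAttentionSpec
    pass


class SlidingWindowSpec:  # placeholder for vLLM's SlidingWindowSpec
    pass


def check_supported_specs(kv_cache_groups) -> None:
    # Mirror of the PR's loop: every group's spec must be one of the
    # two supported attention layouts, otherwise reject loudly.
    for grp in kv_cache_groups:
        grp_spec = grp.kv_cache_spec
        if not isinstance(grp_spec, (FullAttentionSpec, SlidingWindowSpec)):
            raise ValueError(
                f"kvcached only supports FullAttentionSpec and "
                f"SlidingWindowSpec layers, got {type(grp_spec).__name__}")


# A mix of full-attention and sliding-window groups, as in gpt-oss-style
# hybrid models, passes the check.
groups = [SimpleNamespace(kv_cache_spec=FullAttentionSpec()),
          SimpleNamespace(kv_cache_spec=SlidingWindowSpec())]
check_supported_specs(groups)
```

Note that an empty list passes this loop silently; the IndexError the reviewer worries about would come only from the later kv_cache_groups[0] access, which is the author's point about letting the traceback surface the bug.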

