[Issue #245] Add pipeline parallelism support by iMmAseu · Pull Request #246 · ovg-project/kvcached

iMmAseu · 2026-02-21T03:49:13Z

Summary

This PR fixes a TimeoutError during vLLM initialization when Pipeline Parallelism is enabled.

Fixes #243

Root Cause

KVCacheManager only used tensor_parallel_size, which caused incorrect process detection in PP setups.

Changes

Fix IPC synchronization using global world_size
Compute consistent global_rank
Rename tp_size → world_size
Update listener rank detection

Test Plan

ivanium · 2026-02-21T06:32:40Z

Just curious does this PR support enabling both TP and PP, say TP=2 and PP=2? In that case, how do we get the consistent global_rank here?

iMmAseu · 2026-02-22T08:32:05Z

@ivanium Thanks for pointing this out, your concern is absolutely valid. I’ve updated the implementation to correctly handle the TP and PP setup and compute a consistent global_rank across all processes. I’ve also run tests with configurations like TP=2 and PP=2, and the initialization now works seemly as expected without synchronization issues.

cui36 · 2026-02-22T22:34:26Z

Hi @iMmAseu, thanks for the update! Let's split it into two parts, and I will go through the vllm patch.

cui36 · 2026-02-24T02:42:05Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces support for pipeline parallelism by refactoring the handling of distributed process ranks. The changes primarily involve renaming tp_size to world_size and adding pp_rank to correctly manage inter-process communication for KV cache synchronization across different pipeline stages. The implementation correctly namespaces IPC sockets by pp_rank to prevent conflicts.

My review has identified a few areas for improvement:

There is some duplicated code for resolving distributed ranks in the SGLang integration patches.
The use of broad except Exception clauses could mask unexpected errors.
The docstrings for the world_size parameter are inconsistent with its actual usage, which could lead to confusion.

Overall, the changes are well-structured and address the issue described. Addressing the feedback will improve the code's maintainability and clarity.

kvcached/kv_cache_manager.py

kvcached/page_allocator.py

kvcached/integration/sglang/patches.py

kvcached/integration/vllm/patches.py

cui36 self-requested a review February 22, 2026 00:37

gemini-code-assist bot reviewed Feb 24, 2026

View reviewed changes

kvcached/kv_cache_manager.py Outdated Show resolved Hide resolved

kvcached/page_allocator.py Outdated Show resolved Hide resolved

kvcached/integration/sglang/patches.py Show resolved Hide resolved

kvcached/integration/vllm/patches.py Show resolved Hide resolved

cui36 and others added 10 commits February 28, 2026 03:55

feat: support vllm-0.14.0

74dd8ef

fix: pre-commit issue

b14f9f6

Feat: add pipeline parallel for vllm 0.14.0

9817282

Support pp for sgl

6408aa2

fix world_size conflicts, support tp+pp

2987298

support pp on sgl

d597ae4

support PP on vLLM

6e98d21

change the test file

1c6feba

Fix commentary and improve exception handling

6c40d99

fix: make scripts executable

709a5d2

iMmAseu force-pushed the feat-pp-support branch from 8cbc6ad to 709a5d2 Compare February 28, 2026 09:15

fix: convert shell scripts to LF

d5dd883

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Issue #245] Add pipeline parallelism support#246

[Issue #245] Add pipeline parallelism support#246
iMmAseu wants to merge 11 commits intoovg-project:mainfrom
iMmAseu:feat-pp-support

iMmAseu commented Feb 21, 2026 •

edited

Loading

Uh oh!

ivanium commented Feb 21, 2026

Uh oh!

iMmAseu commented Feb 22, 2026

Uh oh!

cui36 commented Feb 22, 2026

Uh oh!

cui36 commented Feb 24, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

iMmAseu commented Feb 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Root Cause

Changes

Test Plan

Uh oh!

ivanium commented Feb 21, 2026

Uh oh!

iMmAseu commented Feb 22, 2026

Uh oh!

cui36 commented Feb 22, 2026

Uh oh!

cui36 commented Feb 24, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

iMmAseu commented Feb 21, 2026 •

edited

Loading