Skip to content

Conversation

@Xunzhuo
Copy link
Member

@Xunzhuo Xunzhuo commented Dec 15, 2025

This PR added proposal: refactor core for extensible router architecture

Signed-off-by: bitliu <bitliu@tencent.com>
@Xunzhuo Xunzhuo changed the title Proposal: refactor core for extensible router architecture [Proposal] refactor core for extensible router architecture Dec 15, 2025
@bhks
Copy link

bhks commented Jan 3, 2026

Really great proposal !! Could not have been started by other person than you and the team.

Given your experience in AI gateway world and envoy,

  1. I am wondering what prompted us to build this ?
  2. From proxy perspective I believe prefix-aware and design of prefill+decode aware routing can be 2 differentiator to build this router.
  3. I am also wondering how would this be different from [envoy + K8s EPP ] Implementation ? llm-d project along with other k8s sigs like lws has been making quite some progress. I guess this gateway can be coupled with those as plugin.
  4. Would we be compatible with semantic router and how do we see these 2 being integrated ?
  5. Would the router be infrastructure agnostic like k8s vs bare metal vs slurm ?

Thanks again, I am looking forward to this project and would love to contribute as well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants