Skip to content

Initial MLP Speculative Decoding implementation#160

Open
christinaexyou wants to merge 2 commits intovllm-project:mainfrom
christinaexyou:add-mlp-speculator
Open

Initial MLP Speculative Decoding implementation#160
christinaexyou wants to merge 2 commits intovllm-project:mainfrom
christinaexyou:add-mlp-speculator

Conversation

@christinaexyou
Copy link

NOTE: This is currently a draft since I have a question

This PR addresses #41 by adding:

  • MLPSpeculatorConfig
  • MLPSpeculator
  • Relevant unit tests

Questions

  • Currently the forward method generates logits for all possible candidate sequences, should I be choosing the top candidates here through Tree Attention ?

Signed-off-by: Christina Xu <chrxu@redhat.com>
@christinaexyou christinaexyou marked this pull request as ready for review November 3, 2025 21:39
@fynnsu fynnsu self-requested a review November 4, 2025 19:12
@dsikka
Copy link
Collaborator

dsikka commented Jan 15, 2026

Hi @christinaexyou

Has this been tested? Are you still interested in getting this landed?

We will be looking into MTP fine tuning if you’d like to help with that instead?

@dsikka
Copy link
Collaborator

dsikka commented Jan 15, 2026

@Mergifyio refresh

@mergify
Copy link

mergify bot commented Jan 15, 2026

refresh

✅ Pull request refreshed

@christinaexyou
Copy link
Author

Hi @dsikka ! No, I haven't tested it extensively i.e there are benchmark performance tests that I would like to contribute. I'll continue on working on this but I totally understand if this is de-prioritized. Would love to help out with MTP fine tuning in the meantime !

@dsikka
Copy link
Collaborator

dsikka commented Feb 3, 2026

FYi: #267

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants