Skip to content

[ROCm][MLA] Enable MLA persistent kernel with fp8 and bf16 support#817

Open
zejunchen-zejun wants to merge 1 commit intodev/perffrom
zejun/port_ganyi_mla_to_dev_perf
Open

[ROCm][MLA] Enable MLA persistent kernel with fp8 and bf16 support#817
zejunchen-zejun wants to merge 1 commit intodev/perffrom
zejun/port_ganyi_mla_to_dev_perf

Conversation

@zejunchen-zejun
Copy link

@zejunchen-zejun zejunchen-zejun commented Nov 20, 2025

Port the PR: vllm-project#27380

B, self.num_heads, self.kv_lora_rank, dtype=q.dtype, device=q.device
)
B, self.num_heads, self.kv_lora_rank, dtype=torch.bfloat16, device=q.device
).fill_(-1)
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@ganyi1996ppo do we need this fill?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think we do

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

removed

Signed-off-by: zejunchen-zejun <zejun.chen@amd.com>
@zejunchen-zejun zejunchen-zejun force-pushed the zejun/port_ganyi_mla_to_dev_perf branch from b76c385 to 331643e Compare November 26, 2025 05:49
@github-actions
Copy link

This pull request has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this pull request should remain open. Thank you!

@github-actions github-actions bot added the stale label Feb 25, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants