@huydhn huydhn commented Dec 13, 2025

Several small tweaks:

  • Disable the arm benchmark for now, as it is not working and has no owner
  • Parse the list of models only from the serving benchmark config. It always includes every model because the serving benchmark is the most basic one, and it has the tensor_parallel_size field set correctly. This fixes a nondeterministic issue where a model's tensor_parallel_size could be missing, depending on the order in which glob returned the benchmark configs
  • Run daily, saving capacity for the upcoming nightly run
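
For reviewers, here is a minimal sketch of the second tweak. It is not the code in this PR. It assumes the serving config is a JSON list whose entries carry a server_parameters object with model and tensor_parallel_size keys. The function names, the file layout, and the fallback value of 1 are all hypothetical.

```python
import json
from pathlib import Path


def models_from_entries(entries):
    """Map each model name to its tensor_parallel_size.

    Assumed (hypothetical) layout: every entry may carry a
    "server_parameters" dict with "model" and "tensor_parallel_size".
    """
    models = {}
    for entry in entries:
        params = entry.get("server_parameters", {})
        model = params.get("model")
        if model is None:
            # Skip entries that do not describe a model.
            continue
        # Falling back to 1 is an assumption of this sketch.
        models[model] = params.get("tensor_parallel_size", 1)
    return models


def models_from_serving_config(config_path):
    """Read a single serving config instead of globbing every config.

    Reading one known file means the result cannot depend on the
    order in which a glob call happens to return its matches.
    """
    entries = json.loads(Path(config_path).read_text())
    return models_from_entries(entries)
```

Because only one file is read, the resulting mapping is the same on every run, regardless of how the other benchmark configs are listed on disk.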

@huydhn huydhn requested a review from yangw-dev December 13, 2025 08:32
@meta-cla meta-cla bot added the cla signed label Dec 13, 2025
Signed-off-by: Huy Do <huydhn@gmail.com>
@huydhn huydhn force-pushed the vllm-benchmark-tweaks-1213 branch from 043f5b1 to c4dcf9f Compare December 13, 2025 08:34