Skip to content

[CI] Enable Qwen3-Omni Performance Benchmark#85

Open
sammysun0711 wants to merge 19 commits intozejunchen-zejun:dev/perffrom
sammysun0711:enable_qwen3_omni_bench
Open

[CI] Enable Qwen3-Omni Performance Benchmark#85
sammysun0711 wants to merge 19 commits intozejunchen-zejun:dev/perffrom
sammysun0711:enable_qwen3_omni_bench

Conversation

@sammysun0711
Copy link
Collaborator

Motivation

This PR aim to enable Qwen3-Omni-Instruct performance benchmark in CI.

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Signed-off-by: Xiake Sun <xiake.sun@amd.com>
Signed-off-by: Xiake Sun <xiake.sun@amd.com>
@sammysun0711 sammysun0711 requested a review from Copilot December 10, 2025 09:35
Copy link

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR enables performance benchmarking for the Qwen3-Omni model in the CI pipeline. It adds a new performance benchmark step to the GitHub Actions workflow and configures the benchmark parameters to test with multiple images at higher resolution.

Key Changes:

  • Added performance benchmark workflow step for Qwen3-Omni in GitHub Actions
  • Updated server launch configuration with new environment variables and adjusted memory/prefill parameters
  • Modified benchmark parameters to test with 20 images at 960x1280 resolution instead of 1 image at 800x800

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File Description
scripts/ci/sglang_benchmark_workflow.sh Added ROCm environment variables, adjusted chunked-prefill and max-prefill-tokens sizes, reduced cuda-graph-max-bs, and updated performance benchmark parameters for Qwen3-Omni
.github/workflows/sglang_benchmark_workflow.yaml Added new "Run performance benchmark" step that executes the performance benchmark for Qwen3-Omni model

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants