
Conversation

@phdddd phdddd commented Dec 24, 2025

What does this PR do?

This PR adds Ascend fused operators for the Qwen3VL model.

Checklist Before Starting

  • Search for similar PRs. Paste at least one query link here: ...
  • Format the PR title as [{modules}] {type}: {description} (This will be checked by the CI)
    • {modules} include misc, ci, config, docs, data, dist, omni, logging, model, optim, ckpt, release, task, perf, ops, parallel
    • If this PR involves multiple modules, separate them with , like [ci, data, model]
    • {type} is in feat, fix, refactor, chore, test
    • If this PR breaks any API (CLI arguments, config, function signature, etc.), add [BREAKING] to the beginning of the title.
    • Example: [BREAKING][parallel, model] feat: dynamic batching

Test

For changes that cannot be tested by CI (e.g., algorithm implementations, new model support), validate by experiment(s) and show results such as training curve plots, evaluation results, etc.

API and Usage Example

Demonstrate how the API changes, if any, and provide usage example(s) if possible.

# Add code snippet or script demonstrating how to use this
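
As a purely illustrative sketch (not this PR's actual code), assuming torch_npu exposes the npu_rotary_mul fused rotary-embedding operator, a patched RoPE path might look like the following, falling back to the eager implementation off-NPU:

import torch

def rotate_half(x):
    # Standard RoPE helper: swap halves and negate the second half
    x1, x2 = x.chunk(2, dim=-1)
    return torch.cat((-x2, x1), dim=-1)

def apply_rotary_pos_emb(q, k, cos, sin):
    try:
        import torch_npu  # present only in Ascend NPU builds
        # Fused kernel computing x * cos + rotate_half(x) * sin in one call
        return (torch_npu.npu_rotary_mul(q, cos, sin),
                torch_npu.npu_rotary_mul(k, cos, sin))
    except ImportError:
        # Eager fallback on non-Ascend devices
        return (q * cos + rotate_half(q) * sin,
                k * cos + rotate_half(k) * sin)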

Design & Code Changes

Demonstrate the high-level design if this PR is complex, and list the specific changes.

Checklist Before Submitting

Important

Please check all the following items before requesting a review; otherwise, the reviewer may deprioritize this PR.

@github-actions github-actions bot added the ascend everything about Ascend support label Dec 24, 2025

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request introduces fused operators for Ascend NPUs, focusing on AdamW and RoPE for Qwen3VL models. While the RoPE implementation appears solid, the fused AdamW implementation has two significant issues. First, it contains hardcoded CUDA device calls, which will fail on NPU-targeted code. Second, the fused AdamW operator loops over parameters in Python, which negates the performance benefit of fusion. I have provided specific suggestions to address these high-priority issues.
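
For illustration only, here is a minimal sketch of the two suggested fixes (a hypothetical fused_adamw_step, not the PR's code), assuming plain PyTorch: take the device implicitly from the parameter tensors instead of calling torch.cuda, and replace the per-parameter Python loop with multi-tensor torch._foreach_* ops so the whole update runs in a handful of batched kernels:

import math
import torch

def fused_adamw_step(params, grads, exp_avgs, exp_avg_sqs, step,
                     lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8,
                     weight_decay=0.01):
    # Device-agnostic: all math happens on whatever device the tensors
    # already live on (NPU, CUDA, or CPU) -- no torch.cuda calls needed.
    bias_correction1 = 1 - beta1 ** step
    bias_correction2 = 1 - beta2 ** step

    # Decoupled weight decay, applied to every parameter at once
    torch._foreach_mul_(params, 1 - lr * weight_decay)

    # Batched first/second-moment updates instead of a Python loop
    torch._foreach_mul_(exp_avgs, beta1)
    torch._foreach_add_(exp_avgs, grads, alpha=1 - beta1)
    torch._foreach_mul_(exp_avg_sqs, beta2)
    torch._foreach_addcmul_(exp_avg_sqs, grads, grads, value=1 - beta2)

    denom = torch._foreach_sqrt(exp_avg_sqs)
    torch._foreach_div_(denom, math.sqrt(bias_correction2))
    torch._foreach_add_(denom, eps)

    # params -= lr / bias_correction1 * exp_avg / denom
    torch._foreach_addcdiv_(params, exp_avgs, denom,
                            value=-lr / bias_correction1)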

@zhihaofang1017

Could you provide a visual comparison of accuracy and performance?
How much performance improvement does each fused operator provide?

@FoolPlayer FoolPlayer changed the title [Modify]Add ascend fused operators for Qwen3VL [model] feat: Add ascend fused operators for Qwen3VL Dec 30, 2025

phdddd commented Jan 4, 2026

code format

@Crystal-jiang Crystal-jiang merged commit e0db332 into ByteDance-Seed:main Jan 8, 2026
14 of 15 checks passed