
Qwen3 next -- fixed sigmoid and mul broadcast issue #86

Open
IzacharyI wants to merge 1 commit into zejunchen-zejun:qwen3_next_hyx from IzacharyI:qwen3_next_hyx_local

@IzacharyI

Motivation

Fixes the sigmoid and mul broadcast issue.

Modifications

  • Remove autotune, which was re-triggered repeatedly across different sequence lengths
  • Use a fixed config: BLOCK_N=32, BLOCK_H=1024
  • Use an in-place operation to avoid tensor allocation overhead
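
For reviewers, here is a rough toy model of why autotuning can re-trigger across sequence lengths and why a fixed config avoids it. It is not the kernel code from this PR and it does not use Triton's API: `ToyAutotuner`, `fake_benchmark`, and `count_tuning_runs` are hypothetical names, the cost function is made up, and it assumes the autotune cache was keyed on a value that changes with sequence length. Only the `BLOCK_N=32, BLOCK_H=1024` values come from the list above.

```python
from typing import Callable, Dict, List, Tuple

Config = Tuple[int, int]  # (BLOCK_N, BLOCK_H)

# Fixed config taken from the PR description.
FIXED_CONFIG: Config = (32, 1024)


class ToyAutotuner:
    """Hypothetical stand-in for an autotuner that caches one config per key."""

    def __init__(self, configs: List[Config],
                 benchmark: Callable[[Config, int], float]) -> None:
        self.configs = configs
        self.benchmark = benchmark
        self.cache: Dict[int, Config] = {}
        self.tuning_runs = 0  # how many times the full benchmark sweep ran

    def pick(self, seq_len: int) -> Config:
        # Assumption: the cache key depends on the sequence length, so every
        # previously unseen length pays for a full benchmark sweep.
        if seq_len not in self.cache:
            self.tuning_runs += 1
            self.cache[seq_len] = min(
                self.configs, key=lambda cfg: self.benchmark(cfg, seq_len)
            )
        return self.cache[seq_len]


def fake_benchmark(config: Config, seq_len: int) -> float:
    """Made-up cost function for illustration only; not a real measurement."""
    block_n, block_h = config
    return abs(block_n - 32) + abs(block_h - 1024) / 1024


def count_tuning_runs(seq_lens: List[int]) -> int:
    """Count benchmark sweeps triggered by a stream of sequence lengths."""
    tuner = ToyAutotuner([(16, 512), (32, 1024), (64, 2048)], fake_benchmark)
    for seq_len in seq_lens:
        tuner.pick(seq_len)
    return tuner.tuning_runs


def pick_fixed(seq_len: int) -> Config:
    """With a fixed config there is nothing to tune, whatever the length."""
    return FIXED_CONFIG
```

In this model, a workload with many distinct sequence lengths triggers one sweep per new length, while `pick_fixed` never benchmarks. The trade-off is that a fixed config may not be the fastest choice for every shape.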

Accuracy Tests

Benchmarking and Profiling

Checklist
