Skip to content

perf(MoE): Use TE quant/dequant for SwiGLU fp8 input store to improve performance and stability#1753

Draft
xiaoxi-wangfj wants to merge 3 commits intoNVIDIA:mainfrom
021ai:optimize-swiglu-input-fp8-quant
Draft

perf(MoE): Use TE quant/dequant for SwiGLU fp8 input store to improve performance and stability#1753
xiaoxi-wangfj wants to merge 3 commits intoNVIDIA:mainfrom
021ai:optimize-swiglu-input-fp8-quant

Commits

Commits on Dec 31, 2025

Commits on Feb 3, 2026