Skip to content

Cuda graphed torch grouped mm for MoE inference.#3677

Draft
sidsingh-nvidia wants to merge 105 commits intoNVIDIA:mainfrom
sidsingh-nvidia:add-cuda-graphed-torch-grouped-gemm
Draft

Cuda graphed torch grouped mm for MoE inference.#3677
sidsingh-nvidia wants to merge 105 commits intoNVIDIA:mainfrom
sidsingh-nvidia:add-cuda-graphed-torch-grouped-gemm

Commits

Commits on Jan 14, 2026

Commits on Jan 23, 2026

Commits on Jan 26, 2026

Commits on Jan 29, 2026

Commits on Jan 30, 2026

Commits on Feb 5, 2026

Commits on Feb 9, 2026

Commits on Feb 13, 2026

Commits on Feb 16, 2026

Commits on Feb 17, 2026

Commits on Feb 20, 2026

Commits on Feb 21, 2026

Commits on Feb 25, 2026

Commits on Feb 26, 2026

Commits on Feb 28, 2026

Commits on Mar 2, 2026

Commits on Mar 3, 2026