GroupedGemm: NVFP4 via cuBLAS #2455

Open

Assignees

Labels

opened

Implement GroupedGemm for NVFP4 format using cuBLAS kernels. Ensure integration with GroupedTensor utilities and grouped quantization pathways for Sync-Free MoE training.

Metadata

Assignees

pggPL

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests