Pull request overview
Adds support for PTPTC-style FP8 MoE quantization by extending MoE weight/scale handling and propagating per-activation-token quantization flags through the fused MoE quant config, along with related checkpoint-loading fixes for per-channel scale tensors.
Changes:
- Extend `Fp8MoEMethod` to support per-token/per-channel (PTPTC) FP8 MoE via per-channel weight scales and updated post-load processing.
- Propagate `per_act_token_quant` into `FusedMoEQuantConfig` group-shape derivation for correct activation quant descriptor behavior.
- Make linear weight loading more robust for per-channel scale tensors (shape edge cases) and update merged replicated sharding logic for per-channel scale layouts.
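As a rough illustration of the PTPTC layout these changes enable — one scale per weight output channel and one scale per activation token — here is a minimal pure-Python sketch. The constant and helper names are illustrative, not the actual `Fp8MoEMethod` API:

```python
# Sketch of PTPTC ("per-token, per-channel") FP8 scale layout.
FP8_E4M3_MAX = 448.0  # largest finite magnitude representable in float8_e4m3

def rowwise_scales(matrix):
    """One scale per row: the row's absolute max divided by the FP8 max."""
    return [max(abs(v) for v in row) / FP8_E4M3_MAX for row in matrix]

# Per-channel weight scales: one scale per output channel (row) of the weight.
weight = [[0.5, -2.0, 1.0], [4.0, 0.1, -0.3]]
w_scales = rowwise_scales(weight)  # len == number of output channels

# Per-token activation scales: one scale per token (row) of the activation.
acts = [[1.0, -8.0, 2.0], [0.2, 0.4, -0.1]]
a_scales = rowwise_scales(acts)    # len == number of tokens

# Dividing each row by its scale puts every element in the FP8 dynamic range.
for row, s in zip(weight, w_scales):
    assert all(abs(v) / s <= FP8_E4M3_MAX for v in row)
```

In this layout the weight scale tensor naturally has shape `(num_channels, 1)`, which is why checkpoint loading needs to tolerate per-channel scale shape edge cases.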
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.
| File | Description |
|---|---|
| `atom/model_ops/moe.py` | Adds PTPTC/per-channel FP8 MoE scale allocation and refactors post-load processing by quant strategy. |
| `atom/model_ops/linear.py` | Improves checkpoint weight loading for per-channel scales (shape handling) and fixes sharding offsets for per-channel scales in merged replicated linears. |
| `atom/model_ops/fused_moe/config.py` | Ensures `per_act_token_quant` influences activation group-shape selection in fused MoE quant config creation. |
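To illustrate how a `per_act_token_quant` flag can drive activation group-shape selection, here is a hedged sketch. The `GroupShape` descriptor and its sentinel values are assumptions modeled on common fused-MoE quant configs, not the actual `FusedMoEQuantConfig` API:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class GroupShape:
    """Hypothetical quant-group descriptor: -1 means the whole dimension
    shares one scale."""
    rows: int
    cols: int

PER_TENSOR = GroupShape(-1, -1)  # one scale for the entire activation tensor
PER_TOKEN = GroupShape(1, -1)    # one scale per token (per row)

def act_group_shape(per_act_token_quant: bool) -> GroupShape:
    """Per-token quant selects a per-row group shape; otherwise per-tensor."""
    return PER_TOKEN if per_act_token_quant else PER_TENSOR
```

Without the flag feeding into this derivation, a per-token checkpoint would be described as per-tensor, producing the wrong activation quant descriptor downstream.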
`atom/model_ops/linear.py` (comment on an outdated diff)
```diff
 param.data = param.data.view(loaded_weight.dtype)
-param.data.copy_(post_process_func(loaded_weight))
+loaded_weight = post_process_func(loaded_weight)
+if loaded_weight.shape != param.data.shape and loaded_weight.numel() == param.data.numel():
```
The new fallback reshaping in weight_loader_process will silently accept any shape mismatch as long as numel() matches, which can mask real checkpoint/parameter layout bugs (e.g., transposed tensors) and lead to incorrect weights without an error. Consider restricting reshapes to the specific expected cases (e.g., 1D -> (N,1) or squeezing singleton dimensions), and otherwise raise an error with a helpful message that includes both shapes.
Suggested change (replacing `if loaded_weight.shape != param.data.shape and loaded_weight.numel() == param.data.numel():`):

```python
# Only allow very specific, safe reshapes. Arbitrary reshapes based solely
# on matching numel() can hide real layout/transpose bugs.
if loaded_weight.shape != param.data.shape:
    if loaded_weight.numel() != param.data.numel():
        raise RuntimeError(
            f"Loaded weight shape {tuple(loaded_weight.shape)} with numel="
            f"{loaded_weight.numel()} does not match parameter shape "
            f"{tuple(param.data.shape)} with numel={param.data.numel()}."
        )
    # Allow reshapes that differ only by singleton (size-1) dimensions.
    def _strip_ones(shape: torch.Size) -> tuple[int, ...]:
        return tuple(int(d) for d in shape if int(d) != 1)
    core_loaded = _strip_ones(loaded_weight.shape)
    core_param = _strip_ones(param.data.shape)
    if core_loaded != core_param:
        raise RuntimeError(
            "Loaded weight shape is incompatible with parameter shape even "
            "though numel() matches, which may indicate a layout/transpose "
            "mismatch.\n"
            f"  Checkpoint shape: {tuple(loaded_weight.shape)}\n"
            f"  Parameter shape: {tuple(param.data.shape)}"
        )
```
Force-pushed from 21bbc93 to 0d24431.