[fnuz] transform ocp e4m3 to e4m3_fnuz during loading #1
Open
ZhiweiYan-96 wants to merge 1 commit into main
Conversation
Motivation: Reusing public float8_e4m3 models
TorchAO officially releases FP8 checkpoints with float8_e4m3 weights. Popular models such as DeepSeek-R1 are also released with float8_e4m3 weights, since they are trained directly in FP8.
Reusing these existing models by transforming their weights to float8_e4m3fnuz is valuable, especially for models trained in FP8.
Design
I have two methods for reusing these checkpoints.
Method 1: Hook the fp8 subtensor initialization.
This is based on the fact that AO checkpoints bind kernel dispatch behavior to the weight tensor subclass. We hook the subclass initialization and modify the raw data, without re-quantizing the model from scratch. The change is intrusive, but the conversion happens quietly during loading, with a small memory overhead.
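A minimal sketch of the bit-level conversion such a hook could apply, assuming per-tensor scales; the function name is hypothetical, and TorchAO's actual subclass internals may differ:

```python
import torch

def e4m3fn_to_e4m3fnuz(weight: torch.Tensor, scale: torch.Tensor):
    """Reinterpret OCP float8_e4m3fn bits as float8_e4m3fnuz without
    re-quantizing.

    e4m3fnuz uses an exponent bias of 8 instead of 7, so the same bit
    pattern decodes to half the e4m3fn value; doubling the scale keeps
    the dequantized values identical. The 0x80 pattern is -0 in e4m3fn
    but NaN in e4m3fnuz, so it is zeroed first.
    """
    assert weight.dtype == torch.float8_e4m3fn
    bits = weight.view(torch.int8).clone()
    bits[bits == -128] = 0  # 0x80: -0 in e4m3fn would be NaN in e4m3fnuz
    return bits.view(torch.float8_e4m3fnuz), scale * 2.0
```

Because this only reinterprets bits and rescales, no dequantize/re-quantize round trip is needed and no extra quantization error is introduced.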
I have verified this method with the checkpoint released at https://huggingface.co/pytorch/Qwen3-32B-FP8; inference works as expected.
Method 2: Dequantize the fp8_e4m3 linear weight, then re-quantize it using float8_e4m3fnuz.
With this method, we do not introduce any intrusive change in TorchAO, and I have verified that inference works well. However, users need to write their own scripts for dequantization and re-quantization, which is tricky.
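A minimal sketch of such a script, assuming per-tensor scales (the helper name is hypothetical, and real checkpoints may carry per-row or per-block scales instead):

```python
import torch

FNUZ_MAX = torch.finfo(torch.float8_e4m3fnuz).max  # 240.0

def requantize_to_fnuz(weight: torch.Tensor, scale: torch.Tensor):
    """Dequantize a float8_e4m3fn weight, then re-quantize to e4m3fnuz."""
    assert weight.dtype == torch.float8_e4m3fn
    dense = weight.to(torch.float32) * scale        # dequantize
    new_scale = dense.abs().amax() / FNUZ_MAX       # fresh per-tensor scale
    q = (dense / new_scale).clamp(-FNUZ_MAX, FNUZ_MAX)
    return q.to(torch.float8_e4m3fnuz), new_scale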
Worse, float8_e4m3fnuz serialization is not supported by safetensors. So even if users write such scripts, they cannot save a model with float8_e4m3fnuz weights.