
Make QuantizeTensorToFloat8Kwargs a frozen Dataclass to enable torch export#4011

Closed
asfiyab-nvidia wants to merge 1 commit intopytorch:mainfrom
asfiyab-nvidia:dev-asfiyab-enable-fp8-export

Conversation


@asfiyab-nvidia asfiyab-nvidia commented Mar 5, 2026

Regular dataclasses are not supported by the Dynamo tracer: they are dispatched to UserDefinedObjectVariable, which has no implementation of as_proxy. Frozen dataclasses, however, are supported, because the as_proxy method is defined for them (source).

Marking QuantizeTensorToFloat8Kwargs as a frozen dataclass enables torch.export on the Float8Tensor subclass.

Fixes #3928
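The mechanic behind this change can be sketched with plain stdlib code (QuantKwargsSketch and its fields below are hypothetical stand-ins, not the actual QuantizeTensorToFloat8Kwargs definition): frozen=True makes instances immutable, which is the property that lets Dynamo treat them as traceable values instead of mutable user objects.

```python
import dataclasses

# Hypothetical stand-in for QuantizeTensorToFloat8Kwargs; field names are
# illustrative only, not the real torchao attributes.
@dataclasses.dataclass(frozen=True)
class QuantKwargsSketch:
    float8_dtype: str = "float8_e4m3fn"
    round_scales_to_power_of_2: bool = False

kwargs = QuantKwargsSketch()
print(kwargs.float8_dtype)  # float8_e4m3fn

# frozen=True rejects attribute assignment after construction:
try:
    kwargs.float8_dtype = "float8_e5m2"
except dataclasses.FrozenInstanceError:
    print("immutable")
```

Because frozen instances are read-only (and hashable when their fields are), the tracer can safely proxy them without worrying about in-place mutation during tracing.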

…export

Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>

pytorch-bot bot commented Mar 5, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/4011

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit b491570 with merge base 5a5029d:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 5, 2026
@asfiyab-nvidia (Author)

@jcaip can you please help with the review? Thanks

@jcaip jcaip added the module: inference quantize_ api inference flow label Mar 6, 2026
@jcaip (Contributor) left a comment

Thanks for the fix @asfiyab-nvidia. Can we also add an export test to test/quantization/quantize_/workflows/float8/test_float8_tensor.py so that this doesn't happen again?

@asfiyab-nvidia (Author)

@jcaip I added another comment on the issue that motivates this change. Given that comment, would you still recommend the change this PR proposes, in order to maintain compatibility with PyTorch < 2.7, where unwrap_tensor_subclass is needed?

@jcaip jcaip closed this Mar 9, 2026


Development

Successfully merging this pull request may close these issues.

[Float8DynamicActivationFloat8WeightConfig] Enable Dynamic Quantized models to be exportable using torch.export.export
