Enable fp16+int4 mixed precision path for int4 xpu path with int zero point #2240
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2240
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit fa6ca5d with merge base 2c901b3.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@jerryzh168 can you help to review?
can you add a test? maybe add one under `ao/test/dtypes/test_affine_quantized.py` (line 299 in 4d5f657, `class TestAffineQuantizedBasic(TestCase):`)
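For illustration, a hypothetical sketch of what such a test might look like; the class name, decorator, and torchao import paths are assumptions modeled on existing tests in that file, not the final implementation:

```python
# Hypothetical test sketch for the fp16 + int4 XPU path with int zero point.
import unittest

import torch
from torch.testing._internal.common_utils import TestCase

from torchao.dtypes import Int4XPULayout
from torchao.quantization import int4_weight_only, quantize_
from torchao.quantization.quant_primitives import ZeroPointDomain


class TestInt4XPUFP16(TestCase):
    @unittest.skipIf(not torch.xpu.is_available(), "Need XPU available")
    def test_int4wo_fp16_activation_int_zero_point(self):
        m = torch.nn.Sequential(torch.nn.Linear(128, 128)).to("xpu", torch.float16)
        quantize_(
            m,
            int4_weight_only(
                group_size=32,
                layout=Int4XPULayout(),
                zero_point_domain=ZeroPointDomain.INT,
            ),
        )
        x = torch.randn(1, 128, dtype=torch.float16, device="xpu")
        # Should dispatch to the fp16 A16W4 kernel and run without error.
        m(x)
```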
@pytorchbot label new feature
Didn't find following labels among repository labels: new, feature
@pytorchbot label quantize
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Merge failed
Reason: 1 job has failed, first few of them are: PR Label Check / Check PR Labels
Details for Dev Infra team: Raised by workflow job
@pytorchbot label ciflow/xpu
To add these label(s) (ciflow/xpu) to the PR, please first approve the workflows that are awaiting approval (scroll to the bottom of this page). This helps ensure we don't trigger CI on this PR until it is actually authorized to do so. Please ping one of the reviewers if you do not have access to approve and run workflows.
@pytorchbot label ciflow/xpu
Didn't find following labels among repository labels: ciflow/xpu
@pytorchbot label ci
Yes. I checked the tests in test_affine_quantized.py. XPU is not enabled for a lot of UTs (not only for dtype); I will open a new PR to enable these UTs.
Background
On the XPU device, when the user selects an integer zero point, the `torch.ops.aten._weight_int4pack_mm_with_scales_and_zeros` operator is used for the A16W4 (16-bit activation, 4-bit weight) computation. This op supports both FP16 and BF16 activations on XPU, but torchao currently enables only the BF16 activation path. This PR unlocks FP16 activation support.
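For context, a minimal usage sketch of the path this PR enables; the `int4_weight_only`, `Int4XPULayout`, and `ZeroPointDomain.INT` names reflect torchao's current API as I understand it and may differ across versions:

```python
# Minimal sketch: int4 weight-only quantization on XPU with an integer
# zero point and fp16 activations, the combination this PR enables.
import torch

from torchao.dtypes import Int4XPULayout
from torchao.quantization import int4_weight_only, quantize_
from torchao.quantization.quant_primitives import ZeroPointDomain

model = torch.nn.Sequential(torch.nn.Linear(256, 256)).to("xpu", torch.float16)
quantize_(
    model,
    int4_weight_only(
        group_size=32,
        layout=Int4XPULayout(),
        # An integer zero point routes the matmul to the
        # _weight_int4pack_mm_with_scales_and_zeros kernel on XPU.
        zero_point_domain=ZeroPointDomain.INT,
    ),
)
x = torch.randn(1, 256, dtype=torch.float16, device="xpu")
y = model(x)  # previously only bf16 activations were supported here
```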