[WIP] Initial prototype of differentiable grouped_scaled_mm function for torchao #1969

danielvegamyhre · 2025-03-26T22:00:04Z

Summary

The _grouped_scaled_mm function in torchao will do:

float8 rowwise quantization on inputs
grouped scaled mm
do this in a differentiable way

Test plan

TODO add more robust tests w/ gradient checks

pytorch-bot · 2025-03-26T22:00:07Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1969

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures

As of commit 281950c with merge base 923242e ():

NEW FAILURES - The following jobs have failed:

.github/workflows/float8nocompile_test.yaml (gh)
Code Analysis with Ruff / build (3.9) (gh)
Process completed with exit code 1.
Run TorchAO Experimental Tests / test-cpu-ops (macos-14) (gh)
test_shared_embedding
Run TorchAO Experimental Tests / test-mps-ops (macos-m1-stable) (gh)
Process completed with exit code 1.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

danielvegamyhre · 2025-03-28T03:44:46Z

torchao/float8/float8_tensor.py

+            None,
+            0,
+            -1,
+            -2,


Note for reviewer: we need to allow expand this constraint so we can do rowwise scaling of the 2D subtensors embedded in 3D tensors.

e.g.

A = (M,K) and B = (B,K,N)

A_scale should be (M,)

B_scale should be (B,N) <-- because we computed rowwise scales for the (K,N) subtensors

danielvegamyhre added 2 commits March 26, 2025 13:50

grouped_mm forward pass

134242b

add unit test

2113753

danielvegamyhre added the topic: not user facing Use this tag if you don't want this PR to show up in release notes label Mar 26, 2025

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 26, 2025

danielvegamyhre added 2 commits March 26, 2025 15:56

only support float8

0a90f0b

rowwise scaling test passing

a761549

danielvegamyhre force-pushed the grouped-mm branch from 7244136 to a761549 Compare March 27, 2025 01:40

add 3Dx3D test

8d15a8a

danielvegamyhre force-pushed the grouped-mm branch from 7afbe08 to 8d15a8a Compare March 27, 2025 03:08

numeric unit tests passing

cced381

danielvegamyhre force-pushed the grouped-mm branch from 8268b63 to cced381 Compare March 27, 2025 15:21

danielvegamyhre changed the title ~~[WIP] Initial prototype of grouped_mm API for torchao~~ [GroupedMM] Initial prototype of grouped_mm API for torchao Mar 27, 2025

danielvegamyhre added 2 commits March 27, 2025 08:22

lint

46d7e42

update 3Dx3D case

e32d528

danielvegamyhre requested a review from vkuzo March 27, 2025 15:27

danielvegamyhre changed the title ~~[GroupedMM] Initial prototype of grouped_mm API for torchao~~ [GroupedMM] Initial prototype of grouped_mm API for torchao (forward pass only) Mar 27, 2025

danielvegamyhre added 3 commits March 27, 2025 09:54

lint

c42af73

lint

e61c71d

change func name

94a0cba

danielvegamyhre changed the title ~~[GroupedMM] Initial prototype of grouped_mm API for torchao (forward pass only)~~ [GroupedMM] Initial prototype of _grouped_scaled_mm prototype function for torchao (forward pass only) Mar 27, 2025

danielvegamyhre changed the title ~~[GroupedMM] Initial prototype of _grouped_scaled_mm prototype function for torchao (forward pass only)~~ [GroupedMM] Initial prototype of _grouped_scaled_mm function for torchao (forward pass only) Mar 27, 2025

danielvegamyhre added 4 commits March 27, 2025 10:02

lint

fce469b

B must be 3D

3899bb2

add docstring

5099838

allow other axiswise dims so we can pass in 3D B tensor tranposed

4e04022

danielvegamyhre force-pushed the grouped-mm branch from a472122 to 4e04022 Compare March 27, 2025 18:29

danielvegamyhre added 3 commits March 27, 2025 11:43

clean up

4117a9e

add todo

61f0ee4

lint

dc40622

danielvegamyhre added 3 commits March 27, 2025 12:13

rename var

80b7630

check input dims are compatible

dc013a3

add detailed comments

4f385e5

danielvegamyhre marked this pull request as draft March 27, 2025 23:44

danielvegamyhre changed the title ~~[GroupedMM] Initial prototype of _grouped_scaled_mm function for torchao (forward pass only)~~ [WIP] Initial prototype of _grouped_scaled_mm function for torchao (forward pass only) Mar 27, 2025

danielvegamyhre added 5 commits March 27, 2025 18:39

update comments

72a9b9f

update comments

c4c6c99

add backward pass

cf42af1

add detailed comments

dc6bcf3

2d-2d working

4c5e9db

danielvegamyhre force-pushed the grouped-mm branch from 0dc4c7f to 4c5e9db Compare March 28, 2025 03:03

danielvegamyhre added 2 commits March 27, 2025 20:20

backward working for everything except 2d-3d

c9d30b6

all test cases working

c19bc88

danielvegamyhre force-pushed the grouped-mm branch from 7042cef to c19bc88 Compare March 28, 2025 03:31

docstring

90b99ba

danielvegamyhre commented Mar 28, 2025

View reviewed changes

danielvegamyhre changed the title ~~[WIP] Initial prototype of _grouped_scaled_mm function for torchao (forward pass only)~~ [WIP] Initial prototype of _grouped_scaled_mm function for torchao Mar 28, 2025

danielvegamyhre changed the title ~~[WIP] Initial prototype of _grouped_scaled_mm function for torchao~~ [WIP] Initial prototype of differentiable grouped_scaled_mm function for torchao Mar 28, 2025

danielvegamyhre added 3 commits March 27, 2025 20:49

update test

526d88c

handle jagged 2d tensors

25fa1c8

lint

281950c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Initial prototype of differentiable grouped_scaled_mm function for torchao #1969

[WIP] Initial prototype of differentiable grouped_scaled_mm function for torchao #1969

danielvegamyhre commented Mar 26, 2025 •

edited

Loading

pytorch-bot bot commented Mar 26, 2025 •

edited

Loading

danielvegamyhre Mar 28, 2025 •

edited

Loading

+                          None,
+,
+                          -1,
+                          -2,

[WIP] Initial prototype of differentiable grouped_scaled_mm function for torchao #1969

Are you sure you want to change the base?

[WIP] Initial prototype of differentiable grouped_scaled_mm function for torchao #1969

Conversation

danielvegamyhre commented Mar 26, 2025 • edited Loading

Summary

Test plan

pytorch-bot bot commented Mar 26, 2025 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1969

❌ 4 New Failures

danielvegamyhre Mar 28, 2025 • edited Loading

Choose a reason for hiding this comment

danielvegamyhre commented Mar 26, 2025 •

edited

Loading

pytorch-bot bot commented Mar 26, 2025 •

edited

Loading

danielvegamyhre Mar 28, 2025 •

edited

Loading