Add amax as default per-row scaling factor for fp8_gemm benchmark #341
jananisriram wants to merge 2 commits into meta-pytorch:main
This pull request was exported from Phabricator. Differential Revision: D80590746
…ta-pytorch#341)
Summary: Add `amax` (absolute maximum) as the default scaling factor for per-row scaling for fp8 GEMMs, as is used in practice.
Reviewed By: NikhilAPatel
Differential Revision: D80590746
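For readers unfamiliar with the term, here is a minimal sketch of what per-row `amax` scaling means. It is not the benchmark's code: the function name `per_row_amax_scale`, the `eps` guard, and the convention `scale = fp8_max / amax` are assumptions made for illustration, and plain Python lists stand in for tensors so the snippet runs without dependencies. The constant 448.0 is the largest finite value of the `float8_e4m3fn` format.

```python
# Illustrative sketch only (hypothetical helper, not the fp8_gemm benchmark code).
FP8_E4M3_MAX = 448.0  # largest finite value representable in float8_e4m3fn


def per_row_amax_scale(matrix, fp8_max=FP8_E4M3_MAX, eps=1e-12):
    """Scale each non-empty row so that its absolute maximum maps to fp8_max.

    Returns (scaled_rows, scales). Dividing a scaled row by its scale
    recovers the original values (up to floating-point rounding).
    """
    scaled_rows, scales = [], []
    for row in matrix:
        amax = max(abs(v) for v in row)   # absolute maximum of this row
        scale = fp8_max / max(amax, eps)  # eps avoids division by zero (assumption)
        scales.append(scale)
        scaled_rows.append([v * scale for v in row])
    return scaled_rows, scales


rows, scales = per_row_amax_scale([[1.0, -2.0], [0.5, 0.25]])
print(scales)  # one scale per row
```

Each row gets its own scale, so a row with small magnitudes still uses the full fp8 range instead of being squeezed by a single tensor-wide factor.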
Force-pushed from 01177cd to 9ee2e8e.
Force-pushed from 9ee2e8e to d676989.
Force-pushed from d676989 to eac3b24.
…ta-pytorch#341)
Summary: Pull Request resolved: meta-pytorch#341
Add `amax` (absolute maximum) as the default scaling factor for per-row scaling for fp8 GEMMs, as is used in practice.
Reviewed By: NikhilAPatel
Differential Revision: D80590746
Force-pushed from eac3b24 to 72d907c.
Force-pushed from 72d907c to b48289d.
…ta-pytorch#341)
Summary: Pull Request resolved: meta-pytorch#341
Add `amax` (absolute maximum) as the default scaling factor for per-row scaling for fp8 GEMMs, as is used in practice.
Reviewed By: NikhilAPatel, xuzhao9
Differential Revision: D80590746
Force-pushed from 207889e to bb8cbe8.
Force-pushed from bb8cbe8 to 221d375.
Force-pushed from 221d375 to 78559a2.
@jananisriram has imported this pull request. If you are a Meta employee, you can view this in D80590746.
…ta-pytorch#341)
Summary: Add `amax` (absolute maximum) as the default scaling factor for per-row scaling for fp8 GEMMs, as is used in practice.
Reviewed By: xuzhao9
Test Plan: Imported from GitHub, without a `Test Plan:` line.
Rollback Plan:
Differential Revision: D80590746
Pulled By: jananisriram
Force-pushed from 199f881 to 69edf00.
Summary: Add `amax` (absolute maximum) as the default scaling factor for per-row scaling for fp8 GEMMs, as is used in practice.
Reviewed By: xuzhao9
Test Plan: Imported from GitHub, without a `Test Plan:` line.
Rollback Plan:
Differential Revision: D80590746
Pulled By: jananisriram
Summary: Add `amax` (absolute maximum) as the default scaling factor for per-row scaling for fp8 GEMMs, as is used in practice.
Reviewed By: NikhilAPatel
Differential Revision: D80590746