-
Notifications
You must be signed in to change notification settings - Fork 119
Pull requests: pytorch/ao
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[low-bit optim] Fix load state dict when device is different
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1021
opened Oct 5, 2024 by
gau-nernst
Loading…
Add generic fake quantized linear for QAT
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1020
opened Oct 4, 2024 by
andrewor14
Loading…
Make module swap the main QAT flow again
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1019
opened Oct 4, 2024 by
andrewor14
Loading…
Add quantized embedding kernels to torchao
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
fb-exported
#1018
opened Oct 4, 2024 by
metascroy
Loading…
Dynamic Float8 benchmarking llama
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1017
opened Oct 4, 2024 by
jainapurva
Loading…
[not for land] float8 training -> quantize_
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1016
opened Oct 4, 2024 by
vkuzo
Loading…
Re-run regression tests with CUDA_LAUNCH_BLOCKING
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Temporary Fix: Skip TestAffineQuantizedTensorParallel on H100
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#1001
opened Oct 3, 2024 by
jainapurva
•
Draft
Enable ROCM in CI
ciflow/rocm
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
module: rocm
#999
opened Oct 3, 2024 by
msaroufim
Loading…
Kleidi 4b blockwise gemv prototype
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#997
opened Oct 2, 2024 by
digantdesai
•
Draft
Subclass API (#966)
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
fb-exported
#995
opened Oct 2, 2024 by
metascroy
Loading…
manual expert from fbcode
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
add "_gemm_input_role" to dunder slots
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#984
opened Oct 1, 2024 by
crcrpar
Loading…
[wip] SpinQuant
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#983
opened Oct 1, 2024 by
tobiasvanderwerff
Loading…
4 of 6 tasks
[autoquant] Fix the autoquant multi_head_attention torch_function dispatch
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#977
opened Sep 30, 2024 by
IvanKobzarev
Loading…
cleaned up and tested tp support
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#976
opened Sep 30, 2024 by
debajyotidatta
Loading…
Introduce lowbit quantized linear MPS kernels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
fb-exported
#954
opened Sep 26, 2024 by
manuelcandales
Loading…
float8 training axiswise scaling support with per-gemm-argument configuration
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#940
opened Sep 24, 2024 by
vkuzo
Loading…
add axiswise scaling to Float8Linear
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#920
opened Sep 23, 2024 by
vkuzo
Loading…
add axiswise granularity to Float8Tensor
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#919
opened Sep 23, 2024 by
vkuzo
Loading…
[wip] fp8 + 24sparse benchmarking
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
[float8] fuse abs/max with torch.linalg.vector_norm
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
float8 profile script: add activation checkpointing
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#892
opened Sep 16, 2024 by
vkuzo
Loading…
Add blocksparse_int_addmm. Eliminate unnecessary contiguous calls which leads to performance increase.
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Previous Next
ProTip!
no:milestone will show everything without a milestone.