Enable blockwise FP8 dense training kernels on ROCm #4036
Open
brucechanglongxu wants to merge 2 commits into pytorch:main