We have to set `HIPBLASLT_ALLOW_TF32=1` to enable TF32.

* https://github.com/ROCm/pytorch/pull/1838/files
* https://github.com/ROCm/pytorch/issues/1911
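
As a minimal sketch of how the variable might be set from Python: the snippet below exports `HIPBLASLT_ALLOW_TF32=1` before `torch` is imported. It assumes the variable is read when the library initializes, so setting it first is the cautious order. The helper name `enable_hipblaslt_tf32` is hypothetical, and the `torch.backends.cuda.matmul.allow_tf32` line is the usual PyTorch TF32 switch; whether it is also needed alongside the environment variable is an assumption, not something the links above confirm.

```python
import os


def enable_hipblaslt_tf32(env=os.environ):
    """Set HIPBLASLT_ALLOW_TF32=1 in the given environment mapping.

    Hypothetical helper for illustration; it only writes the variable.
    """
    env["HIPBLASLT_ALLOW_TF32"] = "1"
    return env


# Set the variable first, before torch is imported (assumed to matter).
enable_hipblaslt_tf32()

try:
    import torch

    # Standard PyTorch flag for TF32 matmuls; assumed to be relevant here too.
    torch.backends.cuda.matmul.allow_tf32 = True
except ImportError:
    # torch is not installed; the environment variable is still set.
    torch = None

print("HIPBLASLT_ALLOW_TF32 =", os.environ["HIPBLASLT_ALLOW_TF32"])
```

Setting the variable in the shell before launching the process (for example `HIPBLASLT_ALLOW_TF32=1 python train.py`, where `train.py` is a placeholder script name) avoids any dependence on import order.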