Open
Description
The section on auto-parallelism intermittently fails on CI, raising CUDA out-of-memory runtime errors. See the failing CI for more details here.
Pitch: Maybe reduce the size of the tensors.
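To get a rough feel for how much shrinking the tensors could help, here is a back-of-the-envelope sketch. It assumes dense float32 square matrices and that several result tensors stay alive at the same time. The helper names and the sizes in the loop are hypothetical and are not taken from the failing notebook.

```python
def tensor_bytes(rows, cols, bytes_per_element=4):
    """Bytes needed for one dense rows x cols tensor (float32 = 4 bytes per element)."""
    return rows * cols * bytes_per_element


def workload_mib(side, num_results, bytes_per_element=4):
    """Approximate MiB held if num_results square result tensors are alive at once."""
    total_bytes = num_results * tensor_bytes(side, side, bytes_per_element)
    return total_bytes / 2**20


# Hypothetical sizes for illustration only (assumption, not from the notebook).
for side in (4000, 2000, 1000):
    print(f"side={side}: {workload_mib(side, num_results=50):.1f} MiB")
```

Because the footprint grows with the square of the side length, halving the side cuts the estimate to a quarter. The estimate ignores allocator overhead and any temporaries, so actual usage on CI may be higher.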
cc @astonzhang