Hi, thanks a lot for your team hardworks, beside, can you guys guide me to encounter this errors while build this application within a docker image?
Description
I’m encountering the following CUDA runtime error when running fused_sim from NVIDIA’s 3dgrut library inside a Docker environment:
RuntimeError: CUDA error: no kernel image is available for execution on the device
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
Important note:
The exact same code works correctly in a Conda environment. The error appears only when running inside Docker.
Environment
- GPUs: 2× NVIDIA H100
- PyTorch: 2.1.2
- **3dgrut version: ** latest
- Cuda version: 11.8
What I Have Tried
-
Checked the architecture list used by PyTorch:
echo "TORCH_CUDA_ARCH_LIST=$TORCH_CUDA_ARCH_LIST"
Output:
→ SM 90 (Hopper) is included.
-
Rebuilt the Docker image multiple times with different settings.
-
Verified that outside Docker (Conda environment) everything runs fine → issue seems isolated to how CUDA kernels or extensions are compiled inside the container.
Hi, thanks a lot for your team hardworks, beside, can you guys guide me to encounter this errors while build this application within a docker image?
Description
I’m encountering the following CUDA runtime error when running
fused_simfrom NVIDIA’s 3dgrut library inside a Docker environment:Important note:
The exact same code works correctly in a Conda environment. The error appears only when running inside Docker.
Environment
What I Have Tried
Checked the architecture list used by PyTorch:
Output:
→ SM 90 (Hopper) is included.
Rebuilt the Docker image multiple times with different settings.
Verified that outside Docker (Conda environment) everything runs fine → issue seems isolated to how CUDA kernels or extensions are compiled inside the container.