1.19.0 breaks on CUDA 12.9 #207
Upgrading to nvidia-cudnn-frontend 1.19.0 seems to break in a CUDA 12.9 environment. In particular, I frequently run into this error: RuntimeError: Unable to load any libcudart.so.* library. If I downgrade to 1.18.0, it works.
To reproduce:
import torch
from flashinfer.gemm.gemm_base import build_cudnn_gemm_with_per_tensor_q_graph
graph = build_cudnn_gemm_with_per_tensor_q_graph(
    a_shape=(1, 128, 4096), a_stride=(128*4096, 4096, 1),
    b_shape=(1, 4096, 3072), b_stride=(4096*3072, 3072, 1),
    a_type=torch.float8_e4m3fn, b_type=torch.float8_e4m3fn,
    o_type=torch.bfloat16, device=torch.device('cuda:0'),
)
I looked at the commit history and found this change:
df73764#diff-fa77515a2e2c2b179c7d1734c20ed2f56ec6aaf419ac6c4f3184c1c1c9fb5f23R88
but I didn't see any mention of a change in CUDA version support in the release notes. Is this expected?
My environment:
CUDA: Cuda compilation tools, release 12.9, V12.9.86 Build cuda_12.9.r12.9/compiler.36037853_0
Nvidia driver version: 580.65.06
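In case it helps with triage, here is a minimal sketch for checking which libcudart sonames the dynamic loader can resolve in a given environment. The candidate names below are assumptions chosen for illustration; this is not necessarily the list or the lookup logic that nvidia-cudnn-frontend uses, and `probe_cudart` is a hypothetical helper, not part of any library.

```python
import ctypes

# Assumed candidate sonames, for illustration only; the actual names
# probed by nvidia-cudnn-frontend may differ.
CANDIDATES = ("libcudart.so", "libcudart.so.12", "libcudart.so.13")


def probe_cudart(names=CANDIDATES):
    """Map each soname to True if ctypes.CDLL can load it, else False."""
    results = {}
    for name in names:
        try:
            ctypes.CDLL(name)
            results[name] = True
        except OSError:
            # Raised when the loader cannot find or open the library.
            results[name] = False
    return results


if __name__ == "__main__":
    for name, ok in probe_cudart().items():
        print(f"{name}: {'loadable' if ok else 'not loadable'}")
```

Running this under both 1.18.0 and 1.19.0 in the same environment should show whether the loader itself can see a CUDA 12.x runtime, which would help separate a library search path problem from a change in which runtime versions the package looks for.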