Added support for, and fixed the parallel scan in, CSR kernels for the Blackwell (SM_120) architecture. Added extra CUDA matrix tests. #2012
+1,153
−4