Commit 92cbc28
Optimize GPU reductions
- Use a 2-level approach with atomics
- Support DSA_REDUCTION_MUL for nested for directives
- Clean up code

1 parent 3fb7673 commit 92cbc28
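
For readers unfamiliar with the "2-level approach with atomics" named in the commit message, the general idea can be sketched on the host as follows. This is not the code from this commit: `std::thread` stands in for a GPU team, and `atomic_combine` and `two_level_reduce` are hypothetical names chosen for illustration. Each team first reduces its own chunk into a private partial result (level 1), then folds that partial into the shared result with a single atomic update (level 2). The compare-and-swap loop is one common way to handle operators such as multiply that have no native atomic instruction; whether the pass handles DSA_REDUCTION_MUL this way is an assumption.

```cpp
#include <algorithm>
#include <atomic>
#include <cassert>
#include <cstddef>
#include <thread>
#include <vector>

// Hypothetical helper: atomically apply `op` to `target` using a CAS loop.
// Useful for operators without a native atomic instruction (e.g. multiply).
template <typename T, typename Op>
void atomic_combine(std::atomic<T>& target, T value, Op op) {
    T expected = target.load(std::memory_order_relaxed);
    // On failure compare_exchange_weak reloads `expected`, so just retry.
    while (!target.compare_exchange_weak(expected, op(expected, value),
                                         std::memory_order_relaxed)) {
    }
}

// Hypothetical two-level reduction. `op` must be associative and commutative,
// and `identity` must be its neutral element (0 for add, 1 for multiply).
template <typename T, typename Op>
T two_level_reduce(const std::vector<T>& data, T identity, Op op,
                   std::size_t num_teams) {
    assert(num_teams > 0);
    std::atomic<T> result{identity};
    std::vector<std::thread> teams;
    // Ceiling division so every element belongs to exactly one team.
    const std::size_t chunk = (data.size() + num_teams - 1) / num_teams;
    for (std::size_t t = 0; t < num_teams; ++t) {
        teams.emplace_back([&, t] {
            const std::size_t begin = std::min(t * chunk, data.size());
            const std::size_t end = std::min(begin + chunk, data.size());
            // Level 1: reduce this team's chunk into a private partial.
            T partial = identity;
            for (std::size_t i = begin; i < end; ++i) {
                partial = op(partial, data[i]);
            }
            // Level 2: one atomic update per team, not one per element.
            atomic_combine(result, partial, op);
        });
    }
    for (auto& team : teams) {
        team.join();
    }
    return result.load();
}
```

The point of the two levels is to keep contention on the shared location proportional to the number of teams instead of the number of elements. For example, `two_level_reduce<long>(v, 1L, [](long a, long b) { return a * b; }, 4)` would compute a product with at most four atomic updates.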
3 files changed (+169, -246 lines)
File tree
- src/numba/openmp/libs/pass