Commit 92cbc28

Optimize GPU reductions
- Use a 2-level approach with atomics
- Support DSA_REDUCTION_MUL for nested for directives
- Clean up code
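
The commit message does not include the kernel code, so the following is only a minimal CPU-side sketch of what a 2-level reduction with atomics can look like, written in C++ with threads standing in for GPU blocks. Each "block" first reduces its own slice into a private partial value (level 1), then combines that partial into the shared result with a single atomic update (level 2). The names `atomic_mul` and `two_level_product` are hypothetical and not taken from the commit. Multiplication is done with a compare-and-swap loop because `std::atomic` has no native multiply operation, which is likely similar to what a multiplicative reduction such as DSA_REDUCTION_MUL would need on a GPU.

```cpp
#include <algorithm>
#include <atomic>
#include <cstddef>
#include <thread>
#include <vector>

// Hypothetical helper (not from the commit): atomically multiply `target`
// by `factor`. There is no fetch_mul, so retry with compare-and-swap.
inline void atomic_mul(std::atomic<long long>& target, long long factor) {
    long long expected = target.load(std::memory_order_relaxed);
    // On failure, compare_exchange_weak reloads `expected` with the current
    // value, so the product is recomputed on the next iteration.
    while (!target.compare_exchange_weak(expected, expected * factor,
                                         std::memory_order_relaxed)) {
    }
}

// Hypothetical two-level product reduction. Threads model GPU blocks.
long long two_level_product(const std::vector<long long>& data,
                            std::size_t num_blocks) {
    if (num_blocks == 0) num_blocks = 1;
    std::atomic<long long> result{1};  // identity element for multiplication
    const std::size_t chunk = (data.size() + num_blocks - 1) / num_blocks;

    std::vector<std::thread> blocks;
    for (std::size_t b = 0; b < num_blocks; ++b) {
        blocks.emplace_back([&data, &result, chunk, b] {
            const std::size_t begin = std::min(b * chunk, data.size());
            const std::size_t end = std::min(begin + chunk, data.size());

            // Level 1: block-private partial result, no synchronization.
            long long partial = 1;
            for (std::size_t i = begin; i < end; ++i) partial *= data[i];

            // Level 2: one atomic combine per block.
            atomic_mul(result, partial);
        });
    }
    for (auto& t : blocks) t.join();
    return result.load();
}
```

Calling `two_level_product(values, 4)` splits `values` across four workers. The point of the design is that contention on the shared result scales with the number of blocks rather than the number of elements.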
1 parent 3fb7673 commit 92cbc28

3 files changed

+169
-246
lines changed
