Commit 4f929fe
[TRTLLM-10407][perf] Add cute dsl single pass multi cta cluster topk (NVIDIA#12354)
Signed-off-by: Mindy Li <11663212+limin2021@users.noreply.github.com>
1 parent 7aa1383 commit 4f929fe
File tree
4 files changed, +1109 -232 lines changed
- tensorrt_llm/_torch
  - custom_ops
  - cute_dsl_kernels/blackwell/top_k
- tests/unittest/_torch/thop/parallel