add tileN = 8,16 for SM120 blockscale GEMM. by b8zhong · Pull Request #3292 · NVIDIA/cutlass

b8zhong · 2026-06-02T15:36:30Z

It will be for use with SwapAB.

b8zhong · 2026-06-02T15:37:13Z

Hi @depaulmillz , I was wondering if you could take a look at this PR? Since I noticed you were the last one to add TileN = 32. Thanks!

depaulmillz · 2026-06-03T20:24:10Z

Awesome. Have you been able to try with group GEMM as well?

b8zhong · 2026-06-03T21:41:20Z

@depaulmillz Yes. Technically, it works (this PR is also compatible with group GEMM changes as well). But for example when testing on two common cases, DSR1 TP = 8 and Qwen-3 MoE TP = 1, the speedup can only be 3-5% for BS = 1. So it's faster (as expected), but not by much.

depaulmillz · 2026-06-06T00:16:09Z

It looks like you will need to add an assertion to prevent compiling ping-pong with MMA_N=8 which will expect a (2,2,1) layout shape for the MMA. I saw some ref check errors when testing the MR on pingpong MMA_N=8 kernels due to this.

add tileN = 8,16

c4e701b

b8zhong mentioned this pull request Jun 2, 2026

[draft] add tileN = 8,16 to SM120 blockscale GEMM. flashinfer-ai/flashinfer#3495

Draft

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add tileN = 8,16 for SM120 blockscale GEMM.#3292

add tileN = 8,16 for SM120 blockscale GEMM.#3292
b8zhong wants to merge 1 commit into
NVIDIA:mainfrom
bzhng-development:brayden/sm120-tile-n-16

b8zhong commented Jun 2, 2026 •

edited

Loading

Uh oh!

b8zhong commented Jun 2, 2026

Uh oh!

depaulmillz commented Jun 3, 2026

Uh oh!

b8zhong commented Jun 3, 2026

Uh oh!

depaulmillz commented Jun 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

b8zhong commented Jun 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

b8zhong commented Jun 2, 2026

Uh oh!

depaulmillz commented Jun 3, 2026

Uh oh!

b8zhong commented Jun 3, 2026

Uh oh!

depaulmillz commented Jun 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

b8zhong commented Jun 2, 2026 •

edited

Loading