[GPUHeuristics] Remove MNT boost for VeryLargeGemm on CDNA4 by Yu-Zhewen · Pull Request #23876 · iree-org/iree

Yu-Zhewen · 2026-03-20T16:28:11Z

#23652 added boostMNTileCountPerSubgroup=32 for CDNA4 LargeGemm but
applied the same boost to VeryLargeGemm. That PR only benchmarked
LargeGemm shapes and didn't cover VeryLargeGemm.

For LargeGemm the heuristic selects the MFMA_F32_16x16x32 intrinsic
where MNT=32 fits within register limits. However, for VeryLargeGemm
shapes (e.g. 16384x16384x16384), the heuristic prefers the larger
MFMA_F32_32x32x16 intrinsic, and the boosted MNT=32 results in VGPR
spilling, causing a ~10x regression on mi355x:

Metric	Before	After
Time	10 ms	104 ms
Scratch Allocation	0 B/work-item	1208 B/work-item
VGPRs	216	256 (max)
VMEM instructions	71M	618M (8.7x)

This patch removes boostMNTileCountPerSubgroup and
minUtilizationThreshold from VeryLargeGemm CDNA4 seeds, reverting
to default. LargeGemm seeds are unchanged.

Signed-off-by: Yu-Zhewen <zhewenyu@amd.com>

init commit

e58ac6f

Signed-off-by: Yu-Zhewen <zhewenyu@amd.com>

Yu-Zhewen marked this pull request as ready for review March 20, 2026 17:03

Yu-Zhewen requested review from Groverkss, Max191, krzysz00, nirvedhmeshram and qedawkins as code owners March 20, 2026 17:03

Yu-Zhewen requested review from jerryyin and lialan March 20, 2026 17:03

lialan approved these changes Mar 20, 2026

View reviewed changes

Yu-Zhewen merged commit 7a5fc38 into iree-org:main Mar 21, 2026
53 of 57 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GPUHeuristics] Remove MNT boost for VeryLargeGemm on CDNA4#23876

[GPUHeuristics] Remove MNT boost for VeryLargeGemm on CDNA4#23876
Yu-Zhewen merged 1 commit intoiree-org:mainfrom
Yu-Zhewen:revert_verylarge_gemm

Yu-Zhewen commented Mar 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Yu-Zhewen commented Mar 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants