Add quick-tune support for Attention #2169
base: develop
Conversation
mlir/include/mlir/Dialect/Rock/Tuning/QuickTuningPerfconfigs.inc
nit: should the attention/gemm+gemm entries get their own file?
#ifdef Attn_LOOKUP_TABLE_GEN
{"gfx900_attention_f32", {PopulateParamsAttn::initParametersAttentionGfx900, PopulateParamsAttn::nInitParametersAttentionGfx900}},
I guess gemm+gemm kernels will temporarily use the attention quick-tuning list, but eventually we'll have a dedicated list for gemm_gemm (and potentially conv+gemm)? We already have a tier1-gemmgemm list.
Yes, the ticket is in progress https://github.com/ROCm/rocMLIR-internal/issues/2019
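To make the shape of the generated table concrete, here is a minimal, self-contained C++ sketch of the lookup pattern suggested by the entry above: a "<arch>_<op>_<dtype>" key maps to a pointer/count pair over a static array of initial perf-configs. The field names, table type, and fallback behavior are illustrative assumptions, not rocMLIR's actual definitions.

```cpp
#include <cstddef>
#include <iostream>
#include <map>
#include <string>

// Hypothetical stand-in for the per-config payload; real perf-configs
// carry more tuning fields than this.
struct InitParamsAttn {
  int mPerBlock, nPerBlock, kPack;
};

// Stand-in for PopulateParamsAttn::initParametersAttentionGfx900.
static const InitParamsAttn initParamsAttnGfx900[] = {
    {64, 64, 4},
    {128, 64, 8},
};

// Mirrors the {pointer, count} pair in the generated table entry.
struct QuickTuneList {
  const InitParamsAttn *params;
  std::size_t nParams;
};

static const std::map<std::string, QuickTuneList> quickTuneTable = {
    {"gfx900_attention_f32",
     {initParamsAttnGfx900,
      sizeof(initParamsAttnGfx900) / sizeof(InitParamsAttn)}},
};

int main() {
  auto it = quickTuneTable.find("gfx900_attention_f32");
  if (it == quickTuneTable.end())
    return 1; // no quick-tune list: the caller would fall back to full tuning
  for (std::size_t i = 0; i < it->second.nParams; ++i)
    std::cout << it->second.params[i].mPerBlock << "x"
              << it->second.params[i].nPerBlock << "\n";
  return 0;
}
```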
- Retire InitParamsNonAccel and InitParamsAccel
- Combine MfmaGemmParamsAttr and WmmaGemmParamsAttr into AccelGemmParamsAttr (see the sketch below)
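As a hedged sketch of that merge: rocMLIR's actual AccelGemmParamsAttr is an MLIR attribute (defined in TableGen), so this C++ struct only illustrates the idea of one parameter set shared by the MFMA and WMMA lowering paths. All field names here are assumptions, not the real attribute definition.

```cpp
#include <cstdint>

// Hypothetical discriminator: which accelerator instruction family the
// lowering should emit for this GEMM.
enum class AccelKind { Mfma, Wmma };

// One parameter struct for every accelerated GEMM path, replacing the
// separate MFMA- and WMMA-specific types; the lowering selects MFMA or
// WMMA instructions from `kind` rather than from the parameter type.
struct AccelGemmParams {
  AccelKind kind;
  uint32_t mPerBlock;
  uint32_t nPerBlock;
  uint32_t kpackPerBlock;
  uint32_t mPerWave;
  uint32_t nPerWave;
  bool forceUnroll;
};
```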
Motivation
Implement proper quick tuning support for Attention operations.
Technical Details
Resolves https://github.com/ROCm/rocMLIR-internal/issues/1887
Resolves https://github.com/ROCm/rocMLIR-internal/issues/1447
Test Plan
Test Result
Submission Checklist