Extend CK fmha_batch_prefill kernel coverage to head_dim=256 #3328

vivienfanghuagood · 2025-11-28T08:15:45Z

Proposed changes

I add CK to generate fmha_batch_prefill kernels for hdim_q=hdim_v=256 in group mode (paged KV). It's required because in Qwen3-Next, the head_dim is 256 which is not supported in AITER Attention Backend(the default option in AMD GPUs)

Checklist

Please put an x into the boxes that apply. You can also fill these out after creating the PR. If you're not sure, please don't hesitate to ask.

I have added tests relevant to the introduced functionality, and the unit tests are passing locally
I have added the test to REGRESSION_TESTS list defined at the top of CMakeLists.txt in tests/CMakeLists.txt, IF the test takes more than 30 seconds to run.
I have added inline documentation which enables the maintainers with understanding the motivation
I have removed the stale documentation which is no longer relevant after this pull request
(If this change is user-facing) I have added release notes which provide the end users with a brief summary of the improvement from this pull request
I have run clang-format on all changed files
Any dependent changes have been merged

Discussion

If this is a relatively large or complex change, feel free to start a discussion by explaining why you chose the solution you did and what alternatives you considered

Extend CK fmha_batch_prefill kernel coverage to head_dim=256

c53e12b

vivienfanghuagood requested review from ThomasNing, afagaj, andriy-ca, aosewski, asleepzzz, bartekxk, carlushuang, cgmillette, coderfeli, geyyer, illsilin, poyenc, qianfengz, shumway, tenpercent and vidyasagar-amd as code owners November 28, 2025 08:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Extend CK fmha_batch_prefill kernel coverage to head_dim=256 #3328

Extend CK fmha_batch_prefill kernel coverage to head_dim=256 #3328

vivienfanghuagood commented Nov 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Extend CK fmha_batch_prefill kernel coverage to head_dim=256 #3328

Are you sure you want to change the base?

Extend CK fmha_batch_prefill kernel coverage to head_dim=256 #3328

Conversation

vivienfanghuagood commented Nov 28, 2025

Proposed changes

Checklist

Discussion

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant