Skip to content

[AMD] Enable TDM with column major tensors and partitioned encodings#10044

Draft
plognjen wants to merge 1 commit intotriton-lang:mainfrom
plognjen:partition_column_major_tdm
Draft

[AMD] Enable TDM with column major tensors and partitioned encodings#10044
plognjen wants to merge 1 commit intotriton-lang:mainfrom
plognjen:partition_column_major_tdm

Conversation

@plognjen
Copy link
Copy Markdown
Contributor

Fixes AMD TDM lowering for partitioned shared layouts when tensors are column-major: logical partitionDim from the encoding must line up with descriptor blockShape / warpsPerCTA, where col-major already applies swapTrailingDims (only the last two axes are exchanged).

@plognjen plognjen force-pushed the partition_column_major_tdm branch from eaadd9a to 8e543bc Compare April 16, 2026 12:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants