Commit 0bff14b
[AMD] Support 8-Warp Pingpong and Refactor MXGEMM Kernel on GFX1250 (#9356)
This PR:
- Refactors the MXGEMM kernel to support multiple schedules
- Adds support for 8-warp scheduling and 8-warp pingpong scheduling
---------
Co-authored-by: Lei Zhang <antiagainst@gmail.com>
1 parent 0019e67
1 file changed
Lines changed: 563 additions & 132 deletions