-
Notifications
You must be signed in to change notification settings - Fork 143
Add support for single wave stream-k kernels #1975
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
# - [16,16,16,1, 1, 8,8, 1,2] | ||
- DepthU: [256] | ||
- WorkGroup: | ||
- [4,4,4] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
you can add LSU=2 case as well.. but it looks good to me.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks. I tested LSU=2 locally, but left it out to cut down on time added to pre-checkin CI run since I thought LSU=4 is the more common use-case. If LSU=2 is also frequently used, I can add a test in a future change.
Enable LSU + stream-k and add related test cases
Enable LSU + stream-k and add related test cases
Allow wavegroup to be less than 4 in stream-k kernels. This change updates the partials and fixup code to tkae number of waves into account.
Added new test cases to the hgemm stream-k unit tests for wave groups (2,1), (1,2), and (1,1).