add int8/tf32 transpose A copy traits #319
base: sycl-develop
@aacostadiaz @joeatodd This PR adds int8 and tf32 transpose copy traits support for both A and B. All transpose cases for bf16/fp16/int8/uint8/tf32 in https://github.com/taozha2/cutlass-fork/blob/zt/gemm_layout_data_type/examples/sycl/pvc/pvc_gemm.cpp#L432 pass (this MUST be run on the latest driver, https://ubit-gfx.intel.com/build/21406574 or later).
@aacostadiaz @mehdi-goli can you merge this PR?
ac617ff to 7a9d570
@taozha2 can you merge in the sycl-develop branch?
b0f693b to 23e2f44
f157825 to 174e71e
@mehdi-goli @aacostadiaz @t4c1 I have rebased the PR; it is ready for review.
I would prefer to avoid the _cacheopts_ variant since we don't use it elsewhere, but if it's needed for some reason then 👍