Skip to content

Pull requests: codeplaysoftware/cutlass-sycl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Adding Fp8 input support for flash attention prefill
#419 opened Jun 11, 2025 by mehdi-goli Loading…
Template atoms
#417 opened Jun 10, 2025 by t4c1 Draft
A(16bits)xB(8bits) GEMM release
#416 opened Jun 10, 2025 by jiyang1011 Loading…
Fix for U8 transpose release
#392 opened May 27, 2025 by t4c1 Loading…
Update PVC drivers
#391 opened May 26, 2025 by aacostadiaz Loading…
Add documentation for 2D copy
#386 opened May 21, 2025 by aacostadiaz Loading…
Cutlass 4.0
#385 opened May 20, 2025 by aacostadiaz Draft
Enable prefetch iteration
#382 opened May 19, 2025 by t4c1 Loading…
enable splitk for mixed precision gemm release
#381 opened May 19, 2025 by taozha2 Loading…
Check the WarpLayout provided to TiledMMAHelper
#377 opened May 15, 2025 by joeatodd Loading…
Add unit test for FP16 MMA
#368 opened May 12, 2025 by aacostadiaz Loading…
add gemm with rmsnorm
#321 opened Apr 22, 2025 by yuankuns Loading…
add int8/tf32 transpose A copy traits
#319 opened Apr 21, 2025 by taozha2 Loading…
Pure FP8 (W8A8) GEMM support (draft)
#306 opened Apr 14, 2025 by jiyang1011 Loading…
Enable SM90 via sycl-cuda-compat
#276 opened Mar 24, 2025 by FMarno Loading…
ProTip! Add no:assignee to see everything that’s not assigned.