Releases: codeplaysoftware/cutlass-sycl
Releases · codeplaysoftware/cutlass-sycl
Cutlass 3.9 sycl backend version 0.1
Based on CUTLASS 3.9.0 March 2025 release
Platforms
- Support for Intel GPU Data Center Max (1100 and 1550)
- Support for Intel Arc B580 ("Battlemage")
Features
-
GEMM/StreamK/SplitK with support for bfloat16 data type
-
Flash attention prefill and decode with KV cache with support for bfloat16 data type
-
Support for epilogue operations:
- Element-wise, row-wise and column-wise bias
- ReLU, SiLU, GELU activation fns
- Softmax
-
Mixed precision GEMM (bfloat16/int8, half/int4) with dequantization support
-
Dual GEMM & Grouped GEMM
Full Changelog: https://github.com/codeplaysoftware/cutlass-sycl/commits/v3.9-0.1