Skip to content

v0.28.0

Choose a tag to compare

@angeloskath angeloskath released this 07 Aug 07:50
· 240 commits to main since this release
56be773

Highlights

  • First version of fused sdpa vector for CUDA
  • Convolutions in CUDA
  • Speed improvements in CUDA normalization layers, softmax, compiled kernels, overheads and more

What's Changed

New Contributors

Full Changelog: v0.27.1...v0.28.0