Skip to content

v0.30.5

Choose a tag to compare

@awni awni released this 03 Feb 02:56
· 197 commits to main since this release
adcbb91

What's Changed

  • patch by @awni in #3074
  • [CUDA] Fallback Event impl when there is no hardware cpu/gpu coherency by @zcbenz in #3070
  • Tune CUDA gaph sizes on B200 and H100 by @awni in #3077
  • [Docs] Simple example of using MLX distributed by @stefpi in #2973
  • Use lower-right causal mask alignment consistently by @Anri-Lombard in #2967
  • Fix ALiBi slopes for non-power-of-2 num_heads by @vovw in #3071
  • More useful error for large indices by @awni in #3079
  • Fix nax condition for iphone by @awni in #3083
  • Fallback to pinned host memory when managed memory is not supported by @zcbenz in #3075
  • Fix failing python tests on Windows by @zcbenz in #3076
  • [Metal] Tune splitk gemm dispatch conditions and partition sizes by @awni in #3087
  • Fix for NAX overflow. by @awni in #3092

New Contributors

Full Changelog: v0.30.4...v0.30.5