v0.30.5
What's Changed
- patch by @awni in #3074
- [CUDA] Fallback Event impl when there is no hardware cpu/gpu coherency by @zcbenz in #3070
- Tune CUDA graph sizes on B200 and H100 by @awni in #3077
- [Docs] Simple example of using MLX distributed by @stefpi in #2973
- Use lower-right causal mask alignment consistently by @Anri-Lombard in #2967
- Fix ALiBi slopes for non-power-of-2 num_heads by @vovw in #3071
- More useful error for large indices by @awni in #3079
- Fix NAX condition for iPhone by @awni in #3083
- Fallback to pinned host memory when managed memory is not supported by @zcbenz in #3075
- Fix failing python tests on Windows by @zcbenz in #3076
- [Metal] Tune splitk gemm dispatch conditions and partition sizes by @awni in #3087
- Fix for NAX overflow by @awni in #3092
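For context on the ALiBi slopes fix above: the entry does not describe the patch itself, so the following is only a sketch of the scheme commonly used for head counts that are not a power of two (the one from the ALiBi reference implementation), not MLX's code. The function name `alibi_slopes` and its helper are hypothetical.

```python
import math


def alibi_slopes(num_heads):
    """Per-head ALiBi slopes (illustrative sketch, not MLX's implementation)."""
    if num_heads < 1:
        raise ValueError("num_heads must be positive")

    def power_of_two_slopes(n):
        # Geometric sequence starting at 2^(-8/n) with the same ratio.
        start = 2.0 ** (-8.0 / n)
        return [start ** (i + 1) for i in range(n)]

    # Power-of-two head counts use the plain geometric sequence.
    if num_heads & (num_heads - 1) == 0:
        return power_of_two_slopes(num_heads)

    # Otherwise: take the slopes for the closest lower power of two, then
    # fill the remaining heads with every other slope from the sequence
    # for twice that power of two.
    closest = 2 ** math.floor(math.log2(num_heads))
    extra = power_of_two_slopes(2 * closest)[0::2][: num_heads - closest]
    return power_of_two_slopes(closest) + extra
```

For example, `alibi_slopes(12)` returns the 8 slopes for 8 heads followed by 4 interleaved slopes drawn from the 16-head sequence.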
New Contributors
Full Changelog: v0.30.4...v0.30.5
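Note on the lower-right causal mask alignment entry in this release: when the query and key/value sequence lengths differ, "lower-right" alignment anchors the causal diagonal at the bottom-right corner of the mask, so the last query attends to every key. The release notes do not say which functions were changed, so the snippet below only illustrates the convention in plain Python; `causal_mask` and its parameters are hypothetical names, not MLX API.

```python
def causal_mask(q_len, kv_len, lower_right=True):
    """Boolean causal mask as nested lists; True means the query may attend.

    Illustrative sketch only. With lower_right=True the diagonal is shifted
    by (kv_len - q_len) so it ends at the bottom-right corner; with
    lower_right=False it starts at the top-left corner.
    """
    offset = kv_len - q_len if lower_right else 0
    return [[k <= q + offset for k in range(kv_len)] for q in range(q_len)]


if __name__ == "__main__":
    # Print both alignments for 2 queries against 4 keys.
    for row in causal_mask(2, 4, lower_right=True):
        print(row)
    for row in causal_mask(2, 4, lower_right=False):
        print(row)
```

The two conventions coincide when `q_len == kv_len`; they differ only when the lengths do not match, for instance when decoding with a key/value cache.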