v0.1.2 release
What's Changed
- MoE: Optimize and fix moe_align_block_size & moe_lora_align_block_size kernels by @chaojun-zhang in #133
- [CI] add bmg g31 and update docker file by @jikunshang in #144
- [CI] disable time consuming ci and update seed. by @jikunshang in #145
- Add fp8 mxfp8 block quant kernel by @Yejing-Lai in #138
- [sycl-tla] remove unnecessary headers by @xinyu-intel in #129
- init value before atomic in reduction kernel by @xinyu-intel in #149
- [OneDNN] update onednn to 3.11 by @zufangzhu in #143
- [Quant] update fp8 quant kernel by @zufangzhu in #147
New Contributors
- @Yejing-Lai made their first contribution in #138
Full Changelog: v0.1.1...v0.1.2