v0.2.6

mobicham released this 13 May 11:05

a86e0f4

Fix cuda build
torch.compile() support for hqq_aten
bfloat16 support for vllm/hqq
Update vllm utils to support hqq_gemlite and hqq_torch aliases
Fix vLLM v1 issues
Extend save_to_safetensors to VLMs

Full Changelog: v0.2.5...0.2.6