v0.2.6
- Fix CUDA build
- torchcompile() support for hqq_aten
- bfloat16 support for vllm/hqq
- Update vllm utils to support hqq_gemlite and hqq_torch aliases
- Fix vLLM v1 issues
- Extend save_to_safetensors to VLMs
Full Changelog: v0.2.5...0.2.6