v0.2.6

@mobicham mobicham released this 13 May 11:05
· 40 commits to master since this release
  • Fix CUDA build
  • torch.compile() support for hqq_aten
  • bfloat16 support for vllm/hqq
  • Update vllm utils to support hqq_gemlite and hqq_torch aliases
  • Fix vLLM v1 issues
  • Extend save_to_safetensors to VLMs

Full Changelog: v0.2.5...0.2.6