v0.5.0

@wenhuach21 wenhuach21 released this 22 Apr 08:05
e90f991

Highlights

  • refine auto-round format inference, support 2, 3, 4, and 8 bits and the Marlin kernel, and fix several bugs in the auto-round format
  • support XPU in tuning and inference by @wenhuach21 in #481
  • support more VLMs by @n1ck-guo in #390
  • change the quantization method name and make several refinements by @wenhuach21 in #500
  • support RTN via iters==0 by @wenhuach21 in #510
  • fix a bug with mixed calibration datasets by @n1ck-guo in #492
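
For readers unfamiliar with RTN (round-to-nearest): it maps each weight to the closest point on the quantization grid without any tuning, which is presumably what remains when the iteration count is zero. The sketch below is a generic, pure-Python illustration of group-wise asymmetric RTN, not auto-round's actual implementation; the function name `rtn_quantize` and the flat-list weight layout are hypothetical.

```python
def rtn_quantize(weights, bits=4, group_size=4):
    """Quantize then dequantize a flat list of floats with round-to-nearest.

    Hypothetical helper for illustration only: each group of `group_size`
    weights gets its own min/max-based scale and offset.
    """
    qmax = 2 ** bits - 1  # largest integer level, e.g. 15 for 4 bits
    dequantized = []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        lo, hi = min(group), max(group)
        # Fall back to 1.0 when the group is constant (hi == lo).
        scale = (hi - lo) / qmax or 1.0
        for w in group:
            q = round((w - lo) / scale)      # nearest integer level
            q = max(0, min(qmax, q))         # clamp to the valid range
            dequantized.append(q * scale + lo)
    return dequantized


if __name__ == "__main__":
    original = [0.0, 0.1, 0.5, 1.0]
    print(rtn_quantize(original, bits=4, group_size=4))
```

With this scheme the reconstruction error of every weight is at most half a quantization step, which is the baseline that tuning-based methods try to improve on.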

What's Changed

Full Changelog: v0.4.7...v0.5.0