Release v0.12.1

@nickfraser released this 28 Aug 17:01

Highlights

  • New / Updated PTQ Algorithms:
    • Qronos support #1311
    • Fixes / improvements to rotation equalization #1310, #1312
    • DDP-like bias correction for SDXL (experimental) #1342
  • Improved layer support:
    • Quantization of SDPA without FX #1299
  • New export flows:
    • Initial SHARK export support #1300
    • Initial GGUF export #1291
  • Improved examples:
    • Qronos examples (paper) #1326
    • "Benchmark" experiments for Stable Diffusion and ImageNet #1281
    • Post-training model expansion examples (paper) #1355
  • Allow signed scales #1308
  • QONNX export with dynamo=True #1234

What's Changed

Full Changelog: v0.12.0...v0.12.1