Skip to content

Quantization Guide

Abhiram S edited this page Sep 23, 2025 · 4 revisions

Quantization Guide

Symmetric quantization flows with integer GEMM.

Integer GEMM Variants

Symmetric Quant APIs

  • s8s8s32→f32 (sym): anchor on GEMM page
  • s8s8s32→bf16 (sym): anchor on GEMM page

Tips

  • Calibrate scales and zero-points
  • Validate accuracy vs. float baselines

Clone this wiki locally