docs/index.rst
18 additions & 6 deletions
@@ -2,11 +2,13 @@
 QONNX
 ======
 
-.. note:: **QONNX** is currently under active development. APIs will likely change.
+..
+note:: **QONNX** is currently under active development. APIs will likely change.
 
-QONNX (Quantized ONNX) introduces three new custom operators -- `Quant <docs/qonnx-custom-ops/quant_op.md>`_, `BipolarQuant <docs/qonnx-custom-ops/bipolar_quant_op.md>`_ and `Trunc <docs/qonnx-custom-ops/trunc_op.md>`_ -- in order to represent arbitrary-precision uniform quantization in ONNX. This enables:
+QONNX (Quantized ONNX) introduces four new custom operators -- `IntQuant`_, `BipolarQuant`_, `FloatQuant`_, and `Trunc`_
+-- in order to represent arbitrary-precision uniform quantization in ONNX. This enables:
 
-* Representation of binary, ternary, 3-bit, 4-bit, 6-bit or any other quantization.
+* Representation of binary, ternary, 3-bit, 4-bit, 6-bit or any other quantization, or quantized floating-point values.
 
 * Quantization is an operator itself, and can be applied to any parameter or layer input.
 
@@ -33,11 +35,13 @@ Quickstart
 Operator definitions
 +++++++++++++++++++++
 
-* `Quant <docs/qonnx-custom-ops/quant_op.md>`_ for 2-to-arbitrary-bit quantization, with scaling and zero-point
+* `IntQuant`_ for 2-to-arbitrary-bit quantization, with scaling and zero-point
 
-* `BipolarQuant<docs/qonnx-custom-ops/bipolar_quant_op.md>`_ for 1-bit (bipolar) quantization, with scaling and zero-point
+* `BipolarQuant`_ for 1-bit (bipolar) quantization, with scaling and zero-point
 
-* `Trunc <docs/qonnx-custom-ops/trunc_op.md>`_ for truncating to a specified number of bits, with scaling and zero-point
+* `FloatQuant`_ for arbitrary-precision-float-quantized values
+
+* `Trunc`_ for truncating to a specified number of bits, with scaling and zero-point
 
 Installation
 +++++++++++++
@@ -90,11 +94,19 @@ QONNX also uses GitHub actions to run the full test suite on PRs.
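For readers of this diff who want a concrete picture of "2-to-arbitrary-bit quantization, with scaling and zero-point", here is a minimal NumPy sketch of generic uniform quantize-dequantize arithmetic. It is an illustration only and not the QONNX implementation: the function name `uniform_quant`, its parameters, and the choice of rounding and clipping behaviour are all assumptions made for this example. The operator specifications linked from the docs remain the authoritative definition.

```python
import numpy as np


def uniform_quant(x, scale, zero_point, bitwidth, signed=True):
    """Hypothetical helper: generic uniform quantize-dequantize.

    Not taken from QONNX; the rounding mode (NumPy's round-half-to-even)
    and the integer range used below are assumptions for illustration.
    """
    if signed:
        q_min, q_max = -(2 ** (bitwidth - 1)), 2 ** (bitwidth - 1) - 1
    else:
        q_min, q_max = 0, 2 ** bitwidth - 1
    # Map to the integer grid, round, and saturate to the representable range.
    q = np.clip(np.round(x / scale + zero_point), q_min, q_max)
    # Map back to the real-valued domain.
    return (q - zero_point) * scale


x = np.array([-1.0, -0.3, 0.0, 0.26, 2.0])
y = uniform_quant(x, scale=0.25, zero_point=0.0, bitwidth=3)
# 3-bit signed integers span [-4, 3], so 2.0 saturates at 3 * 0.25 = 0.75:
# y is [-1.0, -0.25, 0.0, 0.25, 0.75]
```

Because the bit width is just a parameter here, the same arithmetic covers 3-bit, 4-bit, 6-bit or any other integer precision mentioned in the changed text.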