We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 1f12d32 commit fd8d18eCopy full SHA for fd8d18e
1 file changed
docs/llm_linux.rst
@@ -152,11 +152,10 @@ Postprocessing
152
===============
153
154
- Model Quantization step produces Pytorch quantized model.
155
-- Model_generate script converts Pytorch quantized model to Onnx format for NPU Execution mode.
+- Model_generate script initially converts Pytorch quantized model to Onnx format and subsequently postprocess to run for NPU Execution mode.
156
157
.. code-block:: bash
158
159
- # Required while converting from Pytorch Quantization to Onnx format
160
pip install onnx-ir
161
162
model_generate --npu <output_dir> <quantized_model_path> --optimize decode
0 commit comments