2 parents fd8d18e + 0da3239, commit edaf3f5
1 file changed
docs/oga_model_prepare.rst
@@ -84,6 +84,7 @@ Use following command to run Quantization. In a GPU equipped Linux machine the q
- Use ``--data_type bfloat16`` for bf16 pretrained model. For fp32/fp16 pretrained model use ``--datatype float16``
+- Not using ``--exclude_layers`` parameter may result in model-specific defaults which may exclude certain layers like output layers.
The quantized model is generated in the <quantized safetensor output dir> folder.