Onnx transformers: Quantize option #6
base: master
Conversation
- changed framework type to "pt"
- added a usage example with the NER pipeline (see the sketch below)
- reading configuration from the model config
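For reference, a hedged sketch of what that usage example might look like. The `pipeline` import follows the onnx_transformers README; the `quantized` flag is the option this PR introduces:

```python
from onnx_transformers import pipeline

# NER pipeline running on ONNX Runtime; `quantized=True` is the new
# option from this PR and only takes effect together with `onnx=True`.
nlp = pipeline("ner", onnx=True, quantized=True)
print(nlp("My name is Wolfgang and I live in Berlin."))
```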
modelcard = config
# TODO: Disable modelcard (below 4 lines) if working with local models.
# searches modelcard.json
if not local_model:
We can keep it as it is. Does it break for local models? If not, let's keep it as it is.
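For context, a minimal sketch of the guard in that hunk, assuming the pipeline resolves the modelcard from a string identifier as in upstream transformers (`ModelCard` is the transformers class; the exact call is illustrative):

```python
from transformers import ModelCard

modelcard = config
if not local_model:
    # Only search the hub for modelcard.json when the model is remote;
    # local checkpoints typically ship without one, which is what the
    # new `local_model` flag guards against.
    if isinstance(modelcard, str):
        modelcard = ModelCard.from_pretrained(modelcard)
```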
@@ -694,6 +704,20 @@ def _forward(self, inputs, return_tensors=False):
        else:
            return predictions.numpy()

    def _create_quantized_graph(self, onnx_opt_model_path):
        # TODO: add a gpt2 option if needed
        opt_options = BertOptimizationOptions('bert')
Check the model type explicitly, and raise an assert or exception if the model is not BERT.
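A hedged sketch of that check, assuming the pipeline keeps its transformers config on `self.config` (the attribute name is an assumption, not taken from the diff):

```python
def _create_quantized_graph(self, onnx_opt_model_path):
    # Fail fast instead of silently applying BERT-specific graph
    # optimizations to an unsupported architecture.
    model_type = getattr(self.config, "model_type", None)
    if model_type != "bert":
        raise ValueError(
            f"Quantization currently supports only 'bert' models, got '{model_type}'."
        )
    opt_options = BertOptimizationOptions("bert")
    # ... rest of the optimization / quantization as in the diff above
```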
@@ -555,7 +559,13 @@ def __init__(
         logger.info(f"loading onnx graph from {self.graph_path.as_posix()}")
         self.onnx_model = create_model_for_provider(str(graph_path), "CPUExecutionProvider")
         self.input_names = json.load(open(input_names_path))
-        self.framework = "np"
+        self.framework = "pt"
This will cause other things to break. Are all tests passing? You can run the tests with `make test`.
Set `onnx` to `False` for standard torch inference.
Set `quantized` to `True` for quantize with Onnx. ( set `onnx` to True)
I've made the changes we discussed in the PR. I added a `local_model` option to the pipeline. It ignores `modelcard`, so local models without a modelcard can still be loaded. I kept the framework as torch. In some cases, like loading local models, I got the error `InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Unexpected input data type.` We can leave it like this to stay safe.
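That INVALID_ARGUMENT error usually means the session was fed tensors whose type doesn't match the graph's declared inputs, e.g. torch tensors or int32 arrays where the exported graph expects int64. A defensive conversion sketch (names are illustrative) that would make the inputs safe regardless of the tokenizer framework:

```python
import numpy as np
import torch

def to_onnx_inputs(tokenizer_output):
    # Coerce tokenizer output (torch tensors or numpy arrays) into the
    # int64 numpy arrays that onnxruntime's InferenceSession.run expects
    # for input_ids / attention_mask style inputs.
    converted = {}
    for name, value in tokenizer_output.items():
        if isinstance(value, torch.Tensor):
            value = value.cpu().numpy()
        converted[name] = np.asarray(value).astype(np.int64)
    return converted
```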