Constant fold initializer for DQ node #23366

chilo-ms · 2025-01-14T23:25:13Z

Some hardware platforms require weights/initializers consumed by Q/DQ nodes to be FP32, FP16, INT8, UINT8, or INT4.
In other words, ORT needs to dequantize initializers of a "specific data type" to FP32 for those platforms.

This PR leverages the ORT ConstantFolding optimizer to dequantize the initializer of a DQ node if the initializer has a specific data type.
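To make the idea concrete, here is a minimal sketch of what folding a DQ node over a constant initializer amounts to numerically. It applies the per-tensor ONNX DequantizeLinear formula, y = (x - zero_point) * scale, to an INT8 buffer ahead of time. The function name `DequantizeInitializer` and its signature are hypothetical and only for illustration; this is not the code in this PR and not an ORT API.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper (not an ORT API): per-tensor DequantizeLinear,
// y = (x - zero_point) * scale, applied once to a constant INT8 initializer
// so the result can be stored as an FP32 initializer.
std::vector<float> DequantizeInitializer(const std::vector<int8_t>& quantized,
                                         float scale,
                                         int8_t zero_point) {
  // Assumption of this sketch: the quantization scale is positive.
  assert(scale > 0.0f && "this sketch assumes a positive scale");

  std::vector<float> result;
  result.reserve(quantized.size());
  for (int8_t q : quantized) {
    // Widen to int32_t before subtracting so the difference cannot overflow int8_t.
    const int32_t centered =
        static_cast<int32_t>(q) - static_cast<int32_t>(zero_point);
    result.push_back(static_cast<float>(centered) * scale);
  }
  return result;
}
```

For instance, with scale 0.5 and zero point 10, the quantized value 127 becomes (127 - 10) * 0.5 = 58.5. Once the values are precomputed like this, consumers can read the FP32 initializer directly instead of running the DQ node at inference time.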

github-actions

You can commit the suggested changes from lintrunner.

github-actions · 2025-01-14T23:30:55Z

include/onnxruntime/core/session/onnxruntime_session_options_config_keys.h

+// Dequantize initializer using ORT ConstantFolding optimizer for dq node if initializer has specific(? TBD) data type.
+// This feature is required by some NPU's. 
+// "0": disable. ORT doesn't constant fold the DQ node. [DEFAULT]


Suggested change

-// Dequantize initializer using ORT ConstantFolding optimizer for dq node if initializer has specific(? TBD) data type.
-// This feature is required by some NPU's. 
-// "0": disable. ORT doesn't constant fold the DQ node. [DEFAULT]
+// Dequantize initializer using ORT ConstantFolding optimizer for dq node if initializer has specific(? TBD) data type.
+// This feature is required by some NPU's.
+// "0": disable. ORT doesn't constant fold the DQ node. [DEFAULT]

github-actions · 2025-01-14T23:30:55Z

onnxruntime/test/optimizer/graph_transform_test.cc

+                                        false /*skip_dequantize_linear*/,
+                                        false /*dequantize_initializer_for_dequantize_linear*/, 
+                                        empty_config_options),


Suggested change

-                                        false /*skip_dequantize_linear*/,
-                                        false /*dequantize_initializer_for_dequantize_linear*/, 
-                                        empty_config_options),
+                                        false /*skip_dequantize_linear*/,
+                                        false /*dequantize_initializer_for_dequantize_linear*/,
+                                        empty_config_options),

chilo-ms · 2025-01-15T00:26:10Z

I'm working on a prototype which makes ORT capable of enabling further optimizations for EPs.

github-actions

You can commit the suggested changes from lintrunner.

github-actions · 2025-01-22T18:19:24Z

onnxruntime/core/optimizer/graph_transformer_utils.cc

        transformers.emplace_back(std::make_unique<QDQPropagationTransformer>());
-        transformers.emplace_back(std::make_unique<WeightBiasQuantization>());
+        //transformers.emplace_back(std::make_unique<WeightBiasQuantization>());



Suggested change

         transformers.emplace_back(std::make_unique<QDQPropagationTransformer>());
-        //transformers.emplace_back(std::make_unique<WeightBiasQuantization>());
+        // transformers.emplace_back(std::make_unique<WeightBiasQuantization>());

constant fold initializer for QD

bbb5862

github-actions bot reviewed Jan 14, 2025


temporarily disable weight/Bias quantization

981d95b

github-actions bot reviewed Jan 22, 2025

