[LPT] FQStripping transformation rework #33989
Merged
v-Golubev merged 77 commits into openvinotoolkit:master on Mar 4, 2026
Conversation
Force-pushed from f2e03fb to 6077ca5
aobolensk reviewed Feb 17, 2026
src/plugins/intel_gpu/tests/functional/subgraph_tests/dynamic/qdq_stripping.cpp
aobolensk reviewed Feb 17, 2026
src/common/transformations/include/transformations/utils/utils.hpp
aobolensk reviewed Feb 18, 2026
src/plugins/intel_gpu/tests/functional/subgraph_tests/dynamic/qdq_stripping.cpp
Contributor (Author)
@isanghao could you please take a look at the GPU part? Thanks in advance!
Merged via the queue into openvinotoolkit:master with commit f8efd35 on Mar 4, 2026
225 of 229 checks passed
mlukasze pushed a commit to mlukasze/openvino that referenced this pull request on Mar 5, 2026
Nishant-ZFYII pushed a commit to Nishant-ZFYII/openvino that referenced this pull request on Mar 5, 2026
atamas19 pushed a commit to atamas19/openvino that referenced this pull request on Mar 6, 2026
Details:
Some INT16 models rely on U16/I16 FakeQuantize layers. Simply stripping these FakeQuantize operations may be insufficient when such models are executed in f16 precision, because the original (unquantized) activation values flowing through the stripped path may exceed the representable f16 range. This can lead to overflow and, consequently, incorrect inference results.
This PR introduces a new mechanism called `ScaleAdjuster`. The `ScaleAdjuster` detects activation paths that feed into scale-invariant nodes and safely reduces the magnitude of activation values to keep them within the f16 numeric range, without altering the model's semantic correctness (so the adjustment is possible only for activation paths that reach scale-invariant nodes).
The implementation is validated by:
- GPU functional tests, ensuring inference correctness, and
- LPT graph comparison tests, verifying structural consistency of transformations.

Tickets:
- *CVS-180573*
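The idea behind the pass can be illustrated with a small numeric sketch. This is not the PR's implementation; it is a hypothetical NumPy demo (the `l2_normalize` function and the `1/1024` scale factor are illustrative choices) showing why pre-scaling activations is safe in front of a scale-invariant operation, and why it is needed at all: f16 cannot represent values above ~65504.

```python
import numpy as np

def l2_normalize(x):
    # Example of a scale-invariant operation:
    # l2_normalize(s * x) == l2_normalize(x) for any s > 0.
    return x / np.linalg.norm(x)

# Hypothetical activation values that exceed the f16 range (max ~65504)
activations = np.array([70000.0, 140000.0], dtype=np.float32)

# Casting directly to f16 overflows to inf
print(np.isinf(activations.astype(np.float16)).any())  # True

# Pre-scaling the activations, as a ScaleAdjuster-like pass might,
# keeps them representable in f16
scale = 1.0 / 1024.0
scaled = (activations * scale).astype(np.float16)
print(np.isinf(scaled).any())  # False

# Because the downstream node is scale-invariant, its output is unchanged
# (up to f16 rounding)
assert np.allclose(l2_normalize(activations),
                   l2_normalize(scaled.astype(np.float32)),
                   atol=1e-3)
```

The same reasoning explains the restriction mentioned in the description: if the path did not end in a scale-invariant node, the injected scale would change the model's output, so the adjustment is only legal on paths that reach such nodes.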