Keep dequantization subgraph output as inference precision for GPU plugin #30685

Open
wants to merge 4 commits into master

Conversation

z71258847

Details:

  • Insert a Convert op as the subgraph endpoint for identified u16, i32, and u32 dequantization subgraphs, so that constant folding still computes in fp32 but the folded result is emitted in the current inference precision (fp16) for the GPU plugin.
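The motivation can be illustrated numerically. Below is a hypothetical sketch (not the PR's actual transformation code) of why the dequantization arithmetic should be folded in fp32 and only the final value converted to fp16: quantized values near fp16's rounding granularity cancel out entirely if the whole subgraph is evaluated in fp16.

```python
import numpy as np

# Hypothetical u16 dequantization: value = (q - zero_point) * scale.
# Near 40000, fp16 spacing is 32, so q and zero_point both round to
# the same fp16 value and their difference vanishes.
q, zero_point, scale = 40001, 40000, 0.5

# Entire subgraph evaluated in fp16 (the failure mode being avoided):
fp16_path = (np.float16(q) - np.float16(zero_point)) * np.float16(scale)

# Constant folding in fp32, then a trailing Convert to fp16
# (the effect of inserting the Convert endpoint):
fp32_path = np.float16((np.float32(q) - np.float32(zero_point)) * np.float32(scale))

print(fp16_path)  # 0.0 -- the dequantized value is lost
print(fp32_path)  # 0.5 -- exact result, and representable in fp16
```

The trailing Convert keeps the intermediate subtraction and multiplication at fp32 precision, while the network downstream still sees an fp16 tensor.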

Tickets:

@z71258847 z71258847 requested a review from a team as a code owner May 23, 2025 05:08
@z71258847 z71258847 requested review from itikhono and removed request for a team May 23, 2025 05:08
@github-actions github-actions bot added the category: transformations OpenVINO Runtime library - Transformations label May 23, 2025
@sys-openvino-ci sys-openvino-ci added the ExternalIntelPR External contributor from Intel label May 23, 2025
@sshlyapn
Contributor

@z71258847 please move this extra logic to GPU-specific transformations

@z71258847 z71258847 closed this May 23, 2025
@z71258847 z71258847 reopened this May 23, 2025
@z71258847 z71258847 requested review from a team as code owners May 23, 2025 08:11
@github-actions github-actions bot added category: GPU OpenVINO GPU plugin and removed category: transformations OpenVINO Runtime library - Transformations labels May 23, 2025
@p-durandin
Contributor

build_jenkins

Labels: category: GPU (OpenVINO GPU plugin), ExternalIntelPR (External contributor from Intel)

4 participants