
[GPU] Fix int8 cache in onednn convolution #30606


Open

davidsnam-intel wants to merge 5 commits into master from david/fix-convolution-onednn--int8-cache

Conversation

davidsnam-intel (Contributor)

Details:

  • Fixed garbage values that appeared when using the cache in the oneDNN convolution with the YOLO11 model.

Tickets:

  • 162501

@davidsnam-intel davidsnam-intel requested review from a team as code owners May 18, 2025 22:47
@github-actions github-actions bot added the category: GPU (OpenVINO GPU plugin) label May 18, 2025
```cpp
strides, dilates, padding_l, padding_r,
*_attrs.get());
_pd = *prim_desc;
dnnl::memory::desc bias_md = nullptr;
```
Contributor:

nit: `dnnl::memory::desc bias_md;` would be more explicit for a zero memory descriptor, according to the oneDNN API.
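
For reference, the two spellings side by side. The oneDNN C++ API documents a default-constructed `memory::desc` as a zero (empty) descriptor that indicates an absent argument; the `= nullptr` form presumably goes through the underlying handle constructor instead (an assumption about oneDNN 3.x internals, where `memory::desc` is a handle type):

```cpp
dnnl::memory::desc bias_md_suggested;          // documented zero md, i.e. "no bias"
dnnl::memory::desc bias_md_current = nullptr;  // form used in this PR
```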

davidsnam-intel (Contributor, Author) commented on May 19, 2025:

For convolution_onednn, there is a constructor A that does not take a bias parameter and a constructor B that does. A eventually calls B, and at that point it explicitly passes nullptr for the bias. Given that, what do you think of this approach?
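
A minimal sketch of the delegation being described (illustrative names and signatures, not the actual OpenVINO source):

```cpp
#include <oneapi/dnnl/dnnl.hpp>

// Hypothetical simplification of the pattern described above.
struct convolution_onednn {
    // B: full constructor that receives a bias descriptor.
    convolution_onednn(const dnnl::memory::desc& input_md,
                       const dnnl::memory::desc& weights_md,
                       const dnnl::memory::desc& bias_md,
                       const dnnl::memory::desc& output_md) {
        // ... builds the oneDNN convolution primitive descriptor ...
    }

    // A: bias-less constructor; it delegates to B and explicitly passes
    // nullptr in the bias slot, which is the line under discussion.
    convolution_onednn(const dnnl::memory::desc& input_md,
                       const dnnl::memory::desc& weights_md,
                       const dnnl::memory::desc& output_md)
        : convolution_onednn(input_md, weights_md,
                             dnnl::memory::desc(nullptr), output_md) {}
};
```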

Contributor:

My point is to replace `dnnl::memory::desc bias_md = nullptr;` with `dnnl::memory::desc bias_md;` for explicit use of the API. The two seem to be basically the same.

davidsnam-intel (Contributor, Author):

Actually, I understood and tried what you suggested. I couldn't prove exactly why, but the cache works properly only when nullptr is explicitly assigned. I didn't compare the memory state of the two variants with any specific tool, but I confirmed through many tests that the cache behaves correctly only with the explicit nullptr assignment.
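
Not a proof of the root cause, but a quick probe along these lines (assuming a oneDNN 3.x build, where `memory::desc` is a reference-counted handle) could show whether the two initializations even produce the same underlying object:

```cpp
#include <oneapi/dnnl/dnnl.hpp>
#include <iostream>

int main() {
    dnnl::memory::desc by_default;            // reviewer's suggested form
    dnnl::memory::desc by_nullptr = nullptr;  // form used in this PR

    // handle::get(true) returns the raw dnnl_memory_desc_t pointer without
    // throwing when it is null, so both variants can be inspected safely.
    std::cout << std::boolalpha
              << "default-constructed handle is null: "
              << (by_default.get(true) == nullptr) << "\n"
              << "nullptr-initialized handle is null: "
              << (by_nullptr.get(true) == nullptr) << "\n";
}
```

If the two differ at this level, anything that keys a cache on the descriptor (or serializes it) could legitimately see them as different.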

@p-durandin p-durandin added this to the 2025.2 milestone May 19, 2025
```cpp
dnnl::prop_kind::forward_inference, dnnl::algorithm::convolution_direct,
input_md, weights_md, bias_md, output_md,
strides, dilates, padding_l, padding_r,
*_attrs.get());
```
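
For context, a self-contained sketch of the call this hunk belongs to; the engine, shapes, and data types here are assumed for illustration and are not the OpenVINO code:

```cpp
#include <oneapi/dnnl/dnnl.hpp>

int main() {
    // CPU engine just to keep the example self-contained; the PR targets GPU.
    dnnl::engine engine(dnnl::engine::kind::cpu, 0);

    using dt = dnnl::memory::data_type;
    using tag = dnnl::memory::format_tag;

    // Assumed int8 shapes, purely for illustration; a given build may not
    // support this exact configuration.
    dnnl::memory::desc input_md({1, 8, 16, 16}, dt::s8, tag::any);
    dnnl::memory::desc weights_md({8, 8, 3, 3}, dt::s8, tag::any);
    dnnl::memory::desc output_md({1, 8, 16, 16}, dt::s8, tag::any);
    dnnl::memory::desc bias_md;  // zero md: convolution without bias

    dnnl::memory::dims strides{1, 1}, dilates{0, 0};
    dnnl::memory::dims padding_l{1, 1}, padding_r{1, 1};

    dnnl::primitive_attr attrs;
    auto prim_desc = dnnl::convolution_forward::primitive_desc(
            engine,
            dnnl::prop_kind::forward_inference,
            dnnl::algorithm::convolution_direct,
            input_md, weights_md, bias_md, output_md,
            strides, dilates, padding_l, padding_r,
            attrs);
    (void)prim_desc;
    return 0;
}
```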
Contributor:

The behavior seems exactly the same as below. Am I missing something?

Contributor:

I suspect the issue is merely hidden by a change in the compilation result, but the real root cause is still unknown.

@davidsnam-intel davidsnam-intel force-pushed the david/fix-convolution-onednn--int8-cache branch from 226b3f2 to 0278c1d May 20, 2025 22:35
@mlukasze mlukasze requested review from isanghao and yeonbok May 21, 2025 03:06
@geunhwan geunhwan removed this from the 2025.2 milestone May 22, 2025
Labels: category: GPU (OpenVINO GPU plugin)

5 participants