Skip to content

[GPU] Fix for ExperimentalDetectronROI levels#36305

Open
nshchego wants to merge 1 commit into
openvinotoolkit:masterfrom
nshchego:gpu/bug/softmax_bf
Open

[GPU] Fix for ExperimentalDetectronROI levels#36305
nshchego wants to merge 1 commit into
openvinotoolkit:masterfrom
nshchego:gpu/bug/softmax_bf

Conversation

@nshchego

@nshchego nshchego commented Jun 8, 2026

Copy link
Copy Markdown
Contributor

Details:

  • In experimental_detectron_roi_feature_extractor_ref.cl, ROI-to-pyramid-level mapping could produce invalid behavior for degenerate/non-positive-area ROIs. Need to use reference-consistent level math.
  • In softmax_gpu_bf.cl, fused-op output writes used the wrong loop variable (i instead of output_idx) in two places, causing incorrect output indexing/corruption risk.

Tickets:

AI Assistance:

  • AI assistance used: yes

@nshchego nshchego requested review from a team as code owners June 8, 2026 11:58
@github-actions github-actions Bot added the category: GPU OpenVINO GPU plugin label Jun 8, 2026
ACTIVATION_TYPE dequantized = my_chunk[output_idx] / my_sum;
FUSED_OPS_MAIN;
output[aligned_data_offset + get_sub_group_local_id() + i * get_sub_group_size()] = FUSED_OPS_RESULT_MAIN;
output[aligned_data_offset + get_sub_group_local_id() + output_idx * get_sub_group_size()] = FUSED_OPS_RESULT_MAIN;

@Lyamin-Roman Lyamin-Roman Jun 8, 2026

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It looks like original logic was lost after this PR
And src/plugins/intel_gpu/src/kernel_selector/kernels/softmax/softmax_kernel_bf.cpp:160 still contains the old code with i

Also please extend the existing fusing tests for this case src/plugins/intel_gpu/tests/unit/fusions/softmax_fusion_test.cpp

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: GPU OpenVINO GPU plugin

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants