[CPU][PA] Align bidirectional image attention and sliding window interaction with reference#35446
Merged
p-wysocki merged 4 commits into openvinotoolkit:master on May 4, 2026
Conversation
pkowalc1 approved these changes Apr 23, 2026

Contributor
@zhangYiIntel, could you please review?
zhangYiIntel approved these changes Apr 30, 2026
Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>
Details:
When both bidirectional image attention and a sliding window were active, the CPU plugin computed the attention start index as:

start_idx = image_group_end - sliding_window

The correct behavior (implemented by transformers and the GPU plugin) is not to clip the image attention, but to pass the whole image group as a single block, no matter what the sliding window size is.
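For illustration, here is a minimal sketch of the two rules. The names (`image_group_start`, `image_group_end`, `sliding_window`) are illustrative assumptions, not the plugin's actual identifiers:

```python
def start_idx_before_fix(image_group_start, image_group_end, sliding_window):
    # Old behaviour: the attention start was derived from the sliding window,
    # so leading image tokens could be cut out of the attended range.
    return image_group_end - sliding_window

def start_idx_after_fix(image_group_start, image_group_end, sliding_window):
    # Fixed behaviour: the whole image group is attended, regardless of the
    # sliding window size (matching transformers and the GPU plugin).
    return image_group_start

# 6 image tokens ([0, 6)) with a sliding window of 5:
assert start_idx_before_fix(0, 6, 5) == 1   # mask [1, 6): first token cut
assert start_idx_after_fix(0, 6, 5) == 0    # mask [0, 6): full group
```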
An example
Let's assume 6 image tokens and a sliding window of size 5.
Before the fix:
Attention mask: [1, 6) - the first token is cut off.

After the fix:

Attention mask: [0, 6) - the full group is attended.

If the attention mask is being calculated for a text token, the attention is regular causal/sliding-window attention, regardless of whether the previous tokens were image or text, which matches the transformers implementation.
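The combined rule can be sketched as a small helper; this is a minimal, self-contained illustration assuming image groups are represented as [start, end) pairs, and it is not the plugin's actual data structure or code:

```python
def attended_range(q_idx, image_groups, sliding_window):
    """Half-open [start, end) of key positions that query position q_idx may attend to."""
    for group_start, group_end in image_groups:
        if group_start <= q_idx < group_end:
            # Image token: bidirectional attention over the whole image group,
            # never clipped by the sliding window.
            return group_start, group_end
    # Text token: regular causal attention limited by the sliding window,
    # no matter whether the preceding tokens were image or text.
    return max(0, q_idx + 1 - sliding_window), q_idx + 1

# The example from the description: 6 image tokens, sliding window of 5.
image_groups = [(0, 6)]
print(attended_range(0, image_groups, sliding_window=5))   # (0, 6) - full group
print(attended_range(5, image_groups, sliding_window=5))   # (0, 6)
# A later text token, e.g. at position 8, uses the ordinary causal sliding window:
print(attended_range(8, image_groups, sliding_window=5))   # (4, 9)
```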
Tickets:
AI Assistance: