Skip to content

Conversation

@jackzhxng
Copy link
Contributor

What does this PR do?

Pass in position_ids so that the custom attention interface implementations have access to it.

Who can review?

@ArthurZucker @Cyrilvallez

@github-actions
Copy link
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: gpt2

Copy link
Contributor

@vasqu vasqu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We have a test for this but it is skipped with GPT2 currently, i.e. check if paddingfree is the same as padding which relies exactly on this. I have a PR for this over here #42848

The logic is sound tho, depending on the other PR, it might make more sense to add an explicit test here

@jackzhxng
Copy link
Contributor Author

Sounds good, lmk if you want the test here / feel free to merge

@vasqu
Copy link
Contributor

vasqu commented Dec 15, 2025

Could you add a small test here directly? My other PR is going a bit beyond what I had originally planned

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants