[OpenVINO] Transformers 4.56/4.57 support 2 (awaiting npu testing) #1541
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
@IlyasMoutawwakil what are your expectations for merging this PR? We're considering using transformers 4.57 for one of our testing pipelines, so having the corresponding support on optimum-intel's side would be a very good thing.
@CuriousPanCake my expectation is that it is good to go 😁
There is an issue with Gemma models on the new transformers version, reported by the GenAI team:
Traceback (most recent call last):
File "/home/jenkins/agent/workspace/DL-Benchmark/prod/WW46-2025.4.0-20398-RC2-Transformers4.57/LO_CPU_acc_wwb_wwb_ref_nat_vs_genai_CPU_ICX/venv/bin/wwb", line 9, in <module>
sys.exit(main())
File "/home/jenkins/agent/workspace/DL-Benchmark/prod/WW46-2025.4.0-20398-RC2-Transformers4.57/LO_CPU_acc_wwb_wwb_ref_nat_vs_genai_CPU_ICX/genai/tools/who_what_benchmark/whowhatbench/wwb.py", line 764, in main
all_metrics_per_question, all_metrics = evaluator.score(
File "/home/jenkins/agent/workspace/DL-Benchmark/prod/WW46-2025.4.0-20398-RC2-Transformers4.57/LO_CPU_acc_wwb_wwb_ref_nat_vs_genai_CPU_ICX/genai/tools/who_what_benchmark/whowhatbench/visualtext_evaluator.py", line 84, in score
predictions = self._generate_data(model_or_data, gen_answer_fn, self.generation_config)
File "/home/jenkins/agent/workspace/DL-Benchmark/prod/WW46-2025.4.0-20398-RC2-Transformers4.57/LO_CPU_acc_wwb_wwb_ref_nat_vs_genai_CPU_ICX/genai/tools/who_what_benchmark/whowhatbench/visualtext_evaluator.py", line 177, in _generate_data
gen_answer_fn(
File "/home/jenkins/agent/workspace/DL-Benchmark/prod/WW46-2025.4.0-20398-RC2-Transformers4.57/LO_CPU_acc_wwb_wwb_ref_nat_vs_genai_CPU_ICX/genai/tools/who_what_benchmark/whowhatbench/wwb.py", line 469, in genai_gen_visual_text
out = model.generate(
RuntimeError: Exception from src/inference/src/cpp/infer_request.cpp:75:
Check '::getPort(port, name, {_impl->get_inputs(), _impl->get_outputs()})' failed at src/inference/src/cpp/infer_request.cpp:77:
Port for tensor name position_ids was not found.
@IlyasMoutawwakil, please double-check and clarify why this issue appeared.
Thanks,
Roman
@rkazants no idea, there's not enough context in the traceback you shared to know exactly what went wrong
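For context on the failure mode: the `RuntimeError` above is raised when `model.generate` tries to set a tensor named `position_ids` on an inference request, but the compiled OpenVINO model exposes no input (or output) port with that name. One plausible cause, which is an assumption and not confirmed anywhere in this thread, is a mismatch between the inputs the newly exported model exposes and the inputs the caller still tries to set. The sketch below is a plain-Python illustration of that port lookup; `find_port` and the input sets are hypothetical names invented here, not the actual OpenVINO implementation.

```python
# Hypothetical sketch of the port lookup that fails with
# "Port for tensor name position_ids was not found."
# find_port and the example input sets are illustrative only.

def find_port(tensor_name, input_names, output_names):
    """Return which side a tensor name belongs to, mimicking the
    getPort check in src/inference/src/cpp/infer_request.cpp."""
    if tensor_name in input_names:
        return "input"
    if tensor_name in output_names:
        return "output"
    raise RuntimeError(
        f"Port for tensor name {tensor_name} was not found."
    )

# A model exported with position_ids as an explicit input resolves fine:
old_style_inputs = {"input_ids", "attention_mask", "position_ids"}
print(find_port("position_ids", old_style_inputs, set()))  # -> input

# If the export no longer exposes position_ids (assumption: e.g. because
# it is computed inside the graph), a caller that still sets it fails:
new_style_inputs = {"input_ids", "attention_mask"}
try:
    find_port("position_ids", new_style_inputs, set())
except RuntimeError as e:
    print(e)  # -> Port for tensor name position_ids was not found.
```

If this guess is right, the fix would be on whichever side of the export/runtime boundary changed its expected input set, which is exactly the context the traceback alone doesn't reveal.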
What does this PR do?
Fixes # (issue)
Before submitting