
Conversation

@rkazants (Collaborator) commented Sep 29, 2025

What does this PR do?

Command to export the model:

optimum-cli export openvino -m openbmb/MiniCPM-o-2_6 MiniCPM-o-2_6 --task=image-text-to-text --trust-remote-code

Example of inference:

from optimum.intel.openvino import OVModelForVisualCausalLM
from transformers import AutoProcessor
from PIL import Image
import requests

model_id = "openbmb/MiniCPM-o-2_6"

processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
prompt = "<|im_start|>user\n(<image>./</image>)\nWhat is in the image?<|im_end|>\n<|im_start|>assistant\n"
image = Image.open(requests.get("https://github.com/openvinotoolkit/openvino_notebooks/assets/29454499/d5fbbd1a-d484-415c-88cb-9986625b7b11", stream=True).raw).convert("RGB")

model = OVModelForVisualCausalLM.from_pretrained(model_id, trust_remote_code=True)

inputs = processor([prompt], [image], return_tensors="pt")
result = model.generate(**inputs, max_new_tokens=20)

print(processor.tokenizer.batch_decode(result[:, inputs["input_ids"].shape[1]:]))
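The slicing `result[:, inputs["input_ids"].shape[1]:]` in the last line keeps only the newly generated tokens, since `generate()` returns the prompt tokens followed by the model's continuation. A minimal sketch of that logic with plain Python lists (the token IDs are hypothetical, no model involved):

```python
# generate() output = prompt tokens + newly generated tokens;
# slicing off the prompt length leaves only the model's answer.
prompt_ids = [101, 7592, 2088]                           # hypothetical input_ids
generated = prompt_ids + [2009, 2003, 1037, 4937, 102]   # hypothetical generate() output

# Same idea as result[:, inputs["input_ids"].shape[1]:] on a tensor.
new_tokens = generated[len(prompt_ids):]
print(new_tokens)  # [2009, 2003, 1037, 4937, 102]
```

Decoding `new_tokens` instead of `generated` is what keeps the echoed prompt out of the printed answer.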

Before submitting

  • [N/A] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

@rkazants rkazants requested a review from echarlaix October 2, 2025 11:43
@IlyasMoutawwakil (Member) left a comment

LGTM! I left a question and a nit suggestion.
Thanks for the addition!

"minicpm": "katuni4ka/tiny-random-minicpm",
"minicpm3": "katuni4ka/tiny-random-minicpm3",
"minicpmv": "katuni4ka/tiny-random-minicpmv-2_6",
"minicpmo": "rkazants/tiny-random-MiniCPM-o-2_6",
Member

This model will slow down our CI greatly, it is 400MB 🫨
https://huggingface.co/rkazants/tiny-random-MiniCPM-o-2_6/tree/main

@rkazants (Collaborator, Author) commented Oct 3, 2025

This is the minimal size I managed to achieve. For comparison, minicpmv is about 300MB and it is tested: https://huggingface.co/katuni4ka/tiny-random-minicpmv-2_6/tree/main

Member

It should be reduced as well.

@rkazants (Collaborator, Author) commented Oct 6, 2025

I reduced it to 144MB. The minimal hidden_size for the llm part is 128: https://huggingface.co/rkazants/tiny-random-MiniCPM-o-2_6/blob/main/modeling_minicpmo.py#L209
That also impacts the apm and tts module sizes.

@IlyasMoutawwakil, @echarlaix, I propose doing any further reduction in follow-up PR(s) if there are ideas. My other colleagues are waiting for this PR, so let us not block the merge over the tiny model's size. We know that the implemented logic passes the tests in GHA.
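As a back-of-the-envelope way to sanity-check such sizes: checkpoint size is roughly parameter count times bytes per parameter. A sketch with a hypothetical parameter count (not the actual MiniCPM-o figure), assuming fp32 (4 bytes) vs fp16 (2 bytes) storage:

```python
# Checkpoint size ≈ parameters × bytes per parameter (metadata overhead ignored).
def checkpoint_mb(n_params: int, bytes_per_param: int = 4) -> float:
    """Rough on-disk size in MB for a given parameter count."""
    return n_params * bytes_per_param / (1024 * 1024)

# Hypothetical: a 36M-parameter tiny-random model.
n = 36_000_000
print(round(checkpoint_mb(n, 4), 1))  # fp32: 137.3 MB
print(round(checkpoint_mb(n, 2), 1))  # fp16: 68.7 MB
```

This kind of estimate makes it easy to see which config dimensions dominate a tiny-random checkpoint before re-exporting it.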

Collaborator

Completely agree with @IlyasMoutawwakil's comment, we should be super careful with the size of our tiny random models so as not to slow down the CI. Could you expand on the constraints on the different models' parameters, @rkazants? In https://huggingface.co/rkazants/tiny-random-MiniCPM-o-2_6/blob/main/config.json#L20, for example, I see d_model / decoder_ffn_dim / encoder_ffn_dim set to 1024, 1024 and 4096 respectively.
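To illustrate why those three dimensions matter for size: the two FFN weight matrices in a transformer layer each scale with d_model × ffn_dim, so shrinking both dimensions cuts per-layer parameters multiplicatively. A sketch using the values quoted above plus hypothetical reduced values:

```python
# Per-layer FFN weights: up-projection (d_model, ffn_dim) and
# down-projection (ffn_dim, d_model); biases omitted for simplicity.
def ffn_weight_params(d_model: int, ffn_dim: int) -> int:
    return 2 * d_model * ffn_dim

current = ffn_weight_params(1024, 4096)  # encoder values seen in config.json
reduced = ffn_weight_params(128, 512)    # hypothetical smaller values

print(current)             # 8388608 parameters per FFN sublayer
print(reduced)             # 131072 parameters per FFN sublayer
print(current // reduced)  # 64x reduction
```

Of course, as noted below, these dimensions are coupled to the other modalities' modules, so they cannot be shrunk in isolation.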

Collaborator

Also, if the PR really needs to be merged asap, I'm ok with keeping this model, but I would like a follow-up PR that changes it to a smaller model, or, if that cannot be done due to modeling constraints, more information on what the constraints are and why it cannot be done. Would that sound reasonable, @rkazants?

@rkazants (Collaborator, Author) commented Oct 6, 2025

Discussed offline with @echarlaix to proceed with the merge.
I will take this action item for further optimization. Indeed, there is room for optimization, e.g. in d_model and encoder_ffn_dim, but it will take some time: varying these parameter values requires adjusting several parameters in other modalities, which in turn requires a deeper understanding of the model.
Thanks!

@echarlaix echarlaix merged commit 82a9ed7 into huggingface:main Oct 7, 2025
33 of 37 checks passed