LoRA support for VLM Pipeline #3402
Conversation
Pull request overview
Adds LoRA adapter support to the OpenVINO GenAI VLMPipeline, enabling runtime adapter application while preserving adapter-owned states across generation and chat resets. This also updates benchmarking utilities and adds Python/C++ samples plus README guidance to demonstrate VLM + LoRA usage.
Changes:
- Integrate LoRA adapter handling into `VLMPipeline` via adapter extraction from properties and `AdapterController` application during generation.
- Extend who-what benchmark VLM loader/model paths to accept adapters/alphas.
- Add new Python and C++ VLM LoRA samples and document how to run them.
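The overview's point about preserving adapter-owned state across chat resets can be illustrated with a toy sketch. The class and field names below are hypothetical, not the real `VLMPipeline` API; the sketch only shows the invariant that a chat reset clears conversation state while leaving adapter-controller state intact:

```python
class ToyVLMPipeline:
    """Toy illustration (not OpenVINO GenAI) of adapter state surviving resets."""

    def __init__(self, adapter_state):
        # State installed when LoRA adapters were applied; must outlive resets.
        self.adapter_state = adapter_state
        # Per-conversation state; rebuilt for every new chat.
        self.chat_history = []

    def generate(self, prompt):
        self.chat_history.append(prompt)
        # Adapter state influences every generation call (placeholder logic).
        return f"{prompt} [alpha={self.adapter_state['alpha']}]"

    def finish_chat(self):
        # Reset the conversation but keep the adapter-owned state intact.
        self.chat_history = []


pipe = ToyVLMPipeline({"alpha": 0.75})
pipe.generate("hello")
pipe.finish_chat()
```

After `finish_chat()`, `pipe.chat_history` is empty but `pipe.adapter_state` is unchanged, which is the behavior the pipeline changes aim for.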
Reviewed changes
Copilot reviewed 9 out of 9 changed files in this pull request and generated 15 comments.
Show a summary per file
| File | Description |
|---|---|
| tools/who_what_benchmark/whowhatbench/model_loaders.py | Adds adapter plumbing for VLM GenAI pipeline and PEFT-based LoRA merging for HF visual-text models. |
| src/cpp/src/visual_language/pipeline_base.hpp | Stores an optional AdapterController in the VLM pipeline base for reuse by implementations. |
| src/cpp/src/visual_language/pipeline.cpp | Extracts adapters from properties, initializes/applies AdapterController, and preserves adapter state across resets. |
| src/cpp/src/lora/adapter.cpp | Adds adapter path existence/extension validation for safetensors adapters. |
| samples/python/visual_language_chat/visual_language_lora.py | New Python sample demonstrating VLM generation with and without LoRA adapters. |
| samples/python/visual_language_chat/README.md | Documents the new Python LoRA sample and alpha interpretation. |
| samples/cpp/visual_language_chat/visual_language_lora.cpp | New C++ sample demonstrating VLM generation with and without LoRA adapters. |
| samples/cpp/visual_language_chat/README.md | Documents the new C++ LoRA sample and alpha interpretation. |
| samples/cpp/visual_language_chat/CMakeLists.txt | Builds/installs the new C++ LoRA sample binary. |
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Vladimir Zlobin <vladimir.zlobin@intel.com>
```python
def apply_peft_adapters(model, adapters, alphas, merged_adapter_name="merged_lora"):
    adapters, alphas = normalize_lora_adapters_and_alphas(adapters, alphas)

    from peft import PeftModel

    adapter_names = ["adapter_0"]
    model = PeftModel.from_pretrained(model, adapters[0], adapter_name=adapter_names[0])
```
apply_peft_adapters() calls normalize_lora_adapters_and_alphas(), which returns (None, None) when adapters is None, and then immediately indexes adapters[0]. If apply_peft_adapters() is called with adapters=None (easy to do from external callers since this is a utility), it will crash with a TypeError rather than a clear validation error.
Add an explicit check after normalization (e.g., raise ValueError or return the model unchanged) so the function’s behavior is well-defined for None adapters.
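A minimal sketch of the suggested guard. The `normalize_lora_adapters_and_alphas` stand-in below is simplified for illustration, not the benchmark's actual helper, and the PEFT merging step is omitted:

```python
def normalize_lora_adapters_and_alphas(adapters, alphas):
    # Simplified stand-in for the benchmark helper: passes None through,
    # otherwise fills in a default alpha of 1.0 per adapter.
    if adapters is None:
        return None, None
    if alphas is None:
        alphas = [1.0] * len(adapters)
    return adapters, alphas


def apply_peft_adapters(model, adapters, alphas, merged_adapter_name="merged_lora"):
    adapters, alphas = normalize_lora_adapters_and_alphas(adapters, alphas)
    # Suggested guard: make the adapters=None case explicit instead of
    # letting adapters[0] raise an opaque TypeError below.
    if not adapters:
        return model  # nothing to apply; return the model unchanged
    # ... PEFT loading/merging would follow here (omitted in this sketch)
    return model
```

With the guard in place, `apply_peft_adapters(model, None, None)` returns the model unchanged rather than crashing, giving the utility well-defined behavior for external callers.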
Pull request overview
Copilot reviewed 19 out of 19 changed files in this pull request and generated 2 comments.
Comments suppressed due to low confidence (1)
tools/who_what_benchmark/tests/test_cli_vlm.py:3
`Path` is imported but never used in this test module. Unused imports can break linting and add noise; please remove it.
import sys
eb33ce8
Description
Enables LoRA support for VLMPipeline
CVS-180080
Documentation
https://likholat.github.io/openvino.genai/
Checklist: