Conversation
Implements LoRA (Low-Rank Adaptation) adapter support for the LTX-Video transformer model, following the same pattern as image generation.

Changes:
- Remove blocking assertion in `ltx_video_transformer_3d_model.cpp`
- Add adapter initialization in the `compile()` method
- Implement `set_adapters()` method for runtime adapter control
- Add `AdapterController` member to manage the adapter lifecycle
- Create `lora_text2video.cpp` sample demonstrating usage
- Add comprehensive tests in `test_video_generation.py`

The implementation allows users to:
- Load and apply one or more LoRA adapters
- Mix multiple adapters with different alpha values
- Enable/disable adapters per generation call
- Use community LoRA adapters from HuggingFace

Fixes openvinotoolkit#3306
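As background on what applying an adapter means: LoRA folds a low-rank product into a weight, so the effective weight becomes `W' = W + alpha * (B @ A)`. A minimal pure-Python sketch of that update — the matrix sizes and values below are illustrative, not taken from the PR's code:

```python
def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

def apply_lora(W, A, B, alpha):
    """Return W + alpha * (B @ A); the adapter rank is len(A)."""
    BA = matmul(B, A)
    return [[w + alpha * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, BA)]

# Rank-1 adapter on a 2x2 weight; alpha = 1.5 matches the PR's test run.
W = [[1.0, 0.0], [0.0, 1.0]]
A = [[1.0, 2.0]]            # r x in  -> 1 x 2
B = [[0.1], [0.2]]          # out x r -> 2 x 1
result = apply_lora(W, A, B, 1.5)
print([[round(v, 4) for v in row] for row in result])  # → [[1.15, 0.3], [0.3, 1.6]]
```

Mixing multiple adapters with different alphas, as the description mentions, is just a sum of such terms over the loaded adapters.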
- Added a Python binding for `LTXVideoTransformer3DModel.set_adapters()` to expose the method to Python, fixing `test_transformer_has_set_adapters_method`
- Added a safety check in `set_adapters()` to prevent crashes when the `AdapterController` is not initialized
- Both fixes follow the same patterns used in image generation models

Fixes the CI failures in the `TestLoRAVideoGeneration` test suite.
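The described safety check can be illustrated with a small Python mock. The class and error message below are hypothetical and only mirror the behavior the fix describes: a safe no-op for `None`, and a controlled error instead of a crash when the controller is missing:

```python
class Transformer:
    """Hypothetical stand-in for LTXVideoTransformer3DModel."""

    def __init__(self, adapter_controller=None):
        # The controller only exists when adapters were supplied up front.
        self._adapter_controller = adapter_controller

    def set_adapters(self, adapters):
        if adapters is None:
            return  # nothing to apply; a no-op is safe
        if self._adapter_controller is None:
            # Raise a controlled error rather than dereferencing a
            # missing controller (the segfault-prevention path).
            raise RuntimeError("Adapter controller is not initialized; "
                               "pass adapters when compiling the model")
        self._adapter_controller.apply(adapters)

model = Transformer()        # "compiled" without adapters
model.set_adapters(None)     # safe no-op
try:
    model.set_adapters(object())  # non-empty config -> controlled error
except RuntimeError as e:
    print("rejected:", e)
```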
…stubgen output (24 spaces)
…erministic pyi stub
📝 Walkthrough

This pull request adds LoRA (Low-Rank Adaptation) adapter support to the video generation pipeline in OpenVINO GenAI. Changes include new C++ and Python sample implementations demonstrating LoRA usage, configuration structure updates to accept adapters, adapter-aware modifications to the LTX video transformer model initialization and inference, Python bindings exposure, test coverage, and documentation updates.
Sequence Diagram

```mermaid
sequenceDiagram
    actor User
    participant Pipeline as Text2VideoPipeline
    participant T5Encoder as T5EncoderModel
    participant Transformer as LTXVideoTransformer3DModel
    participant Controller as AdapterController
    participant VAE as AutoencoderKLLTXVideo
    User->>Pipeline: Create pipeline with adapters config
    Pipeline->>T5Encoder: Initialize with device + properties
    Pipeline->>Transformer: Initialize with device + properties
    Pipeline->>VAE: Initialize with device + properties
    User->>Pipeline: Call generate(prompt, adapters=AdapterConfig)
    Pipeline->>Transformer: set_adapters(adapters)
    Transformer->>Controller: Create AdapterController(model, adapters, device)
    Transformer->>Transformer: Store adapter controller
    Pipeline->>T5Encoder: Encode text prompt
    T5Encoder-->>Pipeline: Text embeddings
    Pipeline->>Transformer: Infer with embeddings
    Transformer->>Transformer: Apply adapters via controller
    Transformer-->>Pipeline: Generated latent frames
    Pipeline->>VAE: Decode latents
    VAE-->>Pipeline: Video frames
    Pipeline-->>User: Save video (lora_video.avi or baseline_video.avi)
```
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~25 minutes
🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed (1 warning)
Actionable comments posted: 6
🧹 Nitpick comments (2)
tests/python_tests/test_video_generation.py (1)
299-308: Add a regression assertion for the safety-check path in `set_adapters`.

Right now this only verifies `None` input. Please also assert that passing a non-empty adapter config after compile (without adapter initialization) raises a controlled exception, so the segfault-prevention path is explicitly covered.

🧪 Suggested test extension

```diff
     def test_transformer_has_set_adapters_method(self, video_generation_model):
         """Test that the LTXVideoTransformer3DModel has the set_adapters method"""
         model_path = Path(video_generation_model) / "transformer"
         if model_path.exists():
             model = ov_genai.LTXVideoTransformer3DModel(str(model_path))
             model.compile("CPU")
             assert hasattr(model, "set_adapters")
             model.set_adapters(None)
+            with pytest.raises(RuntimeError, match="Adapter controller is not initialized"):
+                model.set_adapters(ov_genai.AdapterConfig())
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@tests/python_tests/test_video_generation.py` around lines 299 - 308, Extend the test_transformer_has_set_adapters_method to cover the safety-check path in LTXVideoTransformer3DModel.set_adapters by asserting that after model.compile("CPU") calling model.set_adapters with a non-empty adapter config (e.g., a dict or object representing adapters) raises a controlled exception; wrap the call in pytest.raises(Exception) (or the specific exception class your implementation raises, e.g., RuntimeError/ValueError) to ensure the post-compile-but-uninitialized-adapters path is exercised and prevents a segfault.

samples/cpp/video_generation/CMakeLists.txt (1)
51-76: LGTM! The LoRA sample executable is correctly configured.

The CMake configuration for `lora_text2video` correctly mirrors the existing `text2video` target with proper include directories, library linking, RPATH handling, and installation rules.

For maintainability, consider extracting the common configuration into a CMake function to reduce duplication:
♻️ Optional refactor using a helper function
```cmake
function(add_video_generation_sample TARGET_NAME SOURCE_FILE)
    add_executable(${TARGET_NAME} ${SOURCE_FILE} imwrite_video.cpp)
    target_include_directories(${TARGET_NAME} PRIVATE
        ${CMAKE_BINARY_DIR}
        "${CMAKE_CURRENT_SOURCE_DIR}/../image_generation/"
        "${CMAKE_CURRENT_SOURCE_DIR}/../../../src/cpp/src/")
    ov_genai_link_opencv(${TARGET_NAME} core imgproc videoio imgcodecs)
    target_link_libraries(${TARGET_NAME} PRIVATE openvino::genai indicators::indicators)
    if(UNIX AND NOT APPLE)
        set_target_properties(${TARGET_NAME} PROPERTIES INSTALL_RPATH "$ORIGIN/../lib")
    elseif(APPLE)
        set_target_properties(${TARGET_NAME} PROPERTIES INSTALL_RPATH "@loader_path/../lib")
    endif()
    set_target_properties(${TARGET_NAME} PROPERTIES INSTALL_RPATH_USE_LINK_PATH ON)
    install(TARGETS ${TARGET_NAME}
            RUNTIME DESTINATION samples_bin/
            COMPONENT samples_bin
            EXCLUDE_FROM_ALL)
endfunction()

# Usage:
add_video_generation_sample(text2video text2video.cpp)
add_video_generation_sample(lora_text2video lora_text2video.cpp)
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@samples/cpp/video_generation/CMakeLists.txt` around lines 51 - 76, The CMake block duplicates target setup for text2video and lora_text2video; refactor by adding a helper function (e.g., add_video_generation_sample) that encapsulates add_executable, target_include_directories, ov_genai_link_opencv, target_link_libraries, the INSTALL_RPATH/INSTALL_RPATH_USE_LINK_PATH logic, and the install() call, then replace the existing explicit target blocks with calls like add_video_generation_sample(text2video text2video.cpp) and add_video_generation_sample(lora_text2video lora_text2video.cpp) so both targets use the same centralized configuration.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@samples/cpp/video_generation/lora_text2video.cpp`:
- Line 15: The usage string passed to OPENVINO_ASSERT in the program's argument
check contains a duplicated closing bracket at the end; update the
OPENVINO_ASSERT call (the one that asserts argc >= 3 && (argc - 3) % 2 == 0) to
remove the extra trailing ']' so the usage message ends with "...
[<LORA_SAFETENSORS> <ALPHA> ...]" (i.e., change the final "...]]" to "...]").
- Around line 27-28: Replace the silent atof parsing for LoRA alpha with
exception-safe std::stof parsing: in the code that reads argv for alpha (used
when calling adapter_config.add(adapter, alpha)), wrap std::stof(argv[...]) in a
try/catch (catch std::invalid_argument and std::out_of_range) and report a clear
CLI error (and exit) when parsing fails; optionally validate the parsed float
range before calling adapter_config.add. Also fix the usage string (the printed
help/usage message) to remove the extra trailing bracket by changing '...]]' to
'...]'.
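The defensive parse suggested above (wrapping `std::stof` in a try/catch in the C++ sample) looks roughly like this in Python terms; `parse_alpha` and the `[0, 10]` range check are illustrative assumptions, not the sample's actual code:

```python
import sys

def parse_alpha(text):
    """Parse a LoRA alpha from a CLI argument, failing with a clear message."""
    try:
        alpha = float(text)
    except ValueError:
        sys.exit(f"error: alpha must be a number, got {text!r}")
    # Optional range validation before handing the value to the adapter config.
    if not 0.0 <= alpha <= 10.0:
        sys.exit(f"error: alpha {alpha} outside expected range [0, 10]")
    return alpha

print(parse_alpha("1.5"))  # → 1.5
```

The same shape carries over to C++: catch `std::invalid_argument` and `std::out_of_range` from `std::stof`, print the usage string, and exit instead of silently reading `0.0` from `atof`.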
In `@samples/python/video_generation/lora_text2video.py`:
- Line 69: The print call uses an unnecessary f-string in print(f"\nPerformance
metrics:") which triggers Ruff F541; change it to a normal string by removing
the f prefix (update the print statement in
samples/python/video_generation/lora_text2video.py so the call is
print("\nPerformance metrics:") wherever that print is defined).
- Around line 22-23: The VideoWriter may fail to initialize silently; after
creating writer = cv2.VideoWriter(output_path, fourcc, fps, (width, height))
check writer.isOpened() and handle the failure (raise an exception or log an
error and exit) using output_path and any relevant error context; update the
code around the writer creation (the cv2.VideoWriter call and subsequent frame
writing loop) to bail out early if isOpened() is False so you don't attempt to
write frames or print a misleading success message.
In `@src/cpp/src/video_generation/ltx_pipeline.hpp`:
- Around line 425-428: The first constructor leaves m_text_encode_device,
m_denoise_device, m_vae_device and m_compile_properties uninitialized which can
cause rebuild_models() (called from generate()) to recompile models with garbage
device/compiler settings; fix by initializing those members to sensible defaults
(e.g., empty strings for m_text_encode_device/m_denoise_device/m_vae_device and
a default-constructed m_compile_properties) in the first constructor, or add a
defensive guard at the start of rebuild_models() that validates/assigns defaults
to m_text_encode_device, m_denoise_device, m_vae_device and m_compile_properties
(and returns/throws if they remain invalid) before constructing T5EncoderModel,
LTXVideoTransformer3DModel or AutoencoderKLLTXVideo.
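The defensive-guard option can be sketched as follows; the Python class, field names, and the `"CPU"` fallback are assumptions standing in for the C++ members listed above, not the pipeline's actual defaults:

```python
class Pipeline:
    """Hypothetical mirror of the ltx_pipeline device/property members."""

    def __init__(self, text_device="", denoise_device="", vae_device=""):
        # Initialize every member in every constructor so rebuild never
        # sees garbage values.
        self.text_device = text_device
        self.denoise_device = denoise_device
        self.vae_device = vae_device

    def rebuild_models(self):
        # Defensive guard: assign a safe default to any field that was
        # never set before reconstructing the models.
        for name in ("text_device", "denoise_device", "vae_device"):
            if not getattr(self, name):
                setattr(self, name, "CPU")
        return (self.text_device, self.denoise_device, self.vae_device)

print(Pipeline().rebuild_models())              # → ('CPU', 'CPU', 'CPU')
print(Pipeline("GPU", "GPU", "CPU").rebuild_models())
```

The alternative the comment mentions — throwing when the fields remain invalid — would replace the `setattr` fallback with a raise.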
In `@tests/python_tests/requirements.txt`:
- Line 29: The requirements line 'sentence-transformers>=2.2.2,<=5.2.2' should
be removed if unused, or changed to an exact pinned version to match the repo
pattern; either delete that dependency entry from the requirements file (if no
imports of sentence-transformers are present) or replace the ranged specifier
with an exact pin such as 'sentence-transformers==5.2.2' (or the specific
approved version) to ensure reproducible tests.
---
Nitpick comments:
In `@samples/cpp/video_generation/CMakeLists.txt`:
- Around line 51-76: The CMake block duplicates target setup for text2video and
lora_text2video; refactor by adding a helper function (e.g.,
add_video_generation_sample) that encapsulates add_executable,
target_include_directories, ov_genai_link_opencv, target_link_libraries, the
INSTALL_RPATH/INSTALL_RPATH_USE_LINK_PATH logic, and the install() call, then
replace the existing explicit target blocks with calls like
add_video_generation_sample(text2video text2video.cpp) and
add_video_generation_sample(lora_text2video lora_text2video.cpp) so both targets
use the same centralized configuration.
In `@tests/python_tests/test_video_generation.py`:
- Around line 299-308: Extend the test_transformer_has_set_adapters_method to
cover the safety-check path in LTXVideoTransformer3DModel.set_adapters by
asserting that after model.compile("CPU") calling model.set_adapters with a
non-empty adapter config (e.g., a dict or object representing adapters) raises a
controlled exception; wrap the call in pytest.raises(Exception) (or the specific
exception class your implementation raises, e.g., RuntimeError/ValueError) to
ensure the post-compile-but-uninitialized-adapters path is exercised and
prevents a segfault.
ℹ️ Review info
Configuration used: defaults
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (15)
- samples/cpp/video_generation/CMakeLists.txt
- samples/cpp/video_generation/README.md
- samples/cpp/video_generation/lora_text2video.cpp
- samples/python/video_generation/README.md
- samples/python/video_generation/lora_text2video.py
- src/cpp/include/openvino/genai/video_generation/generation_config.hpp
- src/cpp/include/openvino/genai/video_generation/ltx_video_transformer_3d_model.hpp
- src/cpp/src/video_generation/generation_config_utils.cpp
- src/cpp/src/video_generation/ltx_pipeline.hpp
- src/cpp/src/video_generation/models/ltx_video_transformer_3d_model.cpp
- src/python/openvino_genai/py_openvino_genai.pyi
- src/python/py_video_generation_models.cpp
- tests/python_tests/requirements.txt
- tests/python_tests/test_video_generation.py
- tests/python_tests/test_vlm_pipeline.py
@codex review
This PR builds upon the work started by @Eshaan-byte in openvinotoolkit#3309. I've integrated his logic and added the fixes for the set_adapters bindings and AdapterController initialization to resolve the CI issues.
Changes:
- Added a Python binding for `LTXVideoTransformer3DModel.set_adapters()` to resolve `AttributeError` in the `TestLoRAVideoGeneration` test suite.
- Added a safety check in `set_adapters()` to prevent segmentation faults when the `AdapterController` is not initialized (e.g., when no adapters are provided at construction).

Closes openvinotoolkit#3306
Supersedes openvinotoolkit#3309
Verification Results:
without LoRA

with LoRA (Adapter: ltxv_095_cakeify_lora.safetensors)

These samples demonstrate that the LoRA adapter is successfully loaded and applied during the denoising process in the Text2VideoPipeline.
Environment:
Model: LTX-Video
Alpha: 1.5
Prompt: "A cute golden retriever puppy running in a green grassy field on a sunny day, high quality, photorealistic"
Checklist:
- `tests/python_tests/test_video_generation.py` passes locally.

Summary by CodeRabbit
Release Notes
New Features
Documentation