input prompts only, test qwen3 omni. #119
xipingyan wants to merge 2 commits into master_modular_genai from …en3_omni_text_input_only_samples
Conversation
Pull request overview
This PR adds a new text-only (prompt-only) pipeline configuration YAML for the Qwen3-Omni model. The configuration strips the vision/audio modules from the full config.yaml to create a simplified pipeline that processes text prompts through the TextEncoderModule → LLMInferenceSDPAModule → ResultModule chain, similar to the existing Qwen3.5-0.8B/config_text.yaml.

Changes:
- Added config_prompt.yaml for Qwen3-Omni with a text-only pipeline (prompt encoding → LLM inference → result collection), omitting the vision and audio preprocessing stages.
- Defined GPU device allocation for both the TextEncoderModule and LLMInferenceSDPAModule, with model paths pointing to the Qwen3-Omni-4B test models.
- Configured the pipeline to generate up to 512 tokens and collect the output through a ResultModule.
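Putting the pieces described above together, the text-only pipeline might look roughly like the following. This is an illustrative sketch only: the module types, the GPU device, the `pipeline_params.prompt` source, and the 512-token limit come from the PR text and diff excerpts, while the section names, remaining keys, and the model path placeholder are assumptions and will differ from the actual config_prompt.yaml.

```yaml
# Hypothetical sketch of a text-only (prompt-only) pipeline config.
# Keys and section names are assumptions; only the module types, device,
# prompt source, and token limit are taken from the PR.
pipeline_params:
  type: "ParameterModule"
  outputs:
    - name: "prompt"
      type: "String"

prompt_encoder:
  type: "TextEncoderModule"
  device: "GPU"
  model_path: "<path-to-Qwen3-Omni-4B-test-models>"   # placeholder
  inputs:
    - name: "prompt"
      source: "pipeline_params.prompt"
  outputs:
    - name: "input_ids"
      type: "OVTensor"
    - name: "mask"
      type: "OVTensor"

llm_inference:
  type: "LLMInferenceSDPAModule"
  device: "GPU"
  max_new_tokens: 512
  inputs:
    - name: "input_ids"
      source: "prompt_encoder.input_ids"

result:
  type: "ResultModule"
  inputs:
    - name: "output"
      source: "llm_inference.output"
```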
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
source: "pipeline_params.prompt"
outputs:
  - name: "input_ids"
    type: "OVTensor"
The prompt_encoder is missing the mask output declaration. The TextEncoderModule::run() method unconditionally writes to this->outputs["mask"].data for the QWEN3_OMNI model type (see src/cpp/src/module_genai/modules/md_text_encoder.cpp:159). While this won't crash (the map auto-inserts), it's inconsistent with every other TextEncoderModule config in the codebase — for example, the analogous text-only config at samples/cpp/module_genai/config_yaml/Qwen3.5-0.8B/config_text.yaml:21-22 and the sibling config.yaml at samples/cpp/module_genai/config_yaml/Qwen3-Omni/config.yaml:48-49 both declare the mask output. Add the mask output to maintain consistency and properly document the module's outputs.
Suggested change:
    type: "OVTensor"
  - name: "mask"
    type: "OVTensor"
type: "ParameterModule"
outputs:
  - name: "prompt"
    type: "String"
A blank line is missing between the pipeline_params and prompt_encoder module definitions. All other pipeline YAML configs in this directory consistently use blank lines to separate top-level modules (e.g., Qwen3.5-0.8B/config_text.yaml:10, Qwen3-Omni/config.yaml:12). Add a blank line after line 9 for consistency.
Suggested change:
    type: "String"

Signed-off-by: xiping.yan <xiping.yan@intel.com>
…en3_omni_text_input_only_samples
This pull request adds a new configuration YAML file for the Qwen3-Omni model pipeline in the samples/cpp/module_genai/config_yaml directory. The configuration defines the structure and flow for prompt encoding, LLM inference, and result formatting, enabling streamlined integration and testing of the Qwen3-Omni model.

New Qwen3-Omni pipeline configuration:
- Added config_prompt.yaml to specify the pipeline modules and their parameters for the Qwen3-Omni model, including prompt encoding, LLM inference, and result collection.
- Allocated GPU devices for the TextEncoderModule and LLMInferenceSDPAModule, supporting efficient model execution.
- Collected the output through a ResultModule, ensuring generated text is properly collected and formatted.

ISSUE: