
Conversation

@leejianwoo-collab commented Dec 14, 2025

Fix GLM-4.1V dummy video input generation failure

Problem

During dummy video input generation, GLM-4.1V engine initialization fails with ValueError: t:1 must be larger than temporal_factor:2.

Root Cause

  1. Issue Location: vllm/model_executor/models/glm4_1v.py:995, in the get_num_frames_with_most_features() function
  2. Problem: The function returns max(max_frames_per_video, 1), which can fall back to 1 when the video token allocation is too small
  3. Constraint Violation: GLM-4.1V's smart_resize function requires num_frames > temporal_factor=2 (strict inequality)
  4. Failure: num_frames=1 violates 1 > 2, raising a ValueError that crashes the engine core (see the sketch after the stack trace below)

Error Stack Trace

ValueError: t:1 must be larger than temporal_factor:2
RuntimeError: Engine core initialization failed. See root cause above. Failed core proc(s): {'EngineCore_DP0': 1}
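
For reference, a minimal sketch of the kind of check that produces this error. It is illustrative only, not copied from glm4_1v.py, so the function name and default are assumptions:

def smart_resize_sketch(t: int, temporal_factor: int = 2) -> None:
    # GLM-4.1V rejects inputs with no more frames than the temporal factor,
    # so a 1-frame dummy video fails before any resizing happens.
    if t <= temporal_factor:
        raise ValueError(f"t:{t} must be larger than temporal_factor:{temporal_factor}")

Because the dummy video falls back to a single frame, this check fires during profiling and the engine core never finishes initializing.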

Solution

Change the minimum fallback value from 1 to 3 in get_num_frames_with_most_features():

# Before (line 995)
return max(max_frames_per_video, 1)

# After (line 995)  
return max(max_frames_per_video, 3)
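
For context, a simplified sketch of how the fallback is reached, modeled on the analogous processing-info helpers elsewhere in vLLM; the GLM-4.1V version may differ in detail, and _MAX_FRAMES_PER_VIDEO plus the derivation of max_total_frames are assumptions:

def get_num_frames_with_most_features_sketch(max_total_frames: int,
                                              max_videos: int) -> int:
    _MAX_FRAMES_PER_VIDEO = 16  # illustrative cap, not the real constant
    # Split the dummy-video frame budget across all videos; with a tight
    # token budget this quotient can drop to 0 or 1.
    max_frames_per_video = min(max_total_frames // max(max_videos, 1),
                               _MAX_FRAMES_PER_VIDEO)
    return max(max_frames_per_video, 3)  # previously max(..., 1)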

Why 3?

  • GLM-4.1V constraint: num_frames > temporal_factor=2
  • Minimum valid value: num_frames >= 3
  • Our fix ensures: 3 > 2 ✓ (constraint satisfied)

Impact

  • Fixes: GLM-4.1V engine initialization failure for text-only/image-only inference
  • Scope: Only affects GLM-4.1V dummy video input generation fallback case
  • Backward Compatible: No breaking changes to existing functionality
  • Performance: Minimal impact; only affects the edge case where the video token allocation is insufficient

Testing

  • Verified fix addresses the specific constraint violation
  • Existing GLM-4.1V tests should continue to pass
  • Created test case to validate the minimum frame count requirement (a sketch follows this list)
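
A minimal pytest sketch of the kind of check intended here. It mirrors the patched fallback expression directly rather than calling the real processing-info class, so it is illustrative and not the actual test added in this PR:

import pytest

TEMPORAL_FACTOR = 2  # GLM-4.1V: num_frames must be strictly greater than this

@pytest.mark.parametrize("max_frames_per_video", [0, 1, 2, 8])
def test_fallback_exceeds_temporal_factor(max_frames_per_video):
    # Same expression as the patched return in get_num_frames_with_most_features().
    num_frames = max(max_frames_per_video, 3)
    assert num_frames > TEMPORAL_FACTOR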

Related Issues

Fixes #30638 ([Bug]: GLM-4.1V dummy video input generation fails with ValueError (num_frames=1 < temporal_factor=2))

Checklist

  • Identified root cause in get_num_frames_with_most_features()
  • Implemented minimal fix changing fallback from 1 to 3
  • Verified fix satisfies GLM-4.1V's num_frames > temporal_factor=2 constraint
  • Created test case for validation
  • Confirmed no impact on other model implementations

@chatgpt-codex-connector

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.

@github-actions

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only fastcheck CI runs, covering a small but essential subset of CI tests to catch errors quickly.

You can ask your reviewers to trigger select CI tests on top of fastcheck CI.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run full CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

If you have any questions, please reach out to us on Slack at https://slack.vllm.ai.

🚀

@gemini-code-assist bot (Contributor) left a comment


Code Review

This pull request correctly addresses a ValueError that occurs during GLM-4.1V engine initialization. The root cause, related to dummy video input generation, has been accurately identified, and the proposed fix of changing the minimum fallback value for frames is effective. My review includes one suggestion to enhance the code's maintainability by programmatically deriving this fallback value instead of using a hardcoded number.
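
One possible shape of that suggestion, deriving the minimum from the processor's temporal patch size rather than hardcoding 3. This is illustrative only; the helper name and the use of temporal_patch_size are assumptions, not the actual vLLM or GLM-4.1V API:

def min_dummy_frames(temporal_patch_size: int) -> int:
    # Smallest frame count satisfying num_frames > temporal_factor, so the
    # fallback stays correct even if the temporal factor ever changes.
    return temporal_patch_size + 1

# In get_num_frames_with_most_features() this could replace the hardcoded 3,
# e.g. return max(max_frames_per_video, min_dummy_frames(temporal_patch_size))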

@DarkLight1337 (Member) left a comment


Which model repo are you using and what is your transformers version? I can't seem to reproduce the problem even after hardcoding this method to return 1.
