[Tests] Skip model weight download for render-only test server by sagearc · Pull Request #36813 · vllm-project/vllm

sagearc · 2026-03-11T19:28:51Z

Purpose

RemoteLaunchRenderServer inherits the full model weight download from RemoteVLLMServer.__init__, but the render server only needs the tokenizer — no weights are loaded at runtime.

Changes

Extract _pre_download_model hook from RemoteVLLMServer.__init__
Override it as a no-op in RemoteLaunchRenderServer
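
The two changes above amount to a template-method hook. The following is a minimal sketch of that shape, not the actual code in tests/utils.py: the class bodies, the `downloaded` list, and `_start_server` are hypothetical stand-ins used only to make the pattern observable.

```python
class RemoteVLLMServer:
    """Simplified stand-in for the test helper; not the real class."""

    def __init__(self, model: str) -> None:
        self.model = model
        # Hypothetical bookkeeping so the effect of the hook can be inspected.
        self.downloaded: list[str] = []
        # The download step is a separate method, so subclasses can
        # replace or skip it without re-implementing __init__.
        self._pre_download_model()
        self._start_server()

    def _pre_download_model(self) -> None:
        # Placeholder for the full pre-download done before server startup.
        self.downloaded.extend(["weights", "tokenizer"])

    def _start_server(self) -> None:
        # Placeholder for launching the server process.
        self.started = True


class RemoteLaunchRenderServer(RemoteVLLMServer):
    def _pre_download_model(self) -> None:
        # Render-only server: no weights are loaded at runtime,
        # so the weight download is skipped entirely.
        return None
```

With this layout the subclass changes only the download step; everything else in `__init__` is inherited unchanged.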

Test Plan

Existing render tests (test_launch_render.py) cover this path.

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

gemini-code-assist

Code Review

This pull request refactors the model pre-download logic to allow skipping it for the render-only test server, which is a sensible optimization. However, the new implementation for the render server completely skips any pre-download. Since the render server still requires the tokenizer, this could lead to test flakiness if the server startup times out while downloading it. I've provided a suggestion to explicitly pre-download only the tokenizer for the render server to make the tests more robust.
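
One way to read this suggestion is to restrict the pre-download to tokenizer-related files instead of skipping it. The sketch below only illustrates the file-selection step; the pattern list and the function name are assumptions, and real tokenizer file names vary by model. It does not reproduce the suggestion posted on the PR.

```python
from fnmatch import fnmatch

# Hypothetical patterns for tokenizer-related files; adjust per model.
TOKENIZER_PATTERNS = [
    "config.json",
    "tokenizer*",
    "vocab*",
    "merges.txt",
    "special_tokens_map.json",
]


def select_tokenizer_files(repo_files, patterns=TOKENIZER_PATTERNS):
    """Return the files matching any tokenizer pattern, in input order."""
    return [
        name
        for name in repo_files
        if any(fnmatch(name, pattern) for pattern in patterns)
    ]


# Example repository listing (made up for illustration).
example_files = [
    "config.json",
    "model-00001-of-00002.safetensors",
    "tokenizer.json",
    "tokenizer_config.json",
    "vocab.json",
    "merges.txt",
]
selected = select_tokenizer_files(example_files)
```

In practice a pattern list like this could be passed to a downloader that supports allow-lists, for example the `allow_patterns` argument of `huggingface_hub.snapshot_download`; whether the PR takes that route is not shown here.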

tests/utils.py

sagearc · 2026-03-11T19:52:22Z

cc @DarkLight1337 @NickLucche

Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

gemini-code-assist bot reviewed Mar 11, 2026

tests/utils.py (outdated, resolved)

sagearc added 2 commits March 11, 2026 21:55

[Tests] Skip model weight download for render-only test server

52c7ad2

Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

download only tokenizer

4e3d23e

Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>

sagearc force-pushed the fix-render-server-model-download branch from 3bfce13 to 4e3d23e on March 11, 2026 19:55

Merge branch 'main' into fix-render-server-model-download

c3a6123

DarkLight1337 approved these changes Mar 12, 2026

Merge branch 'main' into fix-render-server-model-download

d875bb3

sagearc wants to merge 4 commits into vllm-project:main from sagearc:fix-render-server-model-download
2 participants