[Model Runner V2] Do not initialize sampler for non-last PP ranks by WoosukKwon · Pull Request #36824 · vllm-project/vllm

WoosukKwon · 2026-03-11T21:47:58Z

Skip the initialization of Sampler (and a few sample-related classes) for non-last PP ranks or for pooling models.

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

gemini-code-assist

Code Review

This pull request introduces an optimization to skip the initialization of the Sampler and related classes for non-last pipeline parallel ranks and for pooling models. The changes correctly make the initialization of these components conditional, which avoids unnecessary resource allocation. All usages of these potentially uninitialized components are now properly guarded with conditional checks or assertions, ensuring runtime safety. The related modifications in input_batch.py to handle an optional output_bin_counts tensor are also implemented correctly. The changes are logical, well-contained, and effectively achieve the intended optimization.

mergify · 2026-03-11T21:51:48Z

Hi @WoosukKwon, the pre-commit checks have failed. Please run:

uv pip install pre-commit
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

Tip

Is mypy failing?

mypy is run differently in CI. If the failure is related to this check, please use the following command to run it locally:

# For mypy (substitute "3.10" with the failing version if needed)
pre-commit run --hook-stage manual mypy-3.10

njhill

Nice :)

We've also always done this in V1 .. it had been on my to-do list to fix that too

I think we can similarly conditionally create the pooling_runner in load_model (also no need if not last pp rank)

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

njhill · 2026-03-11T23:37:40Z

vllm/v1/worker/gpu/model_runner.py

        )
-        if self.is_pooling_model:
+        if self.is_pooling_model and self.is_last_pp_rank:
            self.pooling_runner = PoolingRunner(self.model)


We may still need to get/store the supported tasks here ... since that's queried from the front-end and the executor returns the result from rank 0.

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

WoosukKwon added 3 commits March 11, 2026 21:39

[Model Runner V2] Do not initialize sampler for non-PP ranks

5d8ede5

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

fix

9f141fb

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

minor

793b724

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

WoosukKwon requested a review from njhill as a code owner March 11, 2026 21:47

WoosukKwon added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 11, 2026

mergify bot added the v1 label Mar 11, 2026

gemini-code-assist bot reviewed Mar 11, 2026

View reviewed changes

njhill approved these changes Mar 11, 2026

View reviewed changes

WoosukKwon added 2 commits March 11, 2026 23:18

Merge branch 'main' into woosuk/mrv2-sampler-last-pp

361e71b

pre-commit & review comment

bd185b3

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

WoosukKwon enabled auto-merge (squash) March 11, 2026 23:23

njhill reviewed Mar 11, 2026

View reviewed changes

WoosukKwon added 2 commits March 12, 2026 01:08

Merge branch 'main' into woosuk/mrv2-sampler-last-pp

82de208

fix PP + pooling

bf37e3d

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

njhill approved these changes Mar 12, 2026

View reviewed changes

WoosukKwon merged commit 2f8b4ce into main Mar 12, 2026
52 checks passed

WoosukKwon deleted the woosuk/mrv2-sampler-last-pp branch March 12, 2026 03:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Model Runner V2] Do not initialize sampler for non-last PP ranks#36824

[Model Runner V2] Do not initialize sampler for non-last PP ranks#36824
WoosukKwon merged 7 commits intomainfrom
woosuk/mrv2-sampler-last-pp

WoosukKwon commented Mar 11, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

mergify bot commented Mar 11, 2026

Uh oh!

njhill left a comment

Uh oh!

njhill Mar 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

WoosukKwon commented Mar 11, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

mergify bot commented Mar 11, 2026

Uh oh!

njhill left a comment

Choose a reason for hiding this comment

Uh oh!

njhill Mar 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants