
Support vLLM DP ranks with tensor-parallel GPU groups #86

Open
nealvaidya wants to merge 2 commits into NVIDIA:main from nealvaidya:nealv/vllm-dp-ep


@nealvaidya

Summary

  • Honor tensor-parallel-size when expanding vLLM data-parallel endpoints.
  • Support DP+TP shapes like DP2TP4 by launching one process per DP rank with the correct GPU group.
  • Preserve DP16TP1 behavior and validate mismatched DP*TP GPU allocations.
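The GPU grouping described in the summary can be sketched roughly as follows. This is a minimal illustration, not the code in `src/srtctl/backends/vllm.py`: the helper names `split_gpus_by_dp_rank` and `cuda_visible_devices` are hypothetical, and the sketch assumes each DP rank receives a contiguous slice of `tensor-parallel-size` GPUs and that the allocation must equal DP*TP exactly.

```python
from typing import List, Sequence


def split_gpus_by_dp_rank(
    gpu_ids: Sequence[int], dp_size: int, tp_size: int
) -> List[List[int]]:
    """Return one GPU group of length tp_size per DP rank.

    Hypothetical helper for illustration; assumes contiguous GPU groups.
    """
    if dp_size < 1 or tp_size < 1:
        raise ValueError("dp_size and tp_size must be positive")

    # Validate that the allocation matches the requested DP*TP shape.
    expected = dp_size * tp_size
    if len(gpu_ids) != expected:
        raise ValueError(
            f"DP{dp_size}TP{tp_size} needs {expected} GPUs, got {len(gpu_ids)}"
        )

    # DP rank r owns GPUs [r * tp_size, (r + 1) * tp_size).
    return [
        list(gpu_ids[rank * tp_size : (rank + 1) * tp_size])
        for rank in range(dp_size)
    ]


def cuda_visible_devices(group: Sequence[int]) -> str:
    """Format a GPU group as a CUDA_VISIBLE_DEVICES value (hypothetical helper)."""
    return ",".join(str(gpu) for gpu in group)


if __name__ == "__main__":
    # DP2TP4 on 8 GPUs: one process per DP rank, four GPUs each.
    for rank, group in enumerate(split_gpus_by_dp_rank(range(8), 2, 4)):
        print(f"dp_rank={rank} CUDA_VISIBLE_DEVICES={cuda_visible_devices(group)}")
```

With `tp_size=1` the same function yields one single-GPU group per rank, which is how a DP16TP1 shape would stay unchanged under this sketch.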

Test plan

  • uv run pytest tests/test_configs.py::TestVLLMDataParallelMode -q

Made with Cursor

Honor tensor-parallel-size when expanding vLLM data-parallel endpoints so configurations like DP2TP4 launch one process per DP rank with the correct GPU set, while preserving DP16TP1 behavior.

Signed-off-by: Neal Vaidya <nealv@nvidia.com>
Signed-off-by: Neal Vaidya <nealv@nvidia.com>
@codecov-commenter

Codecov Report

❌ Patch coverage is 92.85714% with 2 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (main@8e8b611). Learn more about missing BASE report.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| src/srtctl/backends/vllm.py | 92.85% | 2 Missing ⚠️ |

Additional details and impacted files
@@           Coverage Diff           @@
##             main      #86   +/-   ##
=======================================
  Coverage        ?   65.22%           
=======================================
  Files           ?       62           
  Lines           ?     7511           
  Branches        ?        0           
=======================================
  Hits            ?     4899           
  Misses          ?     2612           
  Partials        ?        0           

