Skip to content

Add B300 DeepSeek V4 aggregate recipes#164

Draft
YAMY1234 wants to merge 1 commit into
NVIDIA:mainfrom
YAMY1234:yangminl/dsv4-b300-recipes
Draft

Add B300 DeepSeek V4 aggregate recipes#164
YAMY1234 wants to merge 1 commit into
NVIDIA:mainfrom
YAMY1234:yangminl/dsv4-b300-recipes

Conversation

@YAMY1234
Copy link
Copy Markdown
Collaborator

@YAMY1234 YAMY1234 commented May 19, 2026

Summary

  • Add a B300 DeepSeek-V4-Pro aggregate DP8/DP-attention/DeepEP recipe for the 8k/1k high-throughput Pareto sweep through concurrency 2048.
  • Add a B300 DeepSeek-V4-Pro aggregate TP8/no-DP-attention recipe for the 8k/1k low-latency Pareto sweep.
  • Use the container-provided SGLang and DeepGEMM stack with no local source, package, or DeepGEMM overlay mounts.
  • Align the TP8 recipe with the 2026-05-19 submission HTML source config: mixed chunk enabled, chunked-prefill/max-prefill 8192, scheduler recv interval 30, and DeepGEMM precompile.

Testing

  • python3 YAML parse for both recipes
  • .venv/bin/srtctl dry-run -f recipes/dsv4-pro/sglang/b300-fp4/8k1k/agg/stp/agg-low-latency-tp8.yaml
  • .venv/bin/srtctl dry-run -f recipes/dsv4-pro/sglang/b300-fp4/8k1k/agg/stp/agg-max-tpt-dp8.yaml

@codecov-commenter
Copy link
Copy Markdown

codecov-commenter commented May 19, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
⚠️ Please upload report for BASE (main@078ee8b). Learn more about missing BASE report.

Additional details and impacted files
@@           Coverage Diff           @@
##             main     #164   +/-   ##
=======================================
  Coverage        ?   65.10%           
=======================================
  Files           ?       67           
  Lines           ?     8217           
  Branches        ?        0           
=======================================
  Hits            ?     5350           
  Misses          ?     2867           
  Partials        ?        0           

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@YAMY1234 YAMY1234 force-pushed the yangminl/dsv4-b300-recipes branch from bd677af to 78b7c02 Compare May 22, 2026 04:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants