-
Notifications
You must be signed in to change notification settings - Fork 32
Pull requests: NVIDIA/srt-slurm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
recipes: add DeepSeek-V4 GB200 decode/prefill bench yamls
#124
opened Apr 30, 2026 by
esmeetu
Loading…
4 tasks
feat: add vLLM GB200 GSM8K repro configs
#106
opened Apr 28, 2026 by
alec-flowers
Collaborator
•
Draft
refactor(gpqa): drop structured runner; ship configs/gpqa/run.sh
#96
opened Apr 27, 2026 by
ishandhanani
Collaborator
Loading…
1 of 2 tasks
feat: peak gen throughput metric in sa-bench + server-side node metrics CSV export
#93
opened Apr 27, 2026 by
zhengd-nv
Loading…
Support vLLM DP ranks with tensor-parallel GPU groups
#86
opened Apr 27, 2026 by
nealvaidya
Loading…
feat: use pre-generated custom dataset for benchmarking MTP with chat template
#63
opened Apr 23, 2026 by
richardhuo-nv
Collaborator
Loading…
feat: SGLang decode slow_down for PD disagg nsys profiling (with skip-warmup workflow)
#60
opened Apr 23, 2026 by
zhengd-nv
Loading…
feat(profiling): add extra_nsys_args for optional nsys CLI flags
#59
opened Apr 23, 2026 by
zhengd-nv
Loading…
Built-in GPU performance monitoring during benchmarks
#35
opened Apr 13, 2026 by
KaunilD
Collaborator
Loading…
1 of 3 tasks
ProTip!
Follow long discussions with comments:>50.