Skip to content

Pull requests: NVIDIA/srt-slurm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

recipes: add DeepSeek-V4 GB200 decode/prefill bench yamls
#124 opened Apr 30, 2026 by esmeetu Loading…
4 tasks
feat: live in-flight batch-metrics snapshotter (opt-in)
#115 opened Apr 29, 2026 by YAMY1234 Collaborator Draft
1 of 3 tasks
multi process tokenizer in benchmark_serving
#114 opened Apr 29, 2026 by fzyzcjy Loading…
feat: add vLLM GB200 GSM8K repro configs
#106 opened Apr 28, 2026 by alec-flowers Collaborator Draft
feat(vllm): vllm gb200 dsv4 recipes
#103 opened Apr 28, 2026 by alec-flowers Collaborator Draft
refactor(gpqa): drop structured runner; ship configs/gpqa/run.sh
#96 opened Apr 27, 2026 by ishandhanani Collaborator Loading…
1 of 2 tasks
GB300 DSv4 042626
#84 opened Apr 26, 2026 by ywang96 Loading…
Add DeepSeek V4 GB200 recipes
#77 opened Apr 25, 2026 by alec-flowers Collaborator Loading…
[codex] Add asset materialization POC
#57 opened Apr 21, 2026 by ishandhanani Collaborator Loading…
Add GLM5 B200 FP8 disaggregated recipe
#50 opened Apr 21, 2026 by weireweire Collaborator Loading…
Add lm-eval benchmark runner for evals
#41 opened Apr 17, 2026 by Oseltamivir Contributor Loading…
Built-in GPU performance monitoring during benchmarks
#35 opened Apr 13, 2026 by KaunilD Collaborator Loading…
1 of 3 tasks
Enable nsys profiling for vLLM
#28 opened Apr 10, 2026 by leo-cf-tian Contributor Loading…
ProTip! Follow long discussions with comments:>50.