-
Notifications
You must be signed in to change notification settings - Fork 53
Pull requests: NVIDIA/srt-slurm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP] ADd Kimi-K2.5 FP4 TRTLLM GB200/GB300 configs
#204
opened Jun 8, 2026 by
xinli-sw
Collaborator
Loading…
Accept remote telemetry container aliases in preflight
#202
opened Jun 6, 2026 by
fallintoplace
Loading…
Fix preflight for remote model and container aliases
#201
opened Jun 6, 2026 by
fallintoplace
Loading…
Add DeepSeek-V4-Pro 8k/1k SA recipes for GB300 (MTP-off + MTP-on, MXFP4)
#192
opened Jun 2, 2026 by
nv-yna
Loading…
2 of 3 tasks
Draft: merge Q2 submission support into main without recipes
#191
opened May 29, 2026 by
jasonlizhengjian
Contributor
•
Draft
Cherry-pick Dynamo wheel install support to Q2
#184
opened May 29, 2026 by
jasonlizhengjian
Contributor
Loading…
Update GB300 FP4 GLM5 low-latency sweep
#175
opened May 25, 2026 by
weireweire
Collaborator
Loading…
[NOT RATE MATCHED]Add NVFP4 WideEP disaggregated DEP8/DEP16/DEP32 recipes for Qwen3.5-397B-A17B
#167
opened May 20, 2026 by
xiaoweiw-nv
Loading…
vllm: set VLLM_NIXL_SIDE_CHANNEL_HOST to node's routable IP
#158
opened May 15, 2026 by
esmeetu
Loading…
2 tasks
vllm: support mooncake_kv_store + expose mooncake_master metrics
#157
opened May 14, 2026 by
esmeetu
Loading…
3 tasks done
fix(sglang): override recipe-pinned disaggregation-bootstrap-port
#155
opened May 14, 2026 by
zhengd-nv
Contributor
Loading…
fix(sa-bench): shard high-concurrency loadgen across frontends
#154
opened May 13, 2026 by
YAMY1234
Collaborator
Loading…
recipes(qwen3.5): refresh fp8 mtp-off wideep configs
#149
opened May 12, 2026 by
zhengd-nv
Contributor
Loading…
3 tasks done
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.