Skip to content

Pull requests: sgl-project/sglang

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[diffusion] refactor: separate runtime metadata from arch config diffusion SGLang Diffusion lora
#22678 opened Apr 13, 2026 by mickqian Collaborator Draft
5 tasks
Add Intel nightly tests for XPU and CPU platforms deepseek
#22677 opened Apr 13, 2026 by MingxuZh Contributor Loading…
[NPU] Support Qwen3.5-MoE and Qwen3-Next quantization
#22674 opened Apr 13, 2026 by Dmovic Loading…
5 tasks
[Perf] Precompute gemma_weight to avoid redundant add on every forward
#22673 opened Apr 13, 2026 by Chen-0210 Contributor Loading…
5 tasks
reland [Diffusion] Add FLUX.1-dev ModelOpt NVFP4 support blackwell SM100/SM120 diffusion SGLang Diffusion documentation Improvements or additions to documentation jit-kernel quant LLM Quantization run-ci
#22672 opened Apr 13, 2026 by BBuf Collaborator Loading…
[Draft PR] update CPU CI suite run-ci
#22670 opened Apr 13, 2026 by 1pikachu Contributor Loading…
feat: Support flashinfer_cutedsl MoE runner with flashinfer alltoall backend
#22669 opened Apr 13, 2026 by samuellees Contributor Loading…
5 tasks done
[diffusion] model: support Ltx 2.3 two stage ti2v diffusion SGLang Diffusion
#22667 opened Apr 13, 2026 by mickqian Collaborator Draft
5 tasks
Qwen3next flashinfer allreduce auto enable
#22664 opened Apr 13, 2026 by BBuf Collaborator Loading…
5 tasks
[Hotfix] final fixes for P2P Transfer deepseek
#22663 opened Apr 13, 2026 by JD-ETH Contributor Loading…
[VLM] Reduce GPU memory footprint of CUDA IPC MM feature transport run-ci
#22662 opened Apr 13, 2026 by yhyang201 Collaborator Loading…
5 tasks
Fix/amd wheel jit kernel support dependencies Pull requests that update a dependency file documentation Improvements or additions to documentation
#22661 opened Apr 13, 2026 by akao-amd Contributor Loading…
5 tasks
Add sleep/wake support for diffusion engine diffusion SGLang Diffusion documentation Improvements or additions to documentation
#22659 opened Apr 13, 2026 by MikukuOvO Contributor Draft
3 of 5 tasks
PD streaming: batch notify + SSE fast path run-ci
#22658 opened Apr 13, 2026 by inkcherry Contributor Loading…
5 tasks
[XPU] Support apply_router_weight_on_input for Llama4 for fused_experts quant LLM Quantization
#22654 opened Apr 13, 2026 by rahulvijayaraghavan Contributor Loading…
[Docker] Remove flashinfer cache copy
#22653 opened Apr 13, 2026 by mmangkad Contributor Loading…
enable streaming session retract tests
#22651 opened Apr 13, 2026 by hnyls2002 Collaborator Loading…
1 task
Upd: MoRI amd
#22646 opened Apr 13, 2026 by HaiShaw Collaborator Loading…
5 tasks
env: add knob to control SWA eviction interval
#22645 opened Apr 13, 2026 by happierpig Contributor Loading…
5 tasks
Replace all-reduce + dp_scatter with reduce_scatterv for DP attention run-ci
#22642 opened Apr 12, 2026 by YAMY1234 Contributor Loading…
3 of 5 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.