Skip to content

Pull requests: sgl-project/sglang

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix SHM feature finalization under DP attention run-ci
#29536 opened Jun 28, 2026 by yinghai Contributor Loading…
[scheduler] Add scheduler metrics reporter init hook run-ci
#29535 opened Jun 28, 2026 by yinghai Contributor Loading…
Fix MegaMOE FP4 fallback runner deepseek
#29534 opened Jun 28, 2026 by ronhuafeng Draft
5 tasks done
feat(mem_cache): page-major (layer-major within a page) KV/state layout bypass-fastfail documentation Improvements or additions to documentation run-ci run-ci-extra
#29533 opened Jun 28, 2026 by ch-wan Collaborator Loading…
5 tasks done
fix(quant): load W8A8 int8 checkpoints with per-shard static input scales quant LLM Quantization
#29530 opened Jun 27, 2026 by Sunt-ing Contributor Loading…
2 tasks done
[Fix] TokenizerManager: avoid KeyError race in _wait_one_response
#29529 opened Jun 27, 2026 by GodlyDonuts Loading…
2 tasks done
[metrics] Add SWA prefix-cache truncation counter
#29528 opened Jun 27, 2026 by brucechanglongxu Contributor Loading…
3 of 5 tasks
observability: add tokenizer event-loop-lag metric
#29527 opened Jun 27, 2026 by Kangyan-Zhou Collaborator Draft
Fix stale nsa.* imports in dsv4 compressor
#29526 opened Jun 27, 2026 by a-m-n-s Loading…
2 of 5 tasks
[Feature] Add DeepEPv2 (ElasticBuffer) MoE A2A backend deepseek
#29525 opened Jun 27, 2026 by MengYu10151 Loading…
5 tasks done
[MoE] Raise clear error for DeepEP normal dispatch in flashinfer_cutedsl FP4
#29523 opened Jun 27, 2026 by JustinTong0323 Collaborator Loading…
2 tasks done
fix: fix prefill-aware SWA floor tracking documentation Improvements or additions to documentation run-ci
#29520 opened Jun 27, 2026 by mickqian Collaborator Loading…
5 tasks
[diffusion] warmup: default to model sampling resolution (declare Z-Image default) diffusion SGLang Diffusion
#29519 opened Jun 27, 2026 by mickqian Collaborator Loading…
[Docs] Fix broken cross-page links in cookbook and Ascend NPU docs documentation Improvements or additions to documentation
#29518 opened Jun 27, 2026 by ajinkyajawale14499 Loading…
3 tasks done
Fix SGLANG_GRPC_PORT overflow for high --port values
#29517 opened Jun 27, 2026 by ajinkyajawale14499 Loading…
3 of 5 tasks
Fix LongCat MLP tensor parallelism
#29515 opened Jun 27, 2026 by ixxiii Loading…
[feat] Sana-WM Triton Optimizations diffusion SGLang Diffusion jit-kernel
#29513 opened Jun 27, 2026 by sjmshsh Contributor Loading…
[NPU]GLM-4.7-Flash optimize with fused kernels deepseek npu
#29509 opened Jun 27, 2026 by Estrella-xx Contributor Loading…
1 of 5 tasks
[Bugfix] fix quickreduce acc error in cudagraph mode sgl-kernel
#29508 opened Jun 27, 2026 by haoyangli0109 Contributor Loading…
[NPU] Qwen3-VL-30B use split_qkv_rmsnorm_rope for extend run-ci
#29505 opened Jun 27, 2026 by silencejade Contributor Loading…
5 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.