-
Notifications
You must be signed in to change notification settings - Fork 6.8k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Spec] Add DSpark block speculative decoding for DeepSeek-V4
deepseek
speculative-decoding
#29538
opened Jun 28, 2026 by
adityakamat24
Contributor
Loading…
Fix SHM feature finalization under DP attention
run-ci
#29536
opened Jun 28, 2026 by
yinghai
Contributor
Loading…
[scheduler] Add scheduler metrics reporter init hook
run-ci
#29535
opened Jun 28, 2026 by
yinghai
Contributor
Loading…
Fix MegaMOE FP4 fallback runner
deepseek
#29534
opened Jun 28, 2026 by
ronhuafeng
•
Draft
5 tasks done
feat(mem_cache): page-major (layer-major within a page) KV/state layout
bypass-fastfail
documentation
Improvements or additions to documentation
run-ci
run-ci-extra
#29533
opened Jun 28, 2026 by
ch-wan
Collaborator
Loading…
5 tasks done
fix: warn about non-obvious chunked prefill budget behavior
#29531
opened Jun 27, 2026 by
ntny
Loading…
fix(quant): load W8A8 int8 checkpoints with per-shard static input scales
quant
LLM Quantization
#29530
opened Jun 27, 2026 by
Sunt-ing
Contributor
Loading…
2 tasks done
[Fix] TokenizerManager: avoid KeyError race in _wait_one_response
#29529
opened Jun 27, 2026 by
GodlyDonuts
Loading…
2 tasks done
[metrics] Add SWA prefix-cache truncation counter
#29528
opened Jun 27, 2026 by
brucechanglongxu
Contributor
Loading…
3 of 5 tasks
observability: add tokenizer event-loop-lag metric
#29527
opened Jun 27, 2026 by
Kangyan-Zhou
Collaborator
•
Draft
Fix stale nsa.* imports in dsv4 compressor
#29526
opened Jun 27, 2026 by
a-m-n-s
Loading…
2 of 5 tasks
[Feature] Add DeepEPv2 (ElasticBuffer) MoE A2A backend
deepseek
#29525
opened Jun 27, 2026 by
MengYu10151
Loading…
5 tasks done
[MoE] Raise clear error for DeepEP normal dispatch in flashinfer_cutedsl FP4
#29523
opened Jun 27, 2026 by
JustinTong0323
Collaborator
Loading…
2 tasks done
fix: fix prefill-aware SWA floor tracking
documentation
Improvements or additions to documentation
run-ci
#29520
opened Jun 27, 2026 by
mickqian
Collaborator
Loading…
5 tasks
[diffusion] warmup: default to model sampling resolution (declare Z-Image default)
diffusion
SGLang Diffusion
#29519
opened Jun 27, 2026 by
mickqian
Collaborator
Loading…
[Docs] Fix broken cross-page links in cookbook and Ascend NPU docs
documentation
Improvements or additions to documentation
#29518
opened Jun 27, 2026 by
ajinkyajawale14499
Loading…
3 tasks done
Fix SGLANG_GRPC_PORT overflow for high --port values
#29517
opened Jun 27, 2026 by
ajinkyajawale14499
Loading…
3 of 5 tasks
[diffusion] fix --warmup silently downgrading server-based warmup to request mode
diffusion
SGLang Diffusion
run-ci
#29514
opened Jun 27, 2026 by
mickqian
Collaborator
Loading…
[feat] Sana-WM Triton Optimizations
diffusion
SGLang Diffusion
jit-kernel
#29513
opened Jun 27, 2026 by
sjmshsh
Contributor
Loading…
[NPU]GLM-4.7-Flash optimize with fused kernels
deepseek
npu
#29509
opened Jun 27, 2026 by
Estrella-xx
Contributor
Loading…
1 of 5 tasks
[Bugfix] fix quickreduce acc error in cudagraph mode
sgl-kernel
#29508
opened Jun 27, 2026 by
haoyangli0109
Contributor
Loading…
[FEAT][SpecDecode] Add DP attention support for DFLASH speculative decoding
speculative-decoding
#29506
opened Jun 27, 2026 by
EanWang211123
Contributor
Loading…
5 tasks
[NPU] Qwen3-VL-30B use split_qkv_rmsnorm_rope for extend
run-ci
#29505
opened Jun 27, 2026 by
silencejade
Contributor
Loading…
5 tasks
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.