-
Notifications
You must be signed in to change notification settings - Fork 3.3k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
update usage of
trtllm_fp8_per_tensor_scale_moe
run-ci
#12569
opened Nov 3, 2025 by
b8zhong
Loading…
docs: document video-capable multimodal models
express-lane
A PR may be merged without a full CI check
#12565
opened Nov 3, 2025 by
WazupSteve
Loading…
1 of 4 tasks
[router] Add version command support to SMG
#12558
opened Nov 3, 2025 by
tonyluj
Loading…
1 of 4 tasks
disagg(decode): respect ignore_eos in quick-finish EOS branch
#12557
opened Nov 3, 2025 by
cscyuge
Loading…
4 tasks
Fix incorrect handling of max_tokens=0 in chat requests
#12556
opened Nov 3, 2025 by
Chen-0210
Loading…
1 task done
Fix(test): Skip MLA backend test on non-Hopper (sm_80) GPUs
#12552
opened Nov 3, 2025 by
yan0422de
Loading…
1 of 4 tasks
[Reasoning + Structured Output] make reasoning compatible with structured output
#12551
opened Nov 3, 2025 by
Muqi1029
Loading…
1 of 4 tasks
Super tiny dump server info such as args in bench for post analysis
run-ci
#12550
opened Nov 3, 2025 by
fzyzcjy
Loading…
4 tasks
Super tiny allow profile activities in bench_serving
run-ci
#12549
opened Nov 3, 2025 by
fzyzcjy
Loading…
4 tasks
[Doc] fix miss index for production request trace
#12547
opened Nov 3, 2025 by
stmatengss
Loading…
4 tasks
Enable Flashinfer TRTLLM-GEN-MoE FP8 blockwise kernel for Qwen3-Next on Blackwell
#12543
opened Nov 3, 2025 by
samuellees
Loading…
2 of 4 tasks
[WIP] feat: Enable workload tracing and kernel optimization via FlashInfer-Bench
high priority
#12542
opened Nov 3, 2025 by
dtunai
Loading…
4 tasks
Add AutoWeightsLoader utility for simplified weight loading
#12534
opened Nov 3, 2025 by
adityakamat24
Loading…
1 of 4 tasks
Enable Mooncake as the
--moe-a2a-backend on CI
#12528
opened Nov 3, 2025 by
UNIDY2002
Loading…
4 tasks
[cpu] Implement all gather/reduce for arm64 cpu
#12527
opened Nov 3, 2025 by
cyb70289
Loading…
2 of 4 tasks
[Deterministic] Optimize bmm_batch_invariant op
run-ci
#12522
opened Nov 2, 2025 by
zminglei
Loading…
4 tasks
[Test] Add DeepSeekV3.2 NSA Indexer Test Suite
#12520
opened Nov 2, 2025 by
Johnsonms
Loading…
4 tasks done
[chore] Fix update_kernel_whl_index script for multiple cuda version
run-ci
#12519
opened Nov 2, 2025 by
Fridge003
Loading…
4 tasks
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.