-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Automatically retry when HuggingFace Hub raises 429 HTTP error
#4845
opened Mar 28, 2025 by
fzyzcjy
Loading…
6 tasks
test: reduce
mem_fraction_static
for gemma3 vision test
#4840
opened Mar 28, 2025 by
vhain
Loading…
6 tasks
Fix DeepSeek V3 cannot run on 4x8xH100
high priority
#4836
opened Mar 28, 2025 by
fzyzcjy
Loading…
6 tasks
Clean up
import vllm
in quantization/__init__.py
high priority
#4834
opened Mar 28, 2025 by
merrymercy
Loading…
Support Page Size > 1 for FA3
high priority
#4832
opened Mar 27, 2025 by
hebiao064
Loading…
1 of 7 tasks
add __half22bfloat162 __bfloat1622half2 for awq_kernel with cu121 SDK
#4820
opened Mar 27, 2025 by
yiakwy-xpu-ml-framework-team
Loading…
1 of 6 tasks
[sgl-kernel] per token group quant support COLUMN MAJOR
#4817
opened Mar 27, 2025 by
BBuf
Loading…
2 of 8 tasks
Support non-attention path operators in Triton
#4792
opened Mar 26, 2025 by
JackChuang
Loading…
3 of 6 tasks
Improving Total Token Throughput by 1%: Reducing CPU Overhead in Zero-Overhead Scheduling
#4790
opened Mar 26, 2025 by
WANG-GH
Loading…
Fix AttributeError in scheduler's release_memory_occupation method
#4789
opened Mar 26, 2025 by
GeLee-Q
Loading…
6 tasks
Fix wrong variable name when stopping memory profile
#4772
opened Mar 25, 2025 by
Fr4nk1inCs
Loading…
6 tasks
[Feature] Support DeepEP Low Latency
high priority
#4767
opened Mar 25, 2025 by
liz-badada
Loading…
1 of 6 tasks
Add integration test for Flash Attention 3
#4760
opened Mar 25, 2025 by
yubofredwang
Loading…
2 of 6 tasks
fix typo: disagg_prefill_infight_queue -> disagg_prefill_inflight_queue
#4757
opened Mar 25, 2025 by
GaoYusong
Loading…
1 of 6 tasks
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.