-
-
Notifications
You must be signed in to change notification settings - Fork 12k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Feature]: Prometheus Metrics Abstraction
kv-connector
needs-rebase
speculative-decoding
v1
#30689
opened Dec 15, 2025 by
mladjan-gadzic
•
Draft
3 of 5 tasks
chores: adjust the attn register param order
#30688
opened Dec 15, 2025 by
ILikeIneine
Loading…
5 tasks
[Hardware] Replace Improvements or additions to documentation
nvidia
v1
torch.cuda.empty_cache with torch.accelerator.empty_cache
documentation
#30681
opened Dec 15, 2025 by
jikunshang
•
Draft
5 tasks
[Model] Add video input support for transformers modeling backend
documentation
Improvements or additions to documentation
multi-modality
Related to multi-modality (#4194)
v1
#30680
opened Dec 15, 2025 by
ch3nku1
Loading…
[CPU] Add action to automatically label CPU related PRs
ci/build
#30678
opened Dec 15, 2025 by
fadara01
Loading…
4 tasks
[Docs] Update design/multiprocessing.md
documentation
Improvements or additions to documentation
#30677
opened Dec 15, 2025 by
windsonsea
Loading…
[Refactor] [2/N] Move tool parsers into the vLLM main directory
deepseek
Related to DeepSeek models
documentation
Improvements or additions to documentation
frontend
gpt-oss
Related to GPT-OSS models
llama
Related to Llama models
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
tool-calling
#30675
opened Dec 15, 2025 by
chaunceyjiang
Loading…
5 tasks
[Bugfix] Fix missing first token in tool calls during reasoning-to-tool transition
frontend
#30671
opened Dec 15, 2025 by
mondaylord
Loading…
3 of 5 tasks
[Doc] Add AI Badgr framework integration documentation
documentation
Improvements or additions to documentation
#30669
opened Dec 15, 2025 by
miguelmanlyx
Loading…
4 of 5 tasks
Support GPU tensors in tensor_data() to enable GPU-accelerated multimodal preprocessing
v1
#30667
opened Dec 15, 2025 by
storyicon
Loading…
4 of 5 tasks
Phase 3 hybrid attention
documentation
Improvements or additions to documentation
llama
Related to Llama models
new-model
Requests to new models
performance
Performance-related issues
qwen
Related to Qwen models
speculative-decoding
v1
#30664
opened Dec 15, 2025 by
RGBmarya
Loading…
Strengthen input validation and tests for 'parse_raw_prompts’.
ready
ONLY add when PR is ready to merge/full CI is needed
#30652
opened Dec 14, 2025 by
mivehk
Loading…
3 of 5 tasks
[Bugfix] CustomAR + TritonAttn[AMPERE] + FULL_CG - gpt-oss
gpt-oss
Related to GPT-OSS models
nvidia
#30650
opened Dec 14, 2025 by
bbrowning
Loading…
[Perf] Eliminate padding and slicing op for GPT-OSS with Flashinfer MXFP4 MXFP8 MoE
ci/build
gpt-oss
Related to GPT-OSS models
#30647
opened Dec 14, 2025 by
elvischenv
•
Draft
5 tasks
fix: unsatisfiable testing dependencies caused by a version conflict
ci/build
#30646
opened Dec 14, 2025 by
leejianwoo-collab
Loading…
fix: fix engine initialization fails with ValueError
#30645
opened Dec 14, 2025 by
leejianwoo-collab
Loading…
5 tasks done
Auto-rebase PRs older than 40 commits compared to main
ci/build
#30643
opened Dec 14, 2025 by
khluu
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-11-15.