Pull requests: vllm-project/vllm

Pull requests list

chores: adjust the attn register param order
#30688 opened Dec 15, 2025 by ILikeIneine
5 tasks
Triton Attention: Support cross-layers blocks
Labels: v1
#30687 opened Dec 15, 2025 by orozery
[WIP] Fix docker build cache
Labels: ci/build, ready
#30686 opened Dec 15, 2025 by wzshiming
5 tasks
[MM Encoder]: Migrate legacy ViT MultiHeadAttention to new MMEncoderAttention interface
Labels: llama, tpu, v1
#30684 opened Dec 15, 2025 by Isotr0py Draft
3 of 5 tasks
[UT][PCP&DCP] UT for block_table.py
Labels: v1
#30683 opened Dec 15, 2025 by pisceskkk
[Hardware] Replace torch.cuda.empty_cache with torch.accelerator.empty_cache
Labels: documentation, nvidia, v1
#30681 opened Dec 15, 2025 by jikunshang Draft
5 tasks
[Model] Add video input support for transformers modeling backend
Labels: documentation, multi-modality, v1
#30680 opened Dec 15, 2025 by ch3nku1
[Docs] Update design/multiprocessing.md
Labels: documentation
#30677 opened Dec 15, 2025 by windsonsea
Sihao issue586 fix
Labels: v1
#30676 opened Dec 15, 2025 by 1643661061leo
5 tasks
[Refactor] [2/N] Move tool parsers into the vLLM main directory
Labels: deepseek, documentation, frontend, gpt-oss, llama, qwen, ready, tool-calling
#30675 opened Dec 15, 2025 by chaunceyjiang
5 tasks
[Bugfix] Fix multimodal configuration for Qwen3VL MOE model
Labels: qwen, ready
#30670 opened Dec 15, 2025 by maxyanghu
2 of 5 tasks
Milestone: v0.13.0
[Doc] Add AI Badgr framework integration documentation
Labels: documentation
#30669 opened Dec 15, 2025 by miguelmanlyx
4 of 5 tasks
Phase 3 hybrid attention
Labels: documentation, llama, new-model, performance, qwen, speculative-decoding, v1
#30664 opened Dec 15, 2025 by RGBmarya
Strengthen input validation and tests for 'parse_raw_prompts'.
Labels: ready
#30652 opened Dec 14, 2025 by mivehk
3 of 5 tasks
[Bugfix] CustomAR + TritonAttn[AMPERE] + FULL_CG - gpt-oss
Labels: gpt-oss, nvidia
#30650 opened Dec 14, 2025 by bbrowning
fix: fix engine initialization fails with ValueError
#30645 opened Dec 14, 2025 by leejianwoo-collab
5 tasks done