-
Notifications
You must be signed in to change notification settings - Fork 144
Pull requests: vllm-project/tpu-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add ragged_conv1d for ragged chunked kernel integration for gdn attention
ready
ONLY add when PR is ready to merge/full CI is needed
#2099
opened Mar 31, 2026 by
helloworld1
Loading…
[Environment variable override] Override ONLY add when PR is ready to merge/full CI is needed
VLLM_USE_AOT_COMPILE to False by default
ready
#2097
opened Mar 31, 2026 by
jrplatin
Loading…
Call ONLY add when PR is ready to merge/full CI is needed
_align_hybrid_block_size in TpuPlatform
ready
#2090
opened Mar 31, 2026 by
ShobhitBehl
Loading…
Set prompt_token_ids_cpu=None to match upstream interface change
ready
ONLY add when PR is ready to merge/full CI is needed
#2085
opened Mar 30, 2026 by
pv97
Loading…
Fix compressed tensors moe test.
ready
ONLY add when PR is ready to merge/full CI is needed
#2076
opened Mar 28, 2026 by
dmmolitor
Loading…
Do not submit.
ready
ONLY add when PR is ready to merge/full CI is needed
#2053
opened Mar 26, 2026 by
QiliangCui
Loading…
[RPA] Add bidirectional attention support for multimodal
#2046
opened Mar 26, 2026 by
kwang3939
Loading…
transcedentals cost in cost estimate
ready
ONLY add when PR is ready to merge/full CI is needed
#2044
opened Mar 26, 2026 by
coolkp
Loading…
Wire AWQ dense layers to use GMM V2 kernel for W4A16 matmul
#2038
opened Mar 26, 2026 by
rohan-reddy
Loading…
5 tasks done
[FP8][MoE] Move FP8 MoE weight requantization from CPU to TPU with cache clearing
#2028
opened Mar 25, 2026 by
rohan-reddy
Loading…
[Parallelism Support Matrix Tests] Replace flaky EP relative comparison with hardcoded absolute baseline
ready
ONLY add when PR is ready to merge/full CI is needed
#1984
opened Mar 20, 2026 by
syhuang22
Loading…
[Fused MoE] Use jax.nn.sigmoid
ready
ONLY add when PR is ready to merge/full CI is needed
#1980
opened Mar 20, 2026 by
catswe
Loading…
reorg support matrices for UX
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#1975
opened Mar 19, 2026 by
jcyang43
Loading…
Add support for Qwen3-VL model via Torchax path
#1974
opened Mar 19, 2026 by
muskansh-google
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.