forked from vllm-project/vllm
Pull requests: ROCm/vllm
[355_wip] triton fusion optimized fused_shared_experts
#741 opened Oct 17, 2025 by k50112113
[WIP] Support persistent MLA for ROCm MLA backend
#739 opened Oct 16, 2025 by ganyi1996ppo
5 tasks
[355_wip] [triton] fuse bf16_gemm_reduce_kernel + rope_kv_cache
#730 opened Oct 9, 2025 by k50112113
[FEAT] Add support for AITER bpreshuffle block scale gemm
#717 opened Sep 27, 2025 by tjtanaavllm
5 tasks
[Perf] refactor attention backend for perf boost
#713 opened Sep 26, 2025 by ganyi1996ppo
5 tasks
[355_wip] Let dynamo capture rms/silu_mul+f4gemm pattern
#705 opened Sep 24, 2025 by xytpai
[ROCm] Add allreduce dispatcher for ROCm device
#704 opened Sep 24, 2025 by zejunchen-zejun
[ROCm] Add allreduce dispatcher for ROCm device
#695 opened Sep 18, 2025 by zejunchen-zejun
[ROCm] warpSize is being made non constexpr in ROCm 7.0 (#20330)
#694 opened Sep 18, 2025 by xudonlyu
[355_wip] Let inductor capture silu+mul+quant pattern and replace them with aiter operator
#669 opened Sep 11, 2025 by xytpai
support ck-tile fused bias gemm for rocm unquantized gemm
#668 opened Sep 11, 2025 by eliotwang
add fp8 gemm path choice for rocm_aiter_gemm_w8a8_blockscale
#659 opened Sep 8, 2025 by zhuyuhua-v