ROCm / vllm Public

forked from vllm-project/vllm

Notifications You must be signed in to change notification settings
Fork 49
Star 122

Code
Issues 2
Pull requests 40
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security and quality
Insights

Pull requests: ROCm/vllm

Labels 16 Milestones 0

New pull request New

40 Open 938 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[ROCm] Fix dynamic lm_head INT8 applying to non-lm_head embeddings

#1016 opened Jun 22, 2026 by marcusr-amd

Loading…

[ROCm][MoE] Custom W4A16 MoE prefill WMMA GEMM for gfx11 (default-on)

#1015 opened Jun 22, 2026 by roberteg16

Loading…

Merge upstream→gfx11

#1014 opened Jun 22, 2026 by eble-amd • Draft

Revert #867: stop force-enabling the fused RMSNorm custom op (+rms_norm) on gfx11

#1013 opened Jun 22, 2026 by parthash0804

Loading…

5 tasks

optimization qwen3-vl-4b TTFT for gfx1150 with 2 448x448 image and 256 text token input

#1012 opened Jun 22, 2026 by qingxuamd

Loading…

[EXPERIMENT][ROCm][W4A16] Cache dequantized bf16 weights for prefill GEMM

#1007 opened Jun 15, 2026 by roberteg16 • Draft

2 of 5 tasks

optimize TTFT qwen3-vl

#1006 opened Jun 15, 2026 by qingxuamd

Loading…

455 war room findings

#1001 opened Jun 12, 2026 by jpvillam-amd

Loading…

MoE: Grouped Triton GEMM for TTFT improvements

#970 opened May 26, 2026 by mgehre-amd • Draft

[ROCm][MoE] Modular MoE: alias fused_out with output to skip finalize copy

#940 opened May 19, 2026 by mgehre-amd

Loading…

2 tasks done

feat: Add NPU+GPU async pipelining for vision-language models

#936 opened May 14, 2026 by liangliangchang • Draft

4 of 5 tasks

Annotate VLM/audio tower nn.Linear calls in PyTorch profiles

#934 opened May 13, 2026 by mgehre-amd

Loading…

Marcusr/aiesw 32176 w4a16 ck wmma

#930 opened May 8, 2026 by marcusr-amd • Draft

3 of 5 tasks

[ROCm][quant] INC: route w4a16-sym MoE through HybridW4A16 HIP path

#929 opened May 8, 2026 by mgehre-amd • Draft

5 tasks

[bench] wvSplitK skinny GEMM: capture timed iters into a CUDA graph

#928 opened May 8, 2026 by mgehre-amd • Draft

Hybrid

#918 opened May 4, 2026 by liangliangchang • Draft

5 tasks

Auto-build flash-attn wheels on push, upload to S3

#910 opened Apr 30, 2026 by mgehre-amd • Draft

1 task

[ROCm][DSv4] Share AITER decode dequant + fp8-cast buffers across layers (rebased, stacked on #902)

#903 opened Apr 27, 2026 by ChuanLi1101 • Draft

2 of 4 tasks

[ROCm][DSv4] Make AITER sparse decode cudagraph-clean (rebased, stacked on #901)

#902 opened Apr 27, 2026 by ChuanLi1101 • Draft

2 of 5 tasks

[ROCm][DSv4] AITER-accelerated MLA decode for DeepSeek V4 on MI355X (rebased on tj/dsv4prrebase)

#901 opened Apr 27, 2026 by ChuanLi1101 • Draft

1 of 4 tasks

[Do Not Merge] For review purpose: Rocm/aiter mla dsv4 decode cudagraph

#900 opened Apr 26, 2026 by tjtanaavllm • Draft

5 tasks

[ROCm] support topk_softplus for all number of experts

#899 opened Apr 25, 2026 by tjtanaa

Loading…

5 tasks

Tune hybrid_triton_w4a16 prefill kernel for gfx1151

#879 opened Apr 15, 2026 by mgehre-amd • Draft

3 tasks done

Enable FLASH_ATTN backend with upstream flash-attn CK on ROCm for decode

#866 opened Apr 10, 2026 by mgehre-amd • Draft

1 task

update rocm version truncate stale

#864 opened Mar 16, 2026 by kiran-thumma

Loading…

Previous 1 2 Next

Previous Next

ProTip! Type g i on any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!