[WIP] Optimize GDN chunk_fwd_o_kernel performance by YangQun1 · Pull Request #297 · vllm-project/vllm-xpu-kernels

YangQun1 · 2026-04-21T01:56:25Z

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS ABOVE HAVE BEEN CONSIDERED.

Purpose

Optimize chunk_fwd_o_kernel performance:

use native exp
merge WS and QS gemm to reuse S tensor in register

Test Plan

Test Result

(Optional) Documentation Update

BEFORE SUBMITTING, PLEASE READ https://docs.vllm.ai/en/latest/contributing (anything written below this line will be removed by GitHub Actions)

Signed-off-by: yangqun <qun.yang@intel.com>

wuxun-zhang mentioned this pull request Apr 21, 2026

Qwen3.5 support and optimization plan #172

Open

7 tasks

YangQun1 and others added 5 commits April 23, 2026 13:12

add benchmark script

af9fb6e

Signed-off-by: yangqun <qun.yang@intel.com>

fuse qs and ws to reuse s tensor in register

d0f5f29

Signed-off-by: yangqun <qun.yang@intel.com>

use native exp

5899a7f

Signed-off-by: yangqun <qun.yang@intel.com>

minor change

bca30bd

Signed-off-by: yangqun <qun.yang@intel.com>

fix

ca40e0e

Signed-off-by: yangqun <qun.yang@intel.com>

YangQun1 force-pushed the dev/fwd_o_opt branch from 9d84889 to ca40e0e Compare April 24, 2026 04:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Optimize GDN chunk_fwd_o_kernel performance#297

[WIP] Optimize GDN chunk_fwd_o_kernel performance#297
YangQun1 wants to merge 5 commits intovllm-project:mainfrom
YangQun1:dev/fwd_o_opt

YangQun1 commented Apr 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

YangQun1 commented Apr 21, 2026

Essential Elements of an Effective PR Description Checklist

Purpose

Test Plan

Test Result

(Optional) Documentation Update

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant