[CPU] Support head_size 512 in cpu_attn by bigPYJ1151 · Pull Request #38676 · vllm-project/vllm

bigPYJ1151 · 2026-04-01T02:26:55Z

Purpose

Add 512 head_size support for in-coming models.

Test Plan

CI tests

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

mergify · 2026-04-01T02:27:34Z

Documentation preview: https://vllm--38676.org.readthedocs.build/en/38676/

gemini-code-assist

Code Review

This pull request adds support for a head size of 512 to the CPU attention backend. The changes include updating the dispatch generation script, documentation, unit tests, and the backend's supported head sizes list. I have no feedback to provide.

Signed-off-by: jiang1.li <jiang1.li@intel.com>

bigPYJ1151 requested review from LucasWilkinson, MatthewBonanni, WoosukKwon, mgoin, tlrmchlsmth and yewentao256 as code owners April 1, 2026 02:26

mergify bot added documentation Improvements or additions to documentation cpu Related to CPU backends v1 labels Apr 1, 2026

bigPYJ1151 added the ready ONLY add when PR is ready to merge/full CI is needed label Apr 1, 2026

gemini-code-assist bot reviewed Apr 1, 2026

View reviewed changes

jikunshang approved these changes Apr 1, 2026

View reviewed changes

support

88f7d46

Signed-off-by: jiang1.li <jiang1.li@intel.com>

bigPYJ1151 force-pushed the attn_512 branch from 9d9ddb2 to 88f7d46 Compare April 1, 2026 03:55

Isotr0py approved these changes Apr 1, 2026

View reviewed changes

Isotr0py enabled auto-merge (squash) April 1, 2026 04:15

Isotr0py merged commit 36d7f19 into vllm-project:main Apr 1, 2026
66 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CPU] Support head_size 512 in cpu_attn#38676

[CPU] Support head_size 512 in cpu_attn#38676
Isotr0py merged 1 commit intovllm-project:mainfrom
bigPYJ1151:attn_512

bigPYJ1151 commented Apr 1, 2026 •

edited by github-actions bot

Loading

Uh oh!

mergify bot commented Apr 1, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

bigPYJ1151 commented Apr 1, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

mergify bot commented Apr 1, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

bigPYJ1151 commented Apr 1, 2026 •

edited by github-actions bot

Loading