[2/N] Pass model_config to the Attention constructors #38661
Open

MatthewBonanni wants to merge 5 commits into vllm-project:main
Conversation
Hi @MatthewBonanni, the pre-commit checks have failed. Please run:

    uv pip install "pre-commit>=4.5.1"
    pre-commit install
    pre-commit run --all-files

Then, commit the changes and push to your branch.
Contributor
Code Review
This pull request updates the model executor layers and attention mechanisms across various model implementations to explicitly pass the `model_config` object. It also introduces a mapping from `torch.dtype` to KV cache string representations in `vllm/utils/torch_utils.py` to ensure correct cache configuration when `cache_config` is not provided. I have no further feedback to provide.
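For context, a minimal sketch of what such a dtype-to-string mapping might look like; the dictionary name, the helper function, and the exact string values below are illustrative assumptions, not necessarily the names actually added to `vllm/utils/torch_utils.py`:

```python
import torch

# Hypothetical reverse mapping from torch.dtype to the KV cache dtype
# strings used in cache configuration; names and values are assumptions.
TORCH_DTYPE_TO_KV_CACHE_STR: dict[torch.dtype, str] = {
    torch.float32: "fp32",
    torch.float16: "fp16",
    torch.bfloat16: "bf16",
}


def kv_cache_dtype_str(dtype: torch.dtype) -> str:
    """Return the KV cache string for a dtype, e.g. torch.bfloat16 -> "bf16"."""
    try:
        return TORCH_DTYPE_TO_KV_CACHE_STR[dtype]
    except KeyError:
        raise ValueError(f"No KV cache string for dtype {dtype}") from None
```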
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Purpose
We already pass `cache_config` and `quant_config` as arguments to `Attention.__init__()`, but `model_config` is routinely grabbed from `get_current_vllm_config`. #38124 requires accessing `model_config.dtype` much more frequently, so this PR passes `model_config` as an argument to standardize and reduce reliance on `get_current_vllm_config`.
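As a rough illustration of the change (the real `Attention.__init__()` in vLLM takes many more parameters; this is a simplified sketch, not the actual signature):

```python
from typing import Optional

import torch
from torch import nn


class Attention(nn.Module):
    """Simplified sketch; the real vLLM Attention layer has more args."""

    def __init__(
        self,
        num_heads: int,
        head_size: int,
        scale: float,
        cache_config=None,
        quant_config=None,
        model_config=None,  # new: passed explicitly by callers
    ) -> None:
        super().__init__()
        self.num_heads = num_heads
        self.head_size = head_size
        self.scale = scale
        # Previously this would have been fetched inside the layer via
        # get_current_vllm_config(); now it arrives as an argument.
        self.dtype: Optional[torch.dtype] = (
            model_config.dtype if model_config is not None else None
        )
```

Call sites in the individual model implementations would then forward their model config when constructing the layer, rather than relying on the ambient global config.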
Test Plan

CI
Test Result
TBD
Essential Elements of an Effective PR Description Checklist
`supported_models.md` and `examples` for a new model.