Fix: Align Qwen3 dtype handling with Qwen2 for TPU by xianglon-commits · Pull Request #1904 · vllm-project/tpu-inference

xianglon-commits · 2026-03-11T03:47:31Z

This PR fixes a ValueError caused by a dtype mismatch (bfloat16 vs. float32) in the TPU attention kernels when running the Qwen3 model. The issue was encountered when using Tunix with the vLLM-JAX backend on the lance-ds branch. The kernel reported:

Expected kv_cache.dtype=dtype(bfloat16) to be equal to k.dtype=dtype(bfloat16) and v.dtype=dtype(bfloat16), but found v.dtype=dtype('float32').
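
For readers unfamiliar with how a float32 value can reach a bfloat16 KV cache, here is a minimal sketch of one plausible path, assuming standard JAX type promotion: a projection weight left in float32 promotes a bfloat16 activation to float32. The `check_kv_dtypes` helper and all array names are hypothetical stand-ins for illustration. They are not the actual kernel code or the Qwen3 layers in tpu_inference.

```python
import jax.numpy as jnp


def check_kv_dtypes(kv_cache, k, v):
    """Hypothetical stand-in for the kernel's dtype validation."""
    if not (kv_cache.dtype == k.dtype == v.dtype):
        raise ValueError(
            f"Expected kv_cache.dtype={kv_cache.dtype} to be equal to "
            f"k.dtype={k.dtype} and v.dtype={v.dtype}"
        )


# Activations and the KV cache are bfloat16, as intended on TPU.
x = jnp.ones((2, 8), dtype=jnp.bfloat16)
kv_cache = jnp.zeros((2, 4), dtype=jnp.bfloat16)

# Assumption: one projection weight was created in float32 because the
# intended bfloat16 dtype was not propagated to the parameter.
w_f32 = jnp.ones((8, 4), dtype=jnp.float32)
w_bf16 = w_f32.astype(jnp.bfloat16)

k = x @ w_bf16       # bfloat16 @ bfloat16 stays bfloat16
v_bad = x @ w_f32    # bfloat16 @ float32 is promoted to float32
v_good = x @ w_bf16  # parameters in bfloat16 keep the output in bfloat16

check_kv_dtypes(kv_cache, k, v_good)  # passes
try:
    check_kv_dtypes(kv_cache, k, v_bad)
except ValueError as err:
    print(err)  # reports the mismatched v dtype
```

If this is indeed the mechanism, ensuring the parameter dtype is bfloat16 at construction time keeps k and v consistent with the cache without extra casts in the attention path.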

Following the pattern used to fix similar issues in qwen2.py (as suggested by the Tunix team, see https://screenshot.googleplex.com/8AwrZ44yv57Bt95), this change renames the dtype parameter to param_dtype within the __init__ methods of layers in tpu_inference/models/jax/qwen3.py.

This change aims to prevent potential naming conflicts and ensure the intended bfloat16 data type is consistently propagated and used for model parameters on TPU, resolving the mixed-precision error.

Aligned Qwen3 model definition with Qwen2 fixes by renaming __init__ parameter 'dtype' to 'param_dtype' to ensure correct bfloat16 handling on TPU, as suggested.

kyuyeunk · 2026-03-11T04:00:54Z

Would this be related to PR #1771?

wang2yn84 and others added 3 commits March 3, 2026 18:31

Fix nnx for qwen2.

e4c74bc

Disable deepcopy.

a9c9805

Fix Qwen3 dtype to param_dtype for TPU compatibility

f41b426

Aligned Qwen3 model definition with Qwen2 fixes by renaming __init__ parameter 'dtype' to 'param_dtype' to ensure correct bfloat16 handling on TPU, as suggested.

xianglon-commits requested review from jrplatin, kyuyeunk, mrjunwan-lang, sixiang-google, vipannalla and wenxindongwork as code owners March 11, 2026 03:47

xianglon-commits wants to merge 3 commits into vllm-project:main from xianglon-commits:fix/qwen3-dtype-handling
