rmsaddbias #5012
base: main
Conversation
Signed-off-by: zxwang <[email protected]>
Code Review
This pull request introduces a Triton kernel for RMS normalization with bias addition support. The implementation contains a critical bug in the kernel launch configuration that leads to incorrect computation for inputs with more than 48 rows. I've also pointed out a second issue: passing `None` to the kernel for the residual strides, which can raise a runtime error. I've provided suggestions to fix both issues.
vllm_ascend/ops/layernorm.py
Outdated
```python
# _, num_vectorcore = get_device_properties()
num_vectorcore = 48
grid = (M if M < num_vectorcore else num_vectorcore,)
```
The grid for launching the Triton kernel is incorrectly calculated: it is capped at a hardcoded `num_vectorcore` value of 48. The `rms_norm_fwd_kernel` processes one row per program instance, so with the current grid calculation, an input tensor with more than 48 rows has only its first 48 rows processed, leaving incorrect output for the remaining rows. This is a critical correctness bug.
To process all `M` rows, the grid size must be `(M,)`. The hardcoded `num_vectorcore` and the logic that uses it should be removed.
```diff
-# _, num_vectorcore = get_device_properties()
-num_vectorcore = 48
-grid = (M if M < num_vectorcore else num_vectorcore,)
+grid = (M,)
```
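The effect of the capped grid can be illustrated with a plain-Python simulation of the launch (a hypothetical sketch, not the actual Triton kernel): each program instance handles exactly one row, so any grid smaller than `M` leaves rows untouched.

```python
def simulate_launch(grid_size: int, M: int) -> list:
    """Simulate a launch where program instance `pid` processes row `pid`.

    With a grid smaller than M, rows with index >= grid_size are never
    visited, mirroring the bug when the grid is capped at 48.
    """
    processed = [False] * M
    for pid in range(grid_size):  # one program instance per grid slot
        processed[pid] = True     # the real kernel would normalize row pid
    return processed

M = 64
num_vectorcore = 48  # the hardcoded cap from the original code
capped = simulate_launch(min(M, num_vectorcore), M)
full = simulate_launch(M, M)
print(sum(capped), sum(full))  # 48 64 -> 16 rows silently skipped
```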
vllm_ascend/ops/layernorm.py
Outdated
```python
residual.stride(0) if residual is not None else None,
residual_out.stride(0) if residual is not None else None,
```
When `residual` is `None`, `None` is passed to the `stride_z_row` and `stride_z_out_row` arguments of the Triton kernel. Triton's JIT compiler expects numerical values for non-pointer arguments and will likely raise a `TypeError` when it receives `None`. You should pass a dummy integer value like `0` instead. These arguments are not used when `residual` is `None` because of the `HAS_Z` conditional, so `0` is a safe value.
```diff
-residual.stride(0) if residual is not None else None,
-residual_out.stride(0) if residual is not None else None,
+residual.stride(0) if residual is not None else 0,
+residual_out.stride(0) if residual is not None else 0,
```
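The safety of the dummy `0` can be shown with a toy stand-in for the kernel's argument handling (hypothetical names, not the real kernel signature): when `HAS_Z` is false the stride is never used, while `None` breaks as soon as arithmetic touches it.

```python
def kernel_stub(stride_z_row, HAS_Z: bool) -> int:
    """Toy stand-in for the kernel's stride handling (hypothetical sketch).

    When HAS_Z is False the stride is never dereferenced, so a dummy 0
    is harmless. Triton's JIT additionally type-specializes scalar
    arguments, which is why the real kernel needs an int, not None.
    """
    if HAS_Z:
        return 2 * stride_z_row  # offset arithmetic requires an int
    return -1  # residual path skipped; stride argument is ignored

print(kernel_stub(0, HAS_Z=False))   # dummy 0 is never touched
print(kernel_stub(128, HAS_Z=True))  # real stride is used
try:
    kernel_stub(None, HAS_Z=True)    # None fails once it is used
except TypeError as exc:
    print("TypeError:", exc)
```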
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to the Contributing and Testing guides.
Signed-off-by: zxwang <[email protected]>
What this PR does / why we need it?
Does this PR introduce any user-facing change?
How was this patch tested?