[Bugfix] Fix precision issues in moe_mlp (vllm-ascend main) #5025

Clorist33 · 2025-12-15T07:45:14Z

What this PR does / why we need it?

Use group_list[0] to replace group_diff[0] in function "cumsum_group_list" (moe_mlp.py).
The purpose is to modify it to the correct logic of converting cumsum to count.

Does this PR introduce any user-facing change?

No

vLLM version: v0.12.0
vLLM main: vllm-project/vllm@ad32e3e

Signed-off-by: tanqingshan (A) <[email protected]> Signed-off-by: tanqingshan (A) <[email protected]>

gemini-code-assist

Code Review

This pull request correctly fixes a bug in the cumsum_group_list function by changing group_diff[0] to group_list[0]. This ensures the logic for converting a cumulative sum list to a difference list is correct. The change is accurate and addresses the intended issue. I've also added a comment to handle a potential IndexError on an empty tensor, which would improve the robustness of the function.

gemini-code-assist · 2025-12-15T07:46:53Z

vllm_ascend/ops/fused_moe/moe_mlp.py

    if src_list_type == 0 and dst_list_type == 1:
        group_diff = torch.diff(group_list)
-        new_group = torch.cat([group_diff[0].unsqueeze(0), group_diff], dim=0)
+        new_group = torch.cat([group_list[0].unsqueeze(0), group_diff], dim=0)
        return new_group


There's a potential IndexError here if group_list is an empty tensor. Accessing group_list[0] on line 49 would cause a crash. It's good practice to handle this edge case, for example by checking if the tensor is empty before proceeding.

Suggested change

if src_list_type == 0 and dst_list_type == 1:

group_diff = torch.diff(group_list)

new_group = torch.cat([group_diff[0].unsqueeze(0), group_diff], dim=0)

new_group = torch.cat([group_list[0].unsqueeze(0), group_diff], dim=0)

return new_group

if src_list_type == 0 and dst_list_type == 1:

if not group_list.numel():

return group_list

group_diff = torch.diff(group_list)

new_group = torch.cat([group_list[0].unsqueeze(0), group_diff], dim=0)

return new_group

github-actions · 2025-12-15T08:53:18Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

use group_list[0] to replace group_diff[0]

0528d70

Signed-off-by: tanqingshan (A) <[email protected]> Signed-off-by: tanqingshan (A) <[email protected]>

gemini-code-assist bot reviewed Dec 15, 2025

View reviewed changes

Clorist33 changed the title ~~[Bugfix]use group_list[0] to replace group_diff[0] in moe_mlp (vllm-ascend main)~~ [Bugfix] Fix precision issues in moe_mlp (vllm-ascend main) Dec 15, 2025

github-actions bot added the module:ops label Dec 15, 2025

weijinqian0 added ready read for review ready-for-test start test by label for PR labels Dec 15, 2025

weijinqian0 approved these changes Dec 15, 2025

View reviewed changes

Clorist33 force-pushed the bugfix_group_list_main branch from ea908c0 to 0528d70 Compare December 15, 2025 10:11

Clorist33 mentioned this pull request Dec 15, 2025

[UT]Ut for function cumsum_group_list in main (ref #5025) #5036

Open

wangxiyuan merged commit d43cabc into vllm-project:main Dec 16, 2025
67 of 73 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Bugfix] Fix precision issues in moe_mlp (vllm-ascend main) #5025

[Bugfix] Fix precision issues in moe_mlp (vllm-ascend main) #5025

Clorist33 commented Dec 15, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Dec 15, 2025

Uh oh!

github-actions bot commented Dec 15, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Bugfix] Fix precision issues in moe_mlp (vllm-ascend main) #5025

[Bugfix] Fix precision issues in moe_mlp (vllm-ascend main) #5025

Conversation

Clorist33 commented Dec 15, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Dec 15, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Dec 15, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Clorist33 commented Dec 15, 2025 •

edited by github-actions bot

Loading