Fix LongCat MLP tensor parallelism by ixxiii · Pull Request #29515 · sgl-project/sglang

ixxiii · 2026-06-27T13:47:06Z

Motivation

LongCat-Flash 2P4D dense did not support running with TP size = 1 because the MLP tensor-parallel path assumed TP sharding.
This PR fixes the dense MLP path so it can run correctly without tensor parallelism.

Tests

Successfully ran LongCat-Flash 2P4D with moe_dense_tp_size = 1.

CI States

Latest PR Test (Base): ❌ Run #28291065388
Latest PR Test (Extra): ❌ Run #28291065332

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

gemini-code-assist

Code Review

This pull request integrates support for fully data-parallel (DP) dense MoE execution in the LongcatFlash model. It updates LongcatFlashMLP to accept tp_rank and tp_size parameters, and configures them based on the status of enable_moe_dense_fully_dp(). When fully DP is enabled, tensor model parallel all-reduce operations are bypassed during the MLP forward pass. There are no review comments, so I have no additional feedback to provide.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

ixxiii · 2026-06-27T13:51:12Z

Hi @Fridge003 @ishandhanani @Qiaolin-Yu, this PR fixes LongCat-Flash dense MLP tensor parallelism for TP size = 1. Could you please take a look or help route it to the right reviewer when available?

ixxiii and others added 4 commits June 27, 2026 21:38

Fix LongCat MLP tensor parallelism

044ec82

Apply suggestion from @gemini-code-assist[bot]

74f81e5

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Apply suggestion from @gemini-code-assist[bot]

5b97fc3

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Apply suggestion from @gemini-code-assist[bot]

cf6fc5e

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

gemini-code-assist Bot reviewed Jun 27, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix LongCat MLP tensor parallelism#29515

Fix LongCat MLP tensor parallelism#29515
ixxiii wants to merge 4 commits into
sgl-project:mainfrom
ixxiii:fix_longcat_mlps_tp

ixxiii commented Jun 27, 2026 •

edited by github-actions Bot

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

ixxiii commented Jun 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

ixxiii commented Jun 27, 2026 • edited by github-actions Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Tests

CI States

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

ixxiii commented Jun 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ixxiii commented Jun 27, 2026 •

edited by github-actions Bot

Loading