
Conversation

Contributor

@HollowMan6 commented Dec 22, 2025

Purpose

This PR fixes weight loading when LoRA is enabled, i.e., when LoRA inserts base_layer into expert weight names:

model.layers.0.mlp.experts.0.up_proj.weight -> model.layers.0.mlp.experts.0.up_proj.base_layer.weight

Before this fix, the patched code remapped this name to
model.layers.0.mlp.experts.w13_base_layer.weight, which is wrong; it
should actually be model.layers.0.mlp.experts.base_layer.w13_weight.
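The remapping described above can be sketched as a small helper. This is a hypothetical illustration of the intended transformation, not vLLM's actual implementation; the function name, the regex, and the gate/up → w13, down → w2 fusion table are assumptions based on the FusedMoE naming convention described in this PR.

```python
import re

# Hypothetical sketch: fused expert weight names per FusedMoE convention
# (gate_proj/up_proj fuse into w13, down_proj into w2). Assumption, not
# the real vLLM code.
_FUSED = {"gate_proj": "w13", "up_proj": "w13", "down_proj": "w2"}


def remap_expert_weight_name(name: str) -> str:
    """Move ``base_layer`` in front of the fused expert weight name.

    e.g. ``model.layers.0.mlp.experts.0.up_proj.base_layer.weight``
    ->   ``model.layers.0.mlp.experts.base_layer.w13_weight``
    """
    m = re.match(r"(.*\.experts)\.\d+\.(\w+)\.base_layer\.weight$", name)
    if m is None:
        # Not a LoRA-wrapped expert weight; leave the name untouched.
        return name
    prefix, proj = m.groups()
    return f"{prefix}.base_layer.{_FUSED[proj]}_weight"
```

The key point is that base_layer must be attached to the experts module (where the LoRA wrapper lives), not fused into the projection name.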

Test Plan

Test on Qwen3 30B A3B

Test Result

Looks good.


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.


@mergify mergify bot added deepseek Related to DeepSeek models llama Related to Llama models qwen Related to Qwen models gpt-oss Related to GPT-OSS models speculative-decoding labels Dec 22, 2025
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request addresses a bug in weight loading for FusedMoE layers when LoRA is enabled. The changes correctly handle the base_layer component in weight names. The core logic is adjusted in make_expert_params_mapping, and this fix is propagated by adding an is_lora_enabled flag to this function, which is then passed from various model definitions. The overall approach is sound and the widespread changes are necessary boilerplate to support the fix. I have one suggestion to improve the robustness of the string formatting to prevent potential issues with certain model configurations.

@jeejeelee jeejeelee self-assigned this Dec 22, 2025
@HollowMan6 HollowMan6 force-pushed the lora_base_layer branch 2 times, most recently from 00c09c7 to f9008c9 on December 22, 2025 12:36
@HollowMan6 HollowMan6 changed the title [BugFix] LoRA: FusedMoE make_expert_params_mapping supports base_layer [BugFix] LoRA: Support loading base_layer of experts Dec 22, 2025
Member

@hmellor hmellor left a comment


We should not be duplicating this code in every model. It should be abstracted to a util.

Also, please make sure that the fix is also applied to

@HollowMan6
Contributor Author

@hmellor Thanks for reviewing, now this is changed as requested!

cc: @jeejeelee

Collaborator

@jeejeelee jeejeelee left a comment


LGTM once CI is green

@jeejeelee jeejeelee added the ready ONLY add when PR is ready to merge/full CI is needed label Dec 25, 2025
@jeejeelee jeejeelee enabled auto-merge (squash) December 25, 2025 07:44
@HollowMan6
Contributor Author

Thank you @jeejeelee! All CI checks have now passed, but auto-merge (squash) is not merging it; this may need some manual intervention.

@jeejeelee jeejeelee disabled auto-merge December 26, 2025 00:35
@jeejeelee jeejeelee enabled auto-merge (squash) December 26, 2025 00:36
@jeejeelee
Collaborator

cc @hmellor

Copilot AI review requested due to automatic review settings December 30, 2025 01:32
Contributor

Copilot AI left a comment


Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

@HollowMan6 HollowMan6 requested a review from Copilot December 30, 2025 11:37
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request introduces a fix for loading LoRA weights for experts in MoE models. The issue with incorrect weight name remapping when a base_layer is present is addressed by a new helper function, remap_expert_weight_name. This function correctly handles the insertion of base_layer into the parameter name. The fix has been consistently applied across numerous model files, replacing the simple string replacement with the new, more robust logic. The implementation of the new function is sound and correctly addresses the described bug. The changes are well-contained and improve the LoRA support for MoE models. Overall, this is a good and necessary bug fix.
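To illustrate how a helper like remap_expert_weight_name (named in the review above) might slot into weight loading, here is a simplified, hypothetical sketch. The loader structure and parameter-dict names are illustrative assumptions, not vLLM's actual load_weights interface.

```python
import re

# Hypothetical fused-name table and remapper, assumed for illustration only.
_FUSED = {"gate_proj": "w13", "up_proj": "w13", "down_proj": "w2"}


def remap_expert_weight_name(name: str) -> str:
    # Rewrite ...experts.<idx>.<proj>.base_layer.weight to
    # ...experts.base_layer.<fused>_weight; leave other names alone.
    m = re.match(r"(.*\.experts)\.\d+\.(\w+)\.base_layer\.weight$", name)
    if m is None:
        return name
    return f"{m.group(1)}.base_layer.{_FUSED[m.group(2)]}_weight"


def load_expert_weights(checkpoint_weights, params_dict):
    """Toy loader: remap each checkpoint name, then match it against the
    module's parameter dict (a stand-in for the real weight_loader calls)."""
    loaded = []
    for name, tensor in checkpoint_weights:
        name = remap_expert_weight_name(name)
        if name in params_dict:
            params_dict[name] = tensor
            loaded.append(name)
    return loaded
```

Without the remapping step, the LoRA-wrapped checkpoint name would never match the fused parameter name and the weight would be silently skipped.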

@chatgpt-codex-connector

Codex Review: Didn't find any major issues. 🎉


Contributor

Copilot AI left a comment


Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

@jeejeelee
Collaborator

@hmellor could you please take another look?

