
Conversation

HollowMan6 (Collaborator) commented Dec 22, 2025

What does this PR do?

Waiting for:

  • lora-performance
  • lora-critic-val-score
  • lora-actor-plus-rollout-mismatch

Checklist Before Starting

  • Search for similar PRs. Paste at least one query link here: ...
  • Format the PR title as [{modules}] {type}: {description} (This will be checked by the CI)
    • {modules} include fsdp, megatron, sglang, vllm, rollout, trainer, ci, training_utils, recipe, hardware, deployment, ray, worker, single_controller, misc, perf, model, algo, env, tool, ckpt, doc, data, cfg, reward
    • If this PR involves multiple modules, separate them with commas, e.g. [megatron, fsdp, doc]
    • {type} is one of feat, fix, refactor, chore, test
    • If this PR breaks any API (CLI arguments, config, function signature, etc.), add [BREAKING] to the beginning of the title.
    • Example: [BREAKING][fsdp, megatron] feat: dynamic batching

Test

For changes that cannot be tested by CI (e.g., algorithm implementation, new model support), validate with experiment(s) and show results such as training-curve plots, evaluation results, etc.

API and Usage Example

Demonstrate how the API changes, if any, and provide usage example(s) if possible.

# Add code snippet or script demonstrating how to use this
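As an illustrative sketch only (not the exact API added by this PR), an adapter-only refit on the rollout side could look like the following. The field names and the `add_lora` entry point are assumptions based on the TensorLoRARequest name in the title; only vLLM's `LoRARequest` import is a known API.

```python
# Illustrative sketch: field names and the refit entry point are assumptions,
# not necessarily the API introduced by this PR.
from vllm.lora.request import LoRARequest


class TensorLoRARequest(LoRARequest):
    """Hypothetical LoRARequest variant carrying LoRA tensors in memory,
    so the rollout engine can load a new adapter without reading from disk."""
    peft_config: dict = None
    lora_tensors: dict = None


def refit_adapter_only(llm, lora_tensors: dict, peft_config: dict, version: int):
    # Ship only the (small) LoRA A/B matrices; the base weights inside the
    # rollout engine stay untouched, so no merge or full-model sync is needed.
    request = TensorLoRARequest(
        lora_name=f"policy_v{version}",
        lora_int_id=version,
        lora_path="in_memory",        # placeholder: tensors are passed directly
        peft_config=peft_config,       # e.g. {"r": 32, "lora_alpha": 64, ...}
        lora_tensors=lora_tensors,     # {"...lora_A.weight": tensor, ...}
    )
    # Registering the request makes the new adapter selectable for generation.
    llm.llm_engine.add_lora(request)
```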

Design & Code Changes

Demonstrate the high-level design if this PR is complex, and list the specific changes.

Checklist Before Submitting

Important

Please check all of the following items before requesting a review; otherwise, the reviewer may deprioritize this PR.

HollowMan6 changed the title from "[megatron] feat: LoRA adapter only weight update" to "[megatron] feat: LoRA adapter only weight update (TensorLoRARequest)" on Dec 22, 2025
HollowMan6 changed the title from "[megatron] feat: LoRA adapter only weight update (TensorLoRARequest)" to "[megatron] feat: LoRA adapter only refit (TensorLoRARequest)" on Dec 22, 2025
gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request introduces a significant feature for LoRA adapter-only weight updates in the Megatron backend, which avoids the overhead of merging adapters into the base model for each weight synchronization. The changes are well structured and include updates to documentation, configuration handling for LoRA, new PEFT utility functions for vLLM compatibility, and refactored weight-export logic. The implementation appears solid and aligns with the stated goals. I have a couple of suggestions, concerning duplicated logic and a confusing condition, to improve code maintainability and clarity.
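To make the claimed saving concrete, here is a minimal, self-contained sketch of the adapter-only idea, using an HF model wrapped with peft purely for illustration; the PR itself operates on Megatron modules and converts weight names for vLLM, which is not reproduced here. The model name, rank, and target modules below are arbitrary choices, not taken from this PR.

```python
# Compare the size of the LoRA-only state dict against the full state dict:
# only the adapter tensors need to be shipped to the rollout engine each refit.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B-Instruct")
peft_model = get_peft_model(
    model,
    LoraConfig(r=8, lora_alpha=16, target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM"),
)

full = peft_model.state_dict()
lora_only = {k: v for k, v in full.items() if "lora_" in k}

print(f"adapter tensors: {len(lora_only)} / {len(full)}")
print(f"adapter params:  {sum(t.numel() for t in lora_only.values()):,} "
      f"vs full: {sum(t.numel() for t in full.values()):,}")
```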

HollowMan6 force-pushed the lora_adapters_update branch 4 times, most recently from f6706bc to 81d261e, on December 22, 2025 at 01:33
HollowMan6 force-pushed the lora_adapters_update branch 5 times, most recently from 23b8716 to 797dc7f, on December 28, 2025 at 14:01
gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request introduces a valuable feature for LoRA adapter-only refitting in the Megatron backend, which should provide significant performance benefits. The changes across documentation, configuration, and worker implementations appear to correctly support both merging LoRA adapters and loading them separately. My primary concern, detailed in a specific comment, is the repeated implementation of LoRA configuration logic across several files. Addressing this will improve the long-term maintainability of the codebase.
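As one concrete reading of the maintainability concern, the repeated LoRA configuration logic could be owned by a single shared helper. The sketch below is purely illustrative; the module layout, helper name, and config field names are assumptions, not verl's actual code.

```python
# Hypothetical shared helper: one place that translates the trainer's model
# config into a peft LoraConfig, so every worker and weight exporter agrees.
from peft import LoraConfig


def build_lora_config(model_cfg) -> LoraConfig:
    """Single source of truth for LoRA settings (field names are illustrative)."""
    return LoraConfig(
        r=model_cfg.lora_rank,
        lora_alpha=model_cfg.lora_alpha,
        target_modules=list(model_cfg.target_modules),
        bias="none",
        task_type="CAUSAL_LM",
    )
```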
