Fix LoRA hot swapping #116

Open · wants to merge 2 commits into main
Conversation

@pbarker commented Apr 17, 2025

LoRA hot swapping is broken, as documented in unslothai/unsloth#2322.

The cause is an overly permissive regex: when loading multiple adapters, the first adapter is applied correctly, but the second adapter then targets layers belonging to the first adapter.

The main change is a tighter regex in get_peft_regex.
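
As a minimal sketch of the failure mode (the module names and patterns below are illustrative, not the actual output of get_peft_regex): once the first adapter is injected, peft wraps each targeted layer, so the wrapper's submodule names still contain the original layer name, and an unanchored pattern matches them too.

import re

# Illustrative module names after a first adapter named "adapter_one" has
# been injected; the wrapped layer's submodules still contain "q_proj".
modules = [
    "model.layers.0.self_attn.q_proj",                     # original layer
    "model.layers.0.self_attn.q_proj.base_layer",          # wrapped base layer
    "model.layers.0.self_attn.q_proj.lora_A.adapter_one",  # first adapter's weights
]

permissive = re.compile(r".*q_proj.*")   # matches anywhere in the name
tight = re.compile(r".*\.q_proj$")       # anchored: only the layer itself

print([m for m in modules if permissive.fullmatch(m)])  # all three names match
print([m for m in modules if tight.fullmatch(m)])       # only the original layer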

This PR also reformats some of the code to comply with ruff linting.

I tested this with Qwen2.5-VL 3B and everything appears to work.

@danielhanchen (Contributor) commented

Thanks for the PR - sorry for the delay! @rolandtannous or @mmathew23, could you check whether this logic works as expected? Appreciate it :)

@rolandtannous (Collaborator) commented May 4, 2025

@pbarker @danielhanchen

Test results

I just finished testing this, and the fix failed for me.
Setup: JupyterLab on a private VM with an A100 GPU, running the latest unsloth and unsloth_zoo with this fix manually applied to peft_utils.py.
The reason I think it fails is that load_adapter() is a peft library method, not an unsloth method, and get_peft_regex() is not called when load_adapter() is invoked.

When is get_peft_regex() actually called?

get_peft_regex() is called when get_peft_model is applied for FastVisionModel or FastLanguageModel.

In fact, if you look at the error stack trace in the testing notebook I've attached, you'll see that the call to load_adapter goes straight into the peft library; there is no intermediary call to get_peft_regex().
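
For reference, a minimal sketch of the path being tested (assuming model is the PeftModel from the repro; the checkpoint paths and adapter names are illustrative):

# load_adapter is a method of peft's PeftModel, so these calls go straight
# into the peft library and reuse the target_modules regex saved in each
# adapter's config; unsloth's get_peft_regex() is never invoked here.
model.load_adapter("./outputs/checkpoint-15", adapter_name="adapter_one")
model.load_adapter("./outputs/checkpoint-30", adapter_name="adapter_two")  # with the permissive regex, this injection also hits adapter_one's submodules
model.set_adapter("adapter_two")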

Test Notebook

Here is the end-to-end testing notebook I used, based on the code shared by @pbarker here:
test_notebook

Existing peft issue

This seems to be an issue with the peft library itself; it has been discussed a few times in the peft repo's GitHub issues. Examples:
huggingface/peft#957
huggingface/peft#2388

Workaround

There is a workaround for swapping LoRA adapters using the hotswap_adapter method (see the Hugging Face hotswap documentation).

The code would look something like this:

from unsloth import FastVisionModel
from peft.utils.hotswap import hotswap_adapter

model_id = "unsloth/Qwen2.5-VL-7B-Instruct"

base_model, model_processor = FastVisionModel.from_pretrained(
    model_id,
    load_in_4bit=False,
)
FastVisionModel.for_inference(base_model)

model = FastVisionModel.get_peft_model(
    base_model,
    finetune_vision_layers=True,  # False if not finetuning vision layers
    finetune_language_layers=True,  # False if not finetuning language layers
    finetune_attention_modules=True,  # False if not finetuning attention layers
    finetune_mlp_modules=True,  # False if not finetuning MLP layers
    r=8,  # The larger, the higher the accuracy, but might overfit
    lora_alpha=16,  # Recommended alpha == r at least
    lora_dropout=0.1,
    bias="none",
    random_state=3407,
    use_rslora=False,  # We support rank stabilized LoRA
    loftq_config=None,  # And LoftQ
    use_fast=True,
    # target_modules = "all-linear", # Optional now! Can specify a list if needed
)

hotswap_adapter(model, "./outputs/checkpoint-15", adapter_name="default")
hotswap_adapter(model, "./outputs/checkpoint-30", adapter_name="default")
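
One caveat, based on the peft documentation rather than the tests above: hotswap_adapter replaces the weights of the already-loaded adapter in place, so the checkpoints being swapped must be compatible with the loaded adapter (same PEFT method and same target layers; recent peft versions also provide prepare_model_for_compiled_hotswap in peft.utils.hotswap to handle differing ranks when the model is compiled).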

Here is an end-to-end notebook that illustrates the use of hotswap_adapter:
workaround_notebook
