This PR adds two new pieces of functionality: LoRA and MoE adapters conditioned on task-specific embeddings. These are meant to be learned during finetuning.
- **LoRA adapters**: a drop-in replacement for the attention output projection weights, conditioned on downstream tasks. This is meant to be used during finetuning. From this paper it seems like finetuning only these projection weights gets you most of the way there compared to full finetuning. Given a task embedding `E` for each downstream task, the `TaskLoRALinear` layer computes a LoRA update (i.e. two matrices `A: D x r`, `B: r x D`) whose product is added to the original projection weights. Both `A` and `B` are computed via an MLP on top of `E`. This MLP is shared across tasks but different per layer, and is learned directly during finetuning.
- **MoE adapters**: instead of computing `FFN(x)` in each Transformer block, adds a soft MoE adapter so that the pre-LayerScale output is `Linear(FFN(x) + MoE(x))`. Optionally, MoE adapters compute expert combine weights (i.e., deciding which experts to use per token) by conditioning on batch-level task embeddings instead of on token-level embeddings.
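To make the LoRA mechanism concrete, here is a minimal PyTorch sketch of a task-conditioned `TaskLoRALinear`. The class name matches the PR, but every shape, the MLP architecture, and the hypernetwork layout are assumptions for illustration, not the actual implementation:

```python
import torch
import torch.nn as nn


class TaskLoRALinear(nn.Module):
    """Linear projection whose weight gets a task-conditioned low-rank
    (LoRA) update. Shapes and the hyper-MLP layout are assumptions."""

    def __init__(self, dim: int, rank: int, task_emb_dim: int):
        super().__init__()
        self.dim, self.rank = dim, rank
        # original projection weights (would typically be frozen during finetuning)
        self.proj = nn.Linear(dim, dim)
        # per-layer MLP (shared across tasks) mapping a task embedding E
        # to the flattened LoRA factors A (dim x rank) and B (rank x dim)
        self.hyper = nn.Sequential(
            nn.Linear(task_emb_dim, 4 * task_emb_dim),
            nn.GELU(),
            nn.Linear(4 * task_emb_dim, 2 * dim * rank),
        )

    def forward(self, x: torch.Tensor, task_emb: torch.Tensor) -> torch.Tensor:
        # task_emb: (task_emb_dim,) embedding of the current downstream task
        ab = self.hyper(task_emb)
        A = ab[: self.dim * self.rank].view(self.dim, self.rank)
        B = ab[self.dim * self.rank:].view(self.rank, self.dim)
        delta = A @ B  # (dim, dim) low-rank update to the projection weight
        # equivalent to applying (W + delta) since nn.Linear computes x @ W.t()
        return self.proj(x) + x @ delta.t()


# toy usage with assumed sizes
dim, rank, task_emb_dim = 16, 4, 8
layer = TaskLoRALinear(dim, rank, task_emb_dim)
out = layer(torch.randn(2, 5, dim), torch.randn(task_emb_dim))
print(tuple(out.shape))  # (2, 5, 16)
```

Because the hyper-MLP rather than `A` and `B` themselves is the learned object, a single set of per-layer parameters can produce a different low-rank update for every task embedding.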
The actual MoE implementation is mostly from this reference implementation and can be found in `helios.nn.moe`.

To accommodate these changes, there are a few extra arguments added to `EncoderConfig`, `Encoder`, `FlexiHeliosBase`, etc., all the way down to the base `helios.nn.attention.Attention` layers. Additionally, there is a new argument `task_emb` added to the forward pass of `Encoder`. I considered subclassing `Encoder` (i.e. something like `EncoderWithTaskEmbeds`) but decided it was simpler to just add a few extra arguments directly.