Enable "apply_lora_to_output" in models with tied embedding #1960

Open
@felipemello1

Description

Many model families with small models (<=3B parameters) have tied embeddings, meaning that the output projection uses the same weight as the input tok_embeddings. Examples are gemma, qwen, and llama 3.2.

These models currently don't support "apply_lora_to_output". This is because, in the past, we passed a lambda function as the output_proj, e.g. lambda x: x @ tok_embeddings.weight.

Recently, we changed this and started passing the TiedLinear module instead. We need TiedLinear so that it works well with FSDP and other techniques.
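
For context, a tied output projection is roughly the following: the module owns no weight of its own and reuses the embedding weight for the final projection. This is a minimal sketch, not necessarily torchtune's exact TiedLinear implementation:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class TiedLinear(nn.Module):
    # Sketch of a tied output projection: it has no weight of its own
    # and reuses the token embedding weight to produce the logits.
    def __init__(self, tied_module: nn.Embedding):
        super().__init__()
        self.tied_module = tied_module

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # [batch, seq, embed_dim] -> [batch, seq, vocab_size]
        return F.linear(x, self.tied_module.weight)
```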

This task is to enable LoRA on top of this TiedLinear, like we already do for nn.Linear in models that do not have tied embeddings, e.g. llama 3.1. A rough sketch of what this could look like is below.
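
One possible shape for this, sketched under the assumption that the adapter only needs to add a trainable low-rank update on top of the frozen tied projection (the class and argument names are illustrative, not an existing torchtune API):

```python
import math

import torch
import torch.nn as nn
import torch.nn.functional as F


class LoRATiedLinear(nn.Module):
    # Illustrative LoRA adapter over a tied output projection: the base
    # path projects with the (frozen) embedding weight, while only the
    # low-rank lora_a/lora_b matrices are trained.
    def __init__(
        self,
        tied_module: nn.Embedding,
        rank: int,
        alpha: float,
        dropout: float = 0.0,
    ):
        super().__init__()
        self.tied_module = tied_module
        embed_dim = tied_module.embedding_dim
        vocab_size = tied_module.num_embeddings
        self.scaling = alpha / rank
        self.dropout = nn.Dropout(p=dropout)
        # Low-rank decomposition of the update to the output projection
        self.lora_a = nn.Linear(embed_dim, rank, bias=False)
        self.lora_b = nn.Linear(rank, vocab_size, bias=False)
        nn.init.kaiming_uniform_(self.lora_a.weight, a=math.sqrt(5))
        nn.init.zeros_(self.lora_b.weight)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Frozen base path: project hidden states with the tied embedding weight
        base = F.linear(x, self.tied_module.weight)
        # Trainable low-rank update, scaled by alpha / rank
        lora_update = self.lora_b(self.lora_a(self.dropout(x))) * self.scaling
        return base + lora_update
```

With something like this in place, the model builders could swap TiedLinear for the LoRA variant when "apply_lora_to_output" is set, the same way they already swap nn.Linear for a LoRA linear in models without tied embeddings.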

After adding this feature, the configs for the llama 3.2, qwen, and gemma models have to be updated to include the flag.

Metadata


    Labels

    community help wanted: We would love the community's help completing this issue
