[Config] Do we need to avoid combinatorial number of configs?

Currently, for every llama SKU, we have 6 configs:
- LoRA single device
- LoRA distributed
- QLoRA single device
- QLoRA distributed (after FSDP2)
- Full distributed
- Full single device

We do not have DPO/PPO (I guess we probably should), but this could easily add another 6 configs (QLoRA, LoRA, Full) * (distributed, single).

If/when we support DoRA, would we add another 8? (DoRA + QDoRA)*(single, distributed) * (sft, dpo)

Not sure if this needs a solution. Posting it for the discussion. I thought of a few ideas, but i don't think I like them very much
1) Make QLoRA (and other variants) a parameter in LoRA configs?
2) Make model size a parameter? ``--sku llama8b``
3) Only keep single/distributed configs, and make PeFT an optional parameter. We could create a single peft.yaml with just the relevant parameters, shared by all models; 💩 
4) Make distributed a parameter? 💩 
5) Dont touch it 




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Config] Do we need to avoid combinatorial number of configs? #1282

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Config] Do we need to avoid combinatorial number of configs? #1282

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions