
Distributed fine-tuning of LLMs #49

Open
@Shreyanand

Description


The fine-tuning notebook uses 1 GPU and the LoRA technique to fine-tune a T5 model with 3B parameters. The task for this issue is to fine-tune the same model (or its 7B variant) on multiple GPU nodes. Use InstaScale and CodeFlare to schedule the training job and retrieve the fine-tuned model, and create a notebook that demos this.
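For context on why LoRA keeps single-GPU fine-tuning of a 3B model feasible, the core idea can be sketched in plain Python (no framework assumed): the full weight matrix W stays frozen, and only two small low-rank matrices A and B are trained, with the effective weight being W + (alpha / r) * B @ A. The function and variable names below are illustrative, not from the notebook.

```python
# Minimal sketch of the LoRA update rule, independent of any framework.
# W is the frozen pretrained weight (d x k); only B (d x r) and A (r x k)
# are trained, where the rank r is much smaller than d and k.

def matmul(X, Y):
    # plain-Python matrix multiply, for illustration only
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

def lora_effective_weight(W, A, B, alpha, r):
    # W_eff = W + (alpha / r) * (B @ A)
    delta = matmul(B, A)
    scale = alpha / r
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, delta)]

# toy example: d = 2, k = 2, rank r = 1
W = [[1.0, 0.0],
     [0.0, 1.0]]
B = [[1.0],
     [2.0]]          # d x r trainable matrix
A = [[0.5, 0.5]]     # r x k trainable matrix
W_eff = lora_effective_weight(W, A, B, alpha=2.0, r=1)
print(W_eff)  # [[2.0, 1.0], [2.0, 3.0]]
```

Because only B and A receive gradients, the trainable parameter count drops from d*k to r*(d+k), which is what makes the single-GPU baseline possible; the multi-GPU work in this issue is about scaling the same setup out via CodeFlare-scheduled workers.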

Metadata

Labels

help wanted (Extra attention is needed)
