

Fine-Tuning Your LLM

Using large language models to build generative AI applications brings new challenges. A key problem is ensuring response quality (accuracy and relevance) in the content the model generates for a given user request. In previous lessons, we discussed techniques like prompt engineering and retrieval-augmented generation that try to solve this problem by modifying the prompt input to the existing model.

In today's lesson, we discuss a third technique, fine-tuning, which tackles the challenge by retraining the model itself with additional data. Let's dive into the details.

Learning Objectives

This lesson introduces the concept of fine-tuning for pre-trained language models, explores the benefits and challenges of this approach, and provides guidance on when and how to use fine-tuning to improve the performance of your generative AI models.

By the end of this lesson, you should be able to answer the following questions:

  • What is fine-tuning for language models?
  • When, and why, is fine-tuning useful?
  • How can I fine-tune a pre-trained model?
  • What are the limitations of fine-tuning?

Ready to get started? Let's begin.

Illustrated Guide

Want to get the big picture of what we'll cover before we dive in? Check out this illustrated guide that maps the learning journey for this lesson - from learning the core concepts and motivation for fine-tuning, to understanding the process and best practices for executing the fine-tuning task. This is a rich topic for exploration, so don't forget to check the Resources page for additional links to support your self-guided learning journey!

Illustrated Guide to Fine Tuning Language Models

What is fine-tuning for language models?

By definition, large language models are pre-trained on large quantities of text sourced from diverse places, including the internet. As we learned in previous lessons, we need techniques like prompt engineering and retrieval-augmented generation to improve the quality of the model's responses to user questions ("prompts").

A popular prompt-engineering technique is to give the model more guidance on what is expected in the response, either by providing instructions (explicit guidance) or by giving it a few examples (implicit guidance). This is referred to as few-shot learning, but it has two limitations:

  • Model token limits can restrict the number of examples you can include, reducing effectiveness.
  • Model token costs can make it expensive to include examples in every prompt, reducing flexibility.
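To make the trade-off concrete, here is a minimal sketch of few-shot prompting as a chat-style message list. The classification task, system instruction, and example pairs are hypothetical illustrations - the point is that every example pair is re-sent, and re-billed, with every single request.

```python
# Sketch of few-shot prompting: worked examples are embedded in the prompt
# itself. Task and examples below are hypothetical.

def build_few_shot_messages(examples, user_query):
    """Build a chat-style message list with few-shot example pairs.

    Each example is a (user_input, ideal_response) tuple; every pair
    added here consumes tokens on every request sent to the model.
    """
    messages = [{"role": "system",
                 "content": "You classify customer feedback as positive or negative."}]
    for user_input, ideal_response in examples:
        messages.append({"role": "user", "content": user_input})
        messages.append({"role": "assistant", "content": ideal_response})
    messages.append({"role": "user", "content": user_query})
    return messages

examples = [
    ("The checkout flow was effortless.", "positive"),
    ("My order arrived two weeks late.", "negative"),
]
messages = build_few_shot_messages(examples, "Support resolved my issue in minutes.")
print(len(messages))  # → 6  (1 system + 2*2 example turns + 1 query)
```

Adding more examples tends to improve response quality, but each one pushes the prompt closer to the model's token limit and raises the per-request cost - which is exactly the pressure that motivates fine-tuning.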

Fine-tuning is a common practice in machine learning where a pre-trained model is retrained with new data to improve its performance on a specific task. In the context of language models, you can fine-tune the pre-trained model with a curated set of examples for a given task or application domain to create a custom model that is more accurate and relevant for that task or domain. A side benefit of fine-tuning is that it can reduce the number of examples needed for few-shot learning - cutting token usage and the related costs.
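The "curated set of examples" is typically serialized as JSON Lines, one conversation per line, in the chat format used by several providers (OpenAI among them). A minimal sketch, with a hypothetical "recipe assistant" domain and made-up examples:

```python
# Sketch: serializing a fine-tuning dataset as JSONL, one chat-format
# example per line. The domain and example content are hypothetical.
import json

training_examples = [
    {"messages": [
        {"role": "system", "content": "You are a recipe assistant."},
        {"role": "user", "content": "Suggest a quick weeknight pasta."},
        {"role": "assistant", "content": "Try aglio e olio: garlic, olive oil, chili flakes, spaghetti."},
    ]},
    {"messages": [
        {"role": "system", "content": "You are a recipe assistant."},
        {"role": "user", "content": "What can I make with leftover rice?"},
        {"role": "assistant", "content": "Fried rice: saute the rice with egg, soy sauce, and vegetables."},
    ]},
]

# Each line is a complete JSON object: the standard JSONL layout.
with open("training_data.jsonl", "w", encoding="utf-8") as f:
    for example in training_examples:
        f.write(json.dumps(example) + "\n")
```

Real datasets need far more examples than this (providers typically recommend dozens at minimum), but the per-line structure stays the same.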

When and why should we fine-tune models?

In this context, when we talk about fine-tuning, we mean supervised fine-tuning, where the retraining is done by adding new data that was not part of the original training dataset. This differs from unsupervised fine-tuning, where the model is retrained on the original data but with different hyperparameters.

The key thing to remember is that fine-tuning is an advanced technique that requires a certain level of expertise to get the desired results. Done incorrectly, it may not deliver the expected improvements, and it may even degrade the model's performance in your target domain.

So, before you learn "how" to fine-tune language models, you need to know "why" you should take this route, and "when" you should start fine-tuning. Begin by asking yourself questions like these:

  • Use Case: What is your use case for fine-tuning? What aspect of the current pre-trained model do you want to improve?
  • Alternatives: Have you tried other techniques to achieve the desired outcome? Use them to create a baseline for comparison.
    • Prompt engineering: Try quick techniques like few-shot prompting with examples of relevant prompt responses. Evaluate the quality of the responses.
    • Retrieval Augmented Generation: Try augmenting prompts with search results retrieved from your data. Evaluate the quality of the responses.
  • Costs: Have you identified the costs of fine-tuning?
    • Tunability - is the pre-trained model available for fine-tuning?
    • Effort - for preparing training data, evaluating & refining the model.
    • Compute - for running fine-tuning jobs, and for deploying the fine-tuned model.
    • Data - access to enough high-quality examples to make fine-tuning effective.
  • Benefits: Have you confirmed the benefits of fine-tuning?
    • Quality - does the fine-tuned model outperform the baseline?
    • Cost - does it reduce token usage by simplifying prompts?
    • Extensibility - can you repurpose the base model for new domains?

If you can answer these questions, you'll be in a good position to decide whether fine-tuning is the right approach for your use case. Ideally, the benefits should outweigh the costs. Once you decide to proceed, it's time to think about how to fine-tune the pre-trained model.

Want more insight into the decision process? Watch To fine-tune or not to fine-tune

How can we fine-tune a pre-trained model?

To fine-tune a pre-trained model, you need:

  • a pre-trained model to fine-tune
  • a dataset to use for fine-tuning
  • a training environment to run the fine-tuning job
  • a hosting environment to deploy the fine-tuned model
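Before spending compute on a training job, it pays to validate the dataset locally. The sketch below checks a JSONL file line by line; the commented-out upload-and-launch calls follow the OpenAI Python SDK as one possible provider workflow, and the file and model names in them are illustrative assumptions, not a tested recipe.

```python
# Sketch of the workflow: validate the dataset locally first, then (with a
# provider account) upload it and launch the fine-tuning job. The commented
# API calls follow the OpenAI Python SDK; names there are hypothetical.
import json

def validate_jsonl(path):
    """Return a list of problems: lines that are not valid JSON, or that
    lack a non-empty 'messages' list."""
    errors = []
    with open(path, encoding="utf-8") as f:
        for lineno, line in enumerate(f, start=1):
            try:
                record = json.loads(line)
            except json.JSONDecodeError:
                errors.append(f"line {lineno}: not valid JSON")
                continue
            if not record.get("messages"):
                errors.append(f"line {lineno}: missing 'messages'")
    return errors

# A tiny sample file with one good line and one broken line:
with open("sample.jsonl", "w", encoding="utf-8") as f:
    f.write(json.dumps({"messages": [{"role": "user", "content": "hi"}]}) + "\n")
    f.write("not json\n")

problems = validate_jsonl("sample.jsonl")
print(problems)  # → ['line 2: not valid JSON']

# Once the file is clean, a provider-side job might look like:
# from openai import OpenAI
# client = OpenAI()
# upload = client.files.create(file=open("training_data.jsonl", "rb"),
#                              purpose="fine-tune")
# job = client.fine_tuning.jobs.create(training_file=upload.id,
#                                      model="gpt-3.5-turbo")
```

Catching malformed lines locally is much cheaper than having a fine-tuning job fail after upload; the tutorials below cover the provider-specific details of each step.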

Fine-Tuning In Action

These resources provide step-by-step tutorials that walk you through a real example using a selected model and dataset. To follow them, you need an account with the relevant provider, along with access to the appropriate model and datasets.

  • OpenAI: How to fine-tune chat models - Learn how to fine-tune gpt-35-turbo for a specific domain ("recipe assistant") by preparing training data, running the fine-tuning job, and using the fine-tuned model for inference.
  • Azure OpenAI: GPT 3.5 Turbo fine-tuning tutorial - Learn to fine-tune the gpt-35-turbo-0613 model on Azure by creating and uploading training data, then running the fine-tuning job. Deploy and use the new model.
  • Hugging Face: Fine-tuning LLMs with Hugging Face - This post guides you through fine-tuning an open LLM (such as CodeLlama 7B) using the transformers library & Transformer Reinforcement Learning (TRL) with open datasets on Hugging Face.
  • 🤗 AutoTrain: Fine-tuning LLMs with AutoTrain - AutoTrain (or AutoTrain Advanced) is a Python library developed by Hugging Face that enables fine-tuning for many different tasks, including LLM fine-tuning. AutoTrain requires no code, and you can run fine-tuning in your own cloud, on Hugging Face Spaces, or locally. It offers a web-based GUI, a CLI, and training via yaml config files.
  • 🦥 Unsloth: Fine-tuning LLMs with Unsloth - Unsloth is an open-source framework that supports LLM fine-tuning and reinforcement learning (RL). It makes local training, evaluation, and deployment easy with ready-to-use notebooks, and also supports text-to-speech (TTS), BERT, and multimodal models. To get started, read their step-by-step Fine-tuning LLMs Guide.

Assignment

Pick one of the tutorials above and walk through it. We may replicate versions of these tutorials as Jupyter Notebooks in this repo for reference, but please use the original sources directly to get the latest versions.

Great Work! Continue Your Learning.

After you've completed this lesson, check out our Generative AI Learning collection to continue leveling up your Generative AI knowledge!

Congratulations!! You have completed the final lesson of the v2 series for this course! Don't stop learning and building. Check the RESOURCES page for more ideas on this topic.

Our v1 series lessons have also been updated with more assignments and concepts, so take some time to refresh your knowledge - and please share your questions and feedback to help us improve these lessons for the community.


Warning:
This document has been translated using the AI translation service Co-op Translator. While we strive for accuracy, please be aware that automated translations may contain errors or inaccuracies. The original document in its native language should be considered the authoritative source. For critical information, professional human translation is recommended. We are not liable for any misunderstandings or misinterpretations arising from the use of this translation.