This guide covers the full pipeline of fine-tuning a language model with LLaMA-Factory and deploying it for inference with FastAPI.
First, clone the repository:

```bash
git clone https://github.com/hiyouga/LLaMA-Factory.git
cd LLaMA-Factory
```

Optionally redirect conda's package and environment directories to a data disk (the `/root/autodl-tmp` paths below assume an AutoDL-style server):

```bash
mkdir -p /root/autodl-tmp/conda/pkgs
conda config --add pkgs_dirs /root/autodl-tmp/conda/pkgs
mkdir -p /root/autodl-tmp/conda/envs
conda config --add envs_dirs /root/autodl-tmp/conda/envs
```

Create and activate a dedicated environment, then install LLaMA-Factory:

```bash
conda create -n llama-factory python=3.10 -y
conda activate llama-factory
pip install -e ".[torch,metrics]"
```

Verify the installation by launching the WebUI:

```bash
llamafactory-cli webui
```

Next, set up a cache directory for model downloads:

```bash
mkdir -p /root/autodl-tmp/Hugging-Face
export HF_HOME=/root/autodl-tmp/Hugging-Face
pip install -U huggingface_hub
```
Download the base model:

```bash
huggingface-cli download --resume-download <your-model-name>
```

- Place your training data in the `data` directory:

  ```
  LLaMA-Factory/data/your_data.json
  ```

- Register the dataset in `data/dataset_info.json`:

  ```json
  "your_data": {
    "file_name": "your_data.json"
  }
  ```

- Launch the WebUI:
```bash
llamafactory-cli webui
```

- In the WebUI:
  - Set the model path to the downloaded model's snapshot folder, i.e. the hash-named directory under `$HF_HOME/hub/models--<org>--<name>/snapshots/`.
  - Select `your_data` as your training dataset.
  - Configure your training parameters as needed.
- Click Start Training.
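For reference, the training file registered above (`your_data.json`) is, under LLaMA-Factory's default Alpaca-style template, a JSON array of `instruction`/`input`/`output` records. The sample below is purely illustrative; other formats require matching column definitions in `dataset_info.json`:

```json
[
  {
    "instruction": "Translate the sentence into French.",
    "input": "Good morning!",
    "output": "Bonjour !"
  }
]
```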
After training completes:
Create a directory for the merged model:

```bash
mkdir -p Models/<your-model-name>-merged
```

- In the WebUI:
- Set the export path accordingly.
- Click Start Export.
Create a separate environment for serving:

```bash
conda create -n fastapi python=3.10 -y
conda activate fastapi
conda install -c conda-forge fastapi uvicorn transformers pytorch -y
pip install safetensors sentencepiece protobuf
```

Create the application directory and files:

```bash
mkdir App
cd App
touch main.py test.py
```

- Paste your FastAPI app code into `main.py`.
- Paste your test script into `test.py` (modify as needed for your use case).
Start the server:

```bash
uvicorn main:app --reload --host 0.0.0.0
```

In a new terminal, run the test script:

```bash
python test.py
```

The resulting project structure:

```
├── LLaMA-Factory/
│   ├── data/
│   │   ├── your_data.json
│   │   └── dataset_info.json
├── Models/
│   └── your-model-name-merged/
├── App/
│   ├── main.py
│   └── test.py
```
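The test script in `App/test.py` can be a minimal client using only the standard library. The `/generate` endpoint and the `prompt`/`max_new_tokens` payload fields below are assumptions; match them to whatever your FastAPI app actually exposes:

```python
# test.py -- minimal stdlib client sketch (endpoint and fields are assumptions).
import json
import urllib.request

API_URL = "http://127.0.0.1:8000/generate"


def build_payload(prompt: str, max_new_tokens: int = 128) -> dict:
    """Request body expected by the hypothetical /generate endpoint."""
    return {"prompt": prompt, "max_new_tokens": max_new_tokens}


def query(prompt: str) -> str:
    data = json.dumps(build_payload(prompt)).encode("utf-8")
    req = urllib.request.Request(
        API_URL, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]


if __name__ == "__main__":
    print(query("Hello, who are you?"))
```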
- Adjust paths according to your environment setup.
- Ensure that ports are open for API access if running on a remote server.
- You can use `nohup` or `screen` to keep long-running services alive after the terminal closes.
This project combines open-source tools. Please refer to each respective repository for licensing details.