
Add a playbook of llama factory fine-tuning #70

Merged
danielholanda merged 20 commits into main from nzhang/llama_factory
Mar 20, 2026
Conversation

@zhangnju
Collaborator

This playbook is based on LLaMA Factory and covers: 1) playbook duration and risk, 2) detailed fine-tuning instructions, and 3) an introduction to the important components of LLaMA Factory.

Collaborator

@adamlam2-amd adamlam2-amd left a comment


Thanks for the PR - overall nice job capturing the functionality and nuances of LLaMA Factory.

A couple of higher-level comments:

  • We will use the PyTorch + ROCm stack that is already installed on the Box, so we don't need any Docker commands.
  • I think we can shorten the playbook in general. We can focus on one main outcome in the actual playbook (LoRA/QLoRA/etc.) and reference the rest in the Next Steps section. We don't want to overload the user with too much information during the actual steps.
  • I would recommend using the LLaMA Factory UI, as it seems more intuitive.
  • General comments on wording, conciseness, and consistency of words/headings.

@danielholanda
Collaborator

@adamlam2-amd any other comments here?
@iswaryaalex please also make sure to review at some point this week.

@adamlam2-amd
Collaborator

This playbook is still problematic. There are 4 major questions that need to be resolved, among other more minor things.

  1. Are we recommending Docker? @danielholanda, we need your guidance here, as the HaloBox comes pre-installed with PyTorch. But if we open the playbook up to other AMD hardware, what is the strategy? I recommend not using Docker at all and installing from source. This includes removing the 'Optional' bits, as they can confuse users.

  2. Do we need to recommend bitsandbytes? I feel this will complicate the overall process. We can mention it in the Next Steps section instead.

  3. Must we use the CLI tool? LLaMA Factory has a great UI that I think will be a much better user experience, and we can add pictures as well. I recommend using the GUI instead of the CLI.

  4. What model and fine-tuning method are recommended? Let's have one of each so the user can understand, and then add more in the 'Next Steps' section.

As a reference, this LLaMA Factory guide is more helpful: https://www.datacamp.com/tutorial/llama-factory-web-ui-guide-fine-tuning-llms. In general, we want new developers to have a positive and seamless experience with this tutorial.

@zhangnju please spend some time making the required fixes. DM me for more info. Thank you!

@adamlam2-amd
Collaborator

We will also need Windows-specific content, if any. I suspect the git commands will differ slightly, and the screenshots may differ as well.

@zhangnju
Collaborator Author

zhangnju commented Mar 5, 2026

Hi @adamlam2-amd, thanks for your review.


For item 1: if Halo Linux also has PyTorch installed, I can remove the optional PyTorch Docker sections. @danielholanda, could you help confirm?
For item 2: bitsandbytes is an important library, and many developers are interested in it. That is why I explained how to install it from source; otherwise developers may hit issues on Radeon.
For item 3: I have talked with some developers, and they still prefer the CLI over the UI. The NVIDIA DGX Spark playbook also introduces the CLI for LLaMA Factory, not the UI. I have added the UI to the Next Steps section.
For item 4: I used the Qwen3 model and mainly covered LoRA fine-tuning. Since some developers are interested in QLoRA, I also give a sample QLoRA command for their reference. If you think QLoRA is not needed, I can remove it. @danielholanda
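
The LoRA/QLoRA setup discussed here could be sketched as a LLaMA Factory-style YAML training config. This is a hedged illustration only: the model id, dataset name, and output path below are placeholders I chose, and the field names follow LLaMA Factory's published example configs rather than any file in this PR.

```yaml
### Hypothetical LoRA SFT config sketch. Field names follow LLaMA Factory's
### example configs; model/dataset/output values are placeholders, not from this PR.
model_name_or_path: Qwen/Qwen3-8B     # placeholder model id

stage: sft
do_train: true
finetuning_type: lora
lora_target: all

dataset: identity                     # placeholder dataset name
template: qwen
cutoff_len: 2048

output_dir: saves/qwen3-lora-sft      # placeholder output path
per_device_train_batch_size: 1
gradient_accumulation_steps: 8
learning_rate: 1.0e-4
num_train_epochs: 3.0
bf16: true

### For a QLoRA variant, the base model would additionally be quantized, e.g.:
# quantization_bit: 4
```

A config like this would typically be passed to LLaMA Factory's CLI (`llamafactory-cli train <config>.yaml`); the same options can also be set interactively through its web UI.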

@adamlam2-amd
Collaborator

Hey,

Regarding 2), bitsandbytes doesn't work on Windows (I think?). Please confirm whether it does.

Regarding 3), we have 2 other playbooks that teach users how to do LLM fine-tuning: Unsloth and PyTorch. Let's use the GUI for LLaMA Factory to appeal to the no-code quick-start developer community.

Regarding 4), we also have QLoRA and LoRA explained in another playbook, which is why I mentioned it.
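
As a quick way to settle the availability question above, a small probe can report whether bitsandbytes is even importable on the current platform. This is a generic standard-library sketch of my own, not part of the playbook; it only locates the module and does not try to import or build bitsandbytes itself.

```python
# Sketch: probe whether a bitsandbytes module is present on this platform.
# bitsandbytes has historically lacked official Windows wheels, and on ROCm
# it often has to be built from source, so this check can come back False.
import importlib.util
import platform


def bitsandbytes_available() -> bool:
    """Return True if a bitsandbytes module can be located (no import side effects)."""
    return importlib.util.find_spec("bitsandbytes") is not None


if __name__ == "__main__":
    print(f"{platform.system()}: bitsandbytes available = {bitsandbytes_available()}")
```

Running this on the target Windows box would directly answer whether the Windows-specific content needs a bitsandbytes caveat.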

@zhangnju
Collaborator Author

zhangnju commented Mar 9, 2026


Hi @adamlam2-amd,

I have removed the bitsandbytes and Docker setup sections from the playbook.

Some developers prefer Unsloth as a fine-tuning tool, and some prefer LLaMA Factory. So even though we have the Unsloth and PyTorch playbooks, we still need a LLaMA Factory playbook to show developers that AMD devices also support LLaMA Factory and that they can give it a try.

@danielholanda
Collaborator

@adamlam2-amd Please let us know if there are any additional requirements here before we merge this.

@danielholanda
Collaborator

@iswaryaalex Please remember to take a look and review this.

Collaborator

@iswaryaalex iswaryaalex left a comment


Overall a clean playbook; I like the logical flow of install → fine-tune → test/export.

Key changes to consider:

  • Dependencies: this is critical for the playbook. As we are not using Docker, I highly encourage adding references to the prerequisites that are already pre-installed in the Dependencies section.
  • For additional dependencies introduced by LLaMA Factory fine-tuning, clarify them under Additional Dependencies.
  • Typos in the README need to be fixed.

@danielholanda
Collaborator

@zhangnju Please ping the reviewers here once this is ready for another round of reviews

@zhangnju
Collaborator Author


Sure. I have updated the playbook according to the latest feedback. @danielholanda @iswaryaalex @adamlam2-amd

@danielholanda
Collaborator

@iswaryaalex @adamlam2-amd Can you please take another look?

@iswaryaalex
Collaborator

Looks good to me! Changes to playbook.json are needed to pass the checks.

Collaborator

@iswaryaalex iswaryaalex left a comment


LGTM!

@zhangnju
Collaborator Author


I have updated playbook.json

Collaborator

@adamlam2-amd adamlam2-amd left a comment


I made some changes myself to improve general UI and text formatting/grammar. Please review if you wish. Otherwise, looks good.

Approving so it can pass into QA.

@danielholanda danielholanda merged commit e4d47a5 into main Mar 20, 2026
3 checks passed
