docs: add LoRA fine-tuning tutorial #3601
chiajunglien wants to merge 2 commits into AI-Hypercomputer:jackyf/feat/lora-nnx from
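For reviewers new to the technique the tutorial covers: LoRA freezes the pretrained weight matrix `W` and learns only a low-rank update `ΔW = B·A`, which is why the adapter checkpoints discussed below are so much smaller than full fine-tuned weights. A minimal, illustrative numpy sketch of the idea (toy dimensions and init, not MaxText code):

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_out, rank = 64, 64, 4

# Frozen pretrained weight.
W = rng.normal(size=(d_in, d_out))

# Trainable low-rank factors: A starts random, B starts at zero,
# so the adapted model is initially identical to the base model.
A = rng.normal(size=(rank, d_out)) * 0.01
B = np.zeros((d_in, rank))

x = rng.normal(size=(8, d_in))

# Forward pass: base projection plus the low-rank update B @ A.
base = x @ W
adapted = x @ (W + B @ A)
assert np.allclose(base, adapted)  # holds because B is still zero

# Trainable-parameter savings: rank * (d_in + d_out) vs d_in * d_out.
lora_params = rank * (d_in + d_out)
full_params = d_in * d_out
print(lora_params, full_params)  # 512 4096
```

Only `A` and `B` are saved in the adapter checkpoint, which is what the conversion scripts below shuttle between MaxText/Orbax and Hugging Face formats.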
Conversation
docs/tutorials/posttraining/lora.md (Outdated)
````
scan_layers=True
```

Your fine-tuned model checkpoints will be saved here: `$BASE_OUTPUT_DIRECTORY/$RUN_NAME/checkpoints`.
````
Please also document usage of the `maxtext_to_hf_lora` conversion script here.
````
learning_rate="${LEARNING_RATE?}" \
weight_dtype="${WEIGHT_DTYPE?}" \
dtype="${DTYPE?}" \
profiler=xplane \
````
I don't think we need to enable the profiler in a tutorial.
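Side note on the quoted commands: the `${LEARNING_RATE?}` form is the shell's error-if-unset expansion, so the run fails fast with a clear message instead of silently passing an empty value. A quick illustration:

```shell
# ${VAR?} expands to VAR's value, but aborts the shell if VAR is unset.
LEARNING_RATE="3e-4"
echo "lr=${LEARNING_RATE?}"   # prints lr=3e-4

# In a subshell, referencing an unset variable with ? aborts the expansion:
( unset LEARNING_RATE; echo "${LEARNING_RATE?}" ) 2>/dev/null \
  && echo "expanded" || echo "aborted"   # prints aborted
```

This is standard POSIX parameter expansion, not anything MaxText-specific, so the tutorial may want to mention that these variables must be exported before running the command.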
````
Your fine-tuned model checkpoints will be saved here: `$BASE_OUTPUT_DIRECTORY/$RUN_NAME/checkpoints`.

## (Optional) Export Fine-tuned LoRA to Hugging Face Format
````
"Convert" would be more appropriate than "Export" here.
````
```sh
python3 maxtext/checkpoint_conversion/maxtext_to_hf_lora.py \
maxtext/configs/post_train/sft.yml \
````
Could we remove `maxtext/configs/post_train/sft.yml`?
````
```sh
python3 maxtext/checkpoint_conversion/hf_lora_to_maxtext.py \
maxtext/configs/post_train/sft.yml \
````
Could we remove `maxtext/configs/post_train/sft.yml`?
````
maxtext/configs/post_train/sft.yml \
model_name="${PRE_TRAINED_MODEL?}" \
load_parameters_path="${BASE_OUTPUT_DIRECTORY?}/${RUN_NAME?}/checkpoints/<step_number>/items" \
base_output_directory="${BASE_OUTPUT_DIRECTORY?}/hf_lora_adaptor" \
````
The rest of the file uses "adapter", so we should align the spelling here.
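A related usability point: the `<step_number>` placeholder in `load_parameters_path` must be filled in manually. If the checkpoint directory is on a local filesystem (for `gs://` paths one would list with `gsutil ls` instead), the newest step can be picked with a numeric sort. A sketch, assuming step directories are plain numbers (the demo data lines are only there to make it self-contained):

```shell
# Sketch: find the newest checkpoint step under .../checkpoints/<step>/items.
BASE_OUTPUT_DIRECTORY="${BASE_OUTPUT_DIRECTORY:-$(mktemp -d)}"
RUN_NAME="${RUN_NAME:-my_run}"
CKPT_DIR="${BASE_OUTPUT_DIRECTORY}/${RUN_NAME}/checkpoints"
mkdir -p "${CKPT_DIR}/0" "${CKPT_DIR}/500" "${CKPT_DIR}/1000"   # demo data only

# Keep only purely numeric directory names, sort numerically, take the last.
LATEST_STEP=$(ls "${CKPT_DIR}" | grep -E '^[0-9]+$' | sort -n | tail -1)
echo "load_parameters_path=${CKPT_DIR}/${LATEST_STEP}/items"
```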
````
If your LoRA adapter is currently in Hugging Face format, you must convert it to MaxText format before it can be loaded. Use the provided conversion script:

```sh
python3 maxtext/checkpoint_conversion/hf_lora_to_maxtext.py \
````
We should use `python3 -m maxtext.checkpoint_conversion.hf_lora_to_maxtext` to align with the training command.
````
After completing the fine-tuning process, your LoRA weights are stored in MaxText/Orbax format. To use these weights with the Hugging Face ecosystem (e.g., for inference or sharing), convert them back using the `maxtext_to_hf_lora.py` script.

```sh
python3 maxtext/checkpoint_conversion/maxtext_to_hf_lora.py \
````
We should use `python3 -m maxtext.checkpoint_conversion.maxtext_to_hf_lora` to align with the training command.
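For background on why the `-m` form matters here: `python3 -m pkg.mod` puts the current directory on `sys.path` and resolves the package's absolute imports, whereas `python3 pkg/mod.py` puts `pkg/` itself first, so imports of the enclosing package can break. A self-contained demonstration with a hypothetical throwaway package `demo_pkg` (not part of MaxText):

```shell
# Build a throwaway package to compare the two invocation styles.
cd "$(mktemp -d)"
mkdir -p demo_pkg
: > demo_pkg/__init__.py
printf 'VALUE = 41\n' > demo_pkg/helper.py
printf 'from demo_pkg import helper\nprint(helper.VALUE + 1)\n' > demo_pkg/mod.py

# Module form: cwd is on sys.path, so `from demo_pkg import helper` resolves.
python3 -m demo_pkg.mod                # prints 42

# Script form: sys.path[0] is demo_pkg/, not the cwd, so the import fails.
python3 demo_pkg/mod.py 2>/dev/null \
  || echo "script form fails: demo_pkg not importable"
```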