Skip to content

feat: add rhoai 3.2 fine tuning examples#27

Merged
kramaranya merged 3 commits into
red-hat-data-services:mainfrom
briangallagher:add-3.2-finetuning-examples
Jan 16, 2026
Merged

feat: add rhoai 3.2 fine tuning examples#27
kramaranya merged 3 commits into
red-hat-data-services:mainfrom
briangallagher:add-3.2-finetuning-examples

Conversation

@briangallagher

Copy link
Copy Markdown
Contributor

Add RHOAI 3.2 fine tuning examples for OSFT and SFT

@briangallagher briangallagher force-pushed the add-3.2-finetuning-examples branch 5 times, most recently from 63d2581 to 6fd60d9 Compare January 12, 2026 13:15
Signed-off-by: Brian Gallagher <briangal@gmail.com>
@briangallagher briangallagher force-pushed the add-3.2-finetuning-examples branch from 6fd60d9 to 7cbcdce Compare January 12, 2026 13:25

@Fiona-Waters Fiona-Waters left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some small nits after code review, otherwise they look good.

Comment thread examples/fine-tuning/rhoai-3.2/osft/README.md Outdated
Comment thread examples/fine-tuning/rhoai-3.2/osft/README.md Outdated
Comment thread examples/fine-tuning/rhoai-3.2/osft/osft-example.ipynb Outdated
Comment thread examples/fine-tuning/rhoai-3.2/training-hub/sft/README.md Outdated
Comment thread examples/fine-tuning/rhoai-3.2/training-hub/sft/sft.ipynb Outdated
Comment thread examples/fine-tuning/rhoai-3.2/training-hub/sft/sft.ipynb Outdated

@kramaranya kramaranya left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @briangallagher!
Looks good to me, just left a few comments

Comment on lines +24 to +25
* Log in, then go to _Data Science Projects_ and create a project:
![](./images/02.png)

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Shall we update those images with RHOAI 3.2?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ok if we fix this for 3.3? Given time constraint? We don't currently have a 3.2 cluster available. 3.2 fast release also.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sure, I think it should be fine

"!python3 -m pip install datasets transformers accelerate bitsandbytes huggingface_hub\n",
"!python3 -m pip install --force-reinstall --no-cache-dir -U datasets\n",
"!python3 -m pip install --force-reinstall --no-cache-dir -U \"kubeflow @ git+https://github.com/opendatahub-io/kubeflow-sdk.git@v0.2.1+rhai0\"\n",
"!python3 -m pip install --force-reinstall --no-cache-dir -U training-hub==0.4.0\n",

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just thinking out loud, why training-hub is not a dependency in kubeflow sdk?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

iirc, Saad had an opinion on this that training-hub was too "big and complex" to be a dependency of the sdk. We could make it an extra but then we have to consider how to deal the training-hub extras such as cuda. I think it makes sense to keep separate.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks Brian

Comment thread examples/fine-tuning/rhoai-3.2/training-hub/sft/sft.ipynb
"!python3 -m pip install --force-reinstall --no-cache-dir -U datasets\n",
"!python3 -m pip install --force-reinstall --no-cache-dir -U \"kubeflow @ git+https://github.com/opendatahub-io/kubeflow-sdk.git@v0.2.1+rhai0\"\n",
"!python3 -m pip install --force-reinstall --no-cache-dir -U training-hub==0.4.0\n",
"!python3 -m pip install --force-reinstall --no-cache-dir -U ipykernel"

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why do we need ipykernel?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hmm, I'm struggling to remember now. I did hit a an issue that required me to install it but struggling to remember. I think it's maybe safer to leave in and revisit for 3.3?

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Its a dev dependency in training-hub and mini-trainer. I had to add it for e2e tests also. BUT it is included in the universal image so when we are using that we won't need to pip install separately.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Makes sense

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why do we need those changes in this file?

@briangallagher briangallagher Jan 14, 2026

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It was failing format or lint. I've shared this with Shri and he was fine with us committing

Signed-off-by: Brian Gallagher <briangal@gmail.com>
@kramaranya

Copy link
Copy Markdown
Contributor

Thank you @briangallagher!
lgtm

Signed-off-by: Brian Gallagher <briangal@gmail.com>

@Fiona-Waters Fiona-Waters left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

@kramaranya kramaranya left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you!
/approve

@kramaranya kramaranya merged commit 12861df into red-hat-data-services:main Jan 16, 2026
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants