Setup of OpenDataHub or RHOAI (with or without GPUs) for RHDH AI Model Catalog testing

The latest functional testing shows that OCP 4.16 or later is required. The setup has been verified on versions up to 4.19.
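
As a quick guard before installing, the version requirement can be checked with a small shell function. This is only a sketch: the function name ocp_version_supported is illustrative and not part of this repo, and it only compares the major.minor part of the version string against 4.16.

```shell
# Succeeds (exit 0) when the given OCP version is 4.16 or later.
# ocp_version_supported is an illustrative helper, not a script in this repo.
ocp_version_supported() {
  _version="$1"
  # Require at least "major.minor".
  case "$_version" in
    *.*) ;;
    *) return 2 ;;
  esac
  _major="${_version%%.*}"   # text before the first dot
  _rest="${_version#*.}"     # text after the first dot
  _minor="${_rest%%.*}"      # second component only
  # Reject empty or non-numeric components.
  case "$_major" in ''|*[!0-9]*) return 2 ;; esac
  case "$_minor" in ''|*[!0-9]*) return 2 ;; esac
  if [ "$_major" -gt 4 ]; then return 0; fi
  [ "$_major" -eq 4 ] && [ "$_minor" -ge 16 ]
}

# Possible use against a live cluster (not run here):
#   ocp_version_supported "$(oc get clusterversion version -o jsonpath='{.status.desired.version}')" \
#     || echo "OCP 4.16 or later is required" >&2
```

The function returns 2 for input it cannot parse, so a missing or malformed version is distinguishable from a version that is simply too old.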

Install via kustomize

From a clone of this repo, run the following from the directory that contains this README:

ODH

oc apply -k ./kustomize/

RHOAI

oc apply -k ./kustomize-rhoai/

GPUs

If you provisioned a cluster with GPUs, apply this kustomization before applying either the ODH or the RHOAI kustomization:

oc apply -k ./kustomize-gpu

The NodeFeature subscription is specific to the OCP version. The nodefeature-subscription.yaml file currently references a 4.19 version. If your cluster runs a different OCP version, edit that file to match, then run the contents of the kustomize-gpu/job.yaml Job manually:

bash ./subscriptions-gpu.sh
bash ./cp-setup.sh
bash ./nfd-setup.sh
# this will clean up status files created during the subscriptions setup
rm *.txt
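
The ordering described in this section (GPU kustomization first, then ODH or RHOAI) can be sketched as a dry-run helper that only prints the commands. The function name print_install_commands and its arguments are illustrative and do not exist in this repo; the directories are the ones used above.

```shell
# Prints, in order, the `oc apply -k` commands for a given setup.
# print_install_commands is an illustrative helper, not a script in this repo.
#   $1: "odh" or "rhoai"
#   $2: "gpu" if the cluster has GPUs; omit otherwise
print_install_commands() {
  case "$1" in
    odh)   _dir="./kustomize/" ;;
    rhoai) _dir="./kustomize-rhoai/" ;;
    *)     echo "usage: print_install_commands odh|rhoai [gpu]" >&2; return 2 ;;
  esac
  # The GPU kustomization is applied before ODH or RHOAI.
  if [ "${2:-}" = "gpu" ]; then
    echo "oc apply -k ./kustomize-gpu"
  fi
  echo "oc apply -k $_dir"
}
```

For example, `print_install_commands rhoai gpu` prints the GPU command followed by the RHOAI command, which can be reviewed before running them by hand.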

Set up LlamaStack operator and a LlamaStack instance

With some minor modifications, the "Running LlamaStack Operator with ODH" instructions produced:

a running Llama 3.2 3B Instruct model, deployed as a vLLM NVIDIA GPU KServe InferenceService in the llamastack namespace.
a running llama-stack instance that uses the Llama 3.2 3B Instruct model.

Tweaks to the instructions there include:

The UI / Dashboard directions do not line up exactly with the ODH 2.33 console on OCP. Instead, go into the llamastack project created by the kustomize-gpu Job, and select single server model serving for that project.
Then click the Connections tab near the top
Click Create a new Connection
The values for Connection name, type, and URI for Create a Connection are still correct.
To deploy the model, while still in the llamastack project, click the Models tab near the top.
Click the deploy a model button.
The specific field settings in the instructions are still correct for the fields they mention.
In addition, select the option to expose the model through a route, and disable authentication.
When creating the LlamaStackDistribution CR in step three, using the Service URL for VLLM_URL did not work. Using the URL of the Route created for the model, with the /v1 suffix appended, does work.
Also, set the mountPath in the last line to the default.
You'll have to create your own LlamaStackDistribution yaml for your cluster, but a reference example exists in the file llamastackdistribution-gabe-pers-cluster.yaml.
Lastly, the various Python snippets in Query the Model from Jupyter Notebook have been collected in the file jupyter-nb-test.py for convenience. As with the prior steps, the UI / Dashboard navigation directions don't quite line up with what you will see with ODH 2.33 running on OCP. Assuming you are still in the llamastack project, click the Workbenches tab near the top, create a new notebook, and go from there. As long as a Python 3.12 workbench type is chosen, any of those choices seems to be OK so far.
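
The VLLM_URL tweak above (Route URL plus the /v1 suffix) can be sketched as a small shell helper. The function name vllm_url_from_route is illustrative and not part of this repo, and the sketch assumes the Route is served over https when only a bare host is given.

```shell
# Builds a VLLM_URL value from the model's Route host or URL.
# vllm_url_from_route is an illustrative helper, not a script in this repo.
vllm_url_from_route() {
  _url="$1"
  [ -n "$_url" ] || { echo "route host or URL is empty" >&2; return 1; }
  case "$_url" in
    http://*|https://*) ;;         # already a full URL, keep its scheme
    *) _url="https://$_url" ;;     # bare Route host: https is an assumption
  esac
  # Drop trailing slashes so the suffix is appended cleanly.
  while [ "${_url%/}" != "$_url" ]; do _url="${_url%/}"; done
  case "$_url" in
    */v1) ;;                       # suffix already present
    *) _url="$_url/v1" ;;
  esac
  echo "$_url"
}

# Possible use against a live cluster (not run here; <route-name> is a placeholder):
#   VLLM_URL="$(vllm_url_from_route "$(oc get route <route-name> -n llamastack -o jsonpath='{.spec.host}')")"
```

Since vLLM serves an OpenAI-compatible API, a request such as `curl -sk "$VLLM_URL/models"` should be a reasonable smoke test of the resulting URL before putting it in the LlamaStackDistribution yaml.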
