This repository contains practical examples and demos to help you quickly build AI apps with Llama Stack on Kubernetes or OpenShift. Whether you're a cluster admin looking to deploy the right GenAI infrastructure or a developer eager to innovate with AI agents, the content in this repo will help you get up and running.
To run these demos, ensure your environment meets the following requirements:
- An OpenShift cluster, version 4.17+
- 2 GPUs with a minimum of 40 GB VRAM each
Next, follow these simple steps to deploy the core components:
- Create a dedicated OpenShift project:

```bash
oc new-project llama-serve
```
- Apply the Kubernetes manifests. This deploys the foundational Llama Stack services, the vLLM model servers, and the MCP tool servers:

```bash
oc apply -k kubernetes/kustomize/overlay/all-models
```
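It can take a few minutes for the model servers to pull images and load weights. As a quick sanity check, you can probe the stack's health endpoint once it is reachable. The sketch below assumes you have exposed the Llama Stack service through an OpenShift Route; the hostname shown is a placeholder (use whatever `oc get routes -n llama-serve` reports):

```python
# Quick readiness probe for the deployed Llama Stack server.
# NOTE: the Route hostname below is a placeholder -- substitute the one
# reported by `oc get routes -n llama-serve`.
import json
import urllib.request

LLAMA_STACK_URL = "http://llama-stack-llama-serve.apps.example.com"

with urllib.request.urlopen(f"{LLAMA_STACK_URL}/v1/health") as resp:
    print(json.load(resp))  # a healthy stack reports an OK status
```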
We use `uv` for managing Python dependencies, ensuring a consistent and efficient development experience. Here's how to get your environment ready:

- Install `uv`:

```bash
pip install uv
```
- Synchronize your environment:

```bash
uv sync
```
- Activate the virtual environment:

```bash
source .venv/bin/activate
```
Now you're all set to run any Python scripts or Jupyter notebooks within the `demos/rag_agentic` directory!
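If you'd like to smoke-test the connection before diving into the notebooks, a minimal example with the `llama-stack-client` SDK looks something like this. The base URL and model ID are placeholders, not values from this repo; point them at your own Route and at a model actually registered with your stack:

```python
# Minimal smoke test using the llama-stack-client SDK.
# The base_url and model_id below are placeholders -- substitute your own
# Route hostname and one of the models your stack has registered.
from llama_stack_client import LlamaStackClient

client = LlamaStackClient(base_url="http://llama-stack-llama-serve.apps.example.com")

# List the registered models (handy for finding valid model IDs).
for model in client.models.list():
    print(model.identifier)

response = client.inference.chat_completion(
    model_id="meta-llama/Llama-3.1-8B-Instruct",  # hypothetical; pick one from the list above
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.completion_message.content)
```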
The diagram below shows an example architecture for a secure Llama Stack-based application deployed on OpenShift (OCP), using both MCP tools and a Milvus vector database for its agentic and RAG workflows. This is the same architecture implemented in the RAG/agentic demos.
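To make the moving parts concrete, here is a rough sketch of how an MCP tool server and a Milvus vector database can be wired into the stack through the client SDK. Every identifier, endpoint, and embedding model below is illustrative rather than the demos' real configuration; see the `demos/rag_agentic` notebooks for the actual values:

```python
# Illustrative wiring of the architecture above via the llama-stack-client SDK.
# All IDs, URIs, and the embedding model are placeholders, not the demos' real values.
from llama_stack_client import LlamaStackClient

client = LlamaStackClient(base_url="http://llama-stack-llama-serve.apps.example.com")

# Register an MCP tool server so agents can discover and call its tools.
client.toolgroups.register(
    toolgroup_id="mcp::my-tools",
    provider_id="model-context-protocol",
    mcp_endpoint={"uri": "http://mcp-server.llama-serve.svc.cluster.local:8000/sse"},
)

# Register a vector database backed by the Milvus provider for RAG.
client.vector_dbs.register(
    vector_db_id="my-rag-db",
    provider_id="milvus",
    embedding_model="all-MiniLM-L6-v2",
    embedding_dimension=384,
)
```

From there, documents can be ingested into the registered vector DB for RAG and agents can invoke the MCP tools; the demo notebooks walk through the end-to-end flow.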
We're excited to see what you build with Llama Stack! If you have any questions or feedback, please don't hesitate to open an issue. Happy building! 🎉