
Commit c20cd4c

Merge pull request #90 from MichaelClifford/readme
update README intro
2 parents ce6abe9 + c2af544 commit c20cd4c

1 file changed

Lines changed: 15 additions & 3 deletions

README.md

@@ -1,11 +1,23 @@
 # Llama Stack Demos
-## Overview
-This document provides an overview of the architecture and deployment process for the Llama Stack on OpenShift (OCP). It demonstrates how the Llama Stack agentic framework, vLLM agents, and multiple MCP servers are interconnected and deployed within an OpenShift cluster. The architecture diagram below illustrates the logical flow and integration of these components.
+
+This repo contains a collection of examples and demos for both cluster admins and AI developers to help them start building [Llama Stack](https://github.com/meta-llama/llama-stack) based apps on OpenShift or Kubernetes.
+
+**For Cluster Admins**, please take a look at the [kubernetes/](./kubernetes/) directory of this repo. It contains useful documentation along with all the manifests required to deploy each of the components of a Llama Stack based app onto OpenShift or Kubernetes. That includes Llama Stack itself, as well as [vLLM](https://docs.vllm.ai/en/stable/index.html) model servers, [MCP](https://github.com/modelcontextprotocol) tool servers, an observability toolkit, and simple frontend apps for users to interact with the AI demos.
+
+**For AI Devlopers**, please take a look at the [demos/](./demos/) directory of this repo. It contains useful documentation as well as all the notebooks, Containerfiles and application code needed to learn about developing AI applications with Llama Stack and deploying them on OpenShift or Kubernetes.
+
+Current Demos:
+
+* [RAG/Agentic](./demos/rag_agentic/)
+
+
+## Example Architecture
+The below diagram is an example architecture for a secure Llama Stack based application deployed on OpenShift (OCP) using both MCP tools and and a [Milvus](https://milvus.io/) vectorDB for its agentic and RAG based workflows. This is the same architecture that has been implemented in the [RAG/Agentic](./demos/rag_agentic/) demos.
 
 ![Architecture Diagram](./images/architecture-diagram.jpg)
 
 ## Requirements
-The following scenario requires at minimum the following:
+The following scenarios requires at minimum the following:
 
 * OpenShift Cluster 4.17+
 * 8 GPUs free (A100 or H100)
