Name	Name	Last commit message	Last commit date
parent directory ..
docs	docs
.beamignore	.beamignore
.env.example	.env.example
.gitignore	.gitignore
README.md	README.md
app.py	app.py
rag.py	rag.py
start_server.py	start_server.py

Name

Last commit message

Last commit date

.beamignore

Fastest RAG stack with Milvus and Groq

This project builds the fastest stack to build a RAG application with retrieval latency < 15ms.

It leverages binary quantization for efficient retrieval coupled with Groq's blazing fast inference speeds.

We use:

LlamaIndex for orchestrating the RAG app.
Milvus vectorDB for binary vector indexing and storage.
Groq as the inference engine for MoonshotAI's Kimi K2.
Beam for ultra-fast serverless deployment.

Setup and Installation

Ensure you have Python 3.11 or later installed on your system.

First, let’s install uv and set up our Python project and environment:

# MacOS/Linux
curl -LsSf https://astral.sh/uv/install.sh | sh

# Windows
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"

Install dependencies:

# Create a new directory for our project
uv init fastest-rag
cd fastest-rag

# Create virtual environment and activate it
uv venv
source .venv/bin/activate  # MacOS/Linux

.venv\Scripts\activate     # Windows

# Install dependencies
uv add pymilvus llama-index llama-index-embeddings-huggingface llama-index-llms-groq streamlit beam-client

Setup Groq:

Get an API key from Groq and set it in the .env file as follows:

GROQ_API_KEY=<YOUR_GROQ_API_KEY>

Setup Beam:

Go to https://www.beam.cloud/ and get started
Your default token will be generated automatically

In your terminal add the command with your beam token to register

beam configure default --token <YOUR_BEAM_TOKEN>

Deploy the app on Beam cloud:

python start_server.py

This will successfully deploy your streamlit application on Beam cloud.

Copy the generated link and access the app straight from your browser.

Run the app locally (optional):

Or you can also run the app locally by running the following command:

streamlit run app.py

📬 Stay Updated with Our Newsletter!

Get a FREE Data Science eBook 📖 with 150+ essential lessons in Data Science when you subscribe to our newsletter! Stay in the loop with the latest tutorials, insights, and exclusive resources. Subscribe now!

Contribution

Contributions are welcome! Please fork the repository and submit a pull request with your improvements.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Fastest RAG stack with Milvus and Groq

Setup and Installation

📬 Stay Updated with Our Newsletter!

Contribution

FilesExpand file tree

fastest-rag-milvus-groq

Directory actions

More options

Directory actions

More options

Latest commit

History

fastest-rag-milvus-groq

Folders and files

parent directory

README.md

Fastest RAG stack with Milvus and Groq

Setup and Installation

📬 Stay Updated with Our Newsletter!

Contribution