Replies: 1 comment
-
|
Totally agree—OpenSearch as a vector store backend would make LiteLLM way more flexible for RAG projects. We’ve deployed OpenSearch in production for hybrid search setups and it scales nicely, offers real-time indexing, and handles both dense (vector) and sparse (BM25) retrieval. Key implementation points:
from opensearchpy import OpenSearch
client = OpenSearch(...)
doc = {
"text": "foo bar",
"embedding": [0.1, 0.2, ...], # float32 vector
"metadata": {"author": "alice"}
}
client.index(index="docs", body=doc)We’ve found OpenSearch much easier to integrate than dedicated vector DBs in enterprise environments—no extra infra, just extend existing clusters. Supporting hybrid retrieval is a huge win for real-world RAG, especially when users expect both semantic and keyword relevance. Happy to share more integration details if needed. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Motivation
OpenSearch is widely used for document search and can support RAG use cases with:
This would help connect LiteLLM with existing OpenSearch-based RAG infrastructures and the broader OpenRAG ecosystem.
Proposed Feature
Add an OpenSearch vector store backend that can:
Example Use Case
A team already has documents indexed in OpenSearch and wants to use LiteLLM as a single gateway for LLMs, embeddings, and RAG retrieval without adding another vector database.
Benefit
This would make LiteLLM easier to adopt in enterprise RAG setups where OpenSearch is already used through for instance the openrag
Thanks for considering this feature.
Beta Was this translation helpful? Give feedback.
All reactions