This project implements multimodal RAG (retrieval-augmented generation) with DeepSeek's latest model, Janus-Pro.
We use the following tools:
- DeepSeek-Janus-Pro as the multi-modal LLM
- ColPali as the vision encoder
- Qdrant as the vector database
- Streamlit as the web interface
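To illustrate how the retrieval side of such a stack can work, here is a minimal sketch of late-interaction (MaxSim) scoring, the multi-vector matching scheme that ColPali-style encoders are designed around. It is not this project's code: the toy vectors, the in-memory `index` dictionary, and the function names `maxsim_score` and `retrieve` are all hypothetical stand-ins. In the real app the embeddings would come from ColPali and the search would presumably be delegated to Qdrant.

```python
from typing import Dict, List, Sequence, Tuple

Vector = Sequence[float]
MultiVector = Sequence[Vector]  # one vector per image patch or query token


def dot(a: Vector, b: Vector) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def maxsim_score(query_vecs: MultiVector, page_vecs: MultiVector) -> float:
    """Late-interaction score: for each query vector, take its best match
    among the page vectors, then sum those maxima."""
    return sum(max(dot(q, p) for p in page_vecs) for q in query_vecs)


def retrieve(
    query_vecs: MultiVector,
    index: Dict[str, MultiVector],
    top_k: int = 1,
) -> List[Tuple[str, float]]:
    """Rank every page in a (hypothetical) in-memory index by MaxSim score."""
    scored = [(page_id, maxsim_score(query_vecs, vecs)) for page_id, vecs in index.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:top_k]


# Toy data standing in for real embeddings (assumption: 2-dimensional vectors).
index = {
    "page-1": [[1.0, 0.0], [0.0, 1.0]],
    "page-2": [[0.5, 0.5], [0.5, 0.5]],
}
query = [[1.0, 0.0], [0.0, 1.0]]

top_pages = retrieve(query, index, top_k=1)
print(top_pages)
```

The top-ranked pages would then be passed, together with the user's question, to the multimodal LLM to generate an answer.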
A demo of the project is available below:
Set up Janus:
git clone https://github.com/deepseek-ai/Janus.git
pip install -e ./Janus
Install Dependencies: Ensure you have Python 3.11 or later installed.
pip install streamlit fastembed flash-attn transformers
Finally, run the project with the following command:
streamlit run app.py
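If the app fails to start, a quick environment check can help narrow things down. The sketch below verifies the Python version requirement stated above and reports which packages cannot be found. The helper names `python_ok` and `missing_modules` are hypothetical, and the import names in `REQUIRED` (for example `flash_attn` for `flash-attn`, `janus` for the Janus repository) are assumptions about how each package is imported.

```python
import importlib.util
import sys
from typing import Iterable, List, Tuple

# Assumed top-level import names for the packages installed above.
REQUIRED = ["streamlit", "fastembed", "flash_attn", "transformers", "janus"]


def python_ok(version: Tuple[int, ...], minimum: Tuple[int, int] = (3, 11)) -> bool:
    """Return True if the given version tuple meets the minimum (major, minor)."""
    return tuple(version[:2]) >= minimum


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be located for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    if not python_ok(tuple(sys.version_info)):
        print("Python 3.11 or later is required; found", sys.version.split()[0])
    missing = missing_modules(REQUIRED)
    if missing:
        print("Not installed:", ", ".join(missing))
    else:
        print("All required packages were found.")
```

Saving this as a standalone script (for example a hypothetical `check_env.py`) and running it with the same interpreter used for `streamlit` shows whether the install steps completed.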
Contributions are welcome! Please fork the repository and submit a pull request with your improvements.
