This project demonstrates a document ingestion pipeline for Retrieval-Augmented Generation (RAG) using the LangChain framework with an Ollama local LLM backend. The pipeline converts local unstructured files into vector embeddings and stores them in a Chroma vector database, enabling semantic search and question answering.
- LangChain: Modular framework for building LLM-powered applications, used for document loading, transformation, and RAG chain creation.
- Ollama + LLM (e.g., Llama 3 or Mistral): Local language model served by Ollama and used as the reasoning engine.
- Chroma: Embedded vector database for storing and retrieving document embeddings.
- FastEmbed: Lightweight, high-speed embedding model for converting text to vector representations.
- Streamlit: Simple interactive front-end for querying the indexed documents.
- Loads PDF and TXT documents from the local `./data` directory.
- Splits documents into manageable chunks using recursive character splitting.
- Generates vector embeddings using `FastEmbedEmbeddings`.
- Stores processed documents and metadata in a persistent Chroma DB.
- Sets up a basic RAG pipeline to answer questions using retrieved chunks and Ollama-powered LLM responses.
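
A minimal sketch of this ingestion flow, assuming the common langchain-community loaders (`DirectoryLoader` with `PyPDFLoader`/`TextLoader`, which additionally require the `pypdf` package for PDFs) and example chunking values; the actual `streamlit_app.py` may organize this differently:

```python
from langchain_community.document_loaders import DirectoryLoader, PyPDFLoader, TextLoader
from langchain_community.embeddings import FastEmbedEmbeddings
from langchain_community.vectorstores import Chroma
from langchain.text_splitter import RecursiveCharacterTextSplitter

# Load PDF and TXT files from the local ./data directory.
pdf_docs = DirectoryLoader("./data", glob="**/*.pdf", loader_cls=PyPDFLoader).load()
txt_docs = DirectoryLoader("./data", glob="**/*.txt", loader_cls=TextLoader).load()
documents = pdf_docs + txt_docs

# Split documents into overlapping chunks for retrieval (sizes are example values).
splitter = RecursiveCharacterTextSplitter(chunk_size=1024, chunk_overlap=100)
chunks = splitter.split_documents(documents)

# Embed the chunks with FastEmbed and persist them to a local Chroma store.
vector_store = Chroma.from_documents(
    documents=chunks,
    embedding=FastEmbedEmbeddings(),
    persist_directory="./chroma_db",
)
```
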
- Install dependencies:

  ```bash
  pip install langchain langchain-community chromadb fastembed streamlit
  ```
- Start Ollama with a supported model. Install Ollama if needed:

  ```bash
  curl -fsSL https://ollama.com/install.sh | sh
  ```

  Make sure Ollama is running and the model (e.g., llama3) is downloaded:

  ```bash
  ollama run llama3
  ```
- Run the Streamlit app:

  ```bash
  streamlit run streamlit_app.py
  ```
```
.
├── streamlit_app.py   # Main retrieval pipeline logic
├── data/              # Directory with PDF and TXT documents
└── chroma_db/         # Auto-created directory for Chroma vector store
```
After ingesting Golden Visa documents, you can query:
"What are the requirements for Portugal's Golden Visa?"
The system will retrieve semantically relevant document chunks and generate a response using the local LLM.
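
A sketch of this question-answering step, assuming the persisted Chroma store from ingestion, the `Ollama` LLM wrapper, and the classic `RetrievalQA` chain from LangChain; the model name and retriever settings are illustrative:

```python
from langchain.chains import RetrievalQA
from langchain_community.embeddings import FastEmbedEmbeddings
from langchain_community.llms import Ollama
from langchain_community.vectorstores import Chroma

# Reopen the persisted Chroma store built during ingestion.
vector_store = Chroma(
    persist_directory="./chroma_db",
    embedding_function=FastEmbedEmbeddings(),
)
retriever = vector_store.as_retriever(search_kwargs={"k": 3})

# Local LLM served by Ollama (the model must already be downloaded, e.g. via `ollama run llama3`).
llm = Ollama(model="llama3")

# "stuff" chain: retrieved chunks are inserted into the prompt and answered by the LLM.
qa_chain = RetrievalQA.from_chain_type(llm=llm, retriever=retriever, chain_type="stuff")

result = qa_chain.invoke({"query": "What are the requirements for Portugal's Golden Visa?"})
print(result["result"])
```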

