A Retrieval Augmented Generation (RAG) chat application that allows users to upload PDF documents and ask questions about their content. The app uses FastAPI for the backend, Qdrant as the vector database, OpenAI for chat completions, and Next.js for the frontend.
- 📄 PDF Upload: Upload PDF documents to build your knowledge base with progress tracking
- 💬 Interactive Chat: Ask questions about your uploaded documents
- 🔍 Source Attribution: See which documents were used to answer your questions
- 🚀 Real-time Processing: Fast document ingestion and retrieval
- 🎯 Semantic Search: Find relevant information using vector similarity
- 📊 Status Indicators: Real-time backend and database connection status
- 🔄 Progress Tracking: Visual upload and processing progress bars
- ⚡ Health Monitoring: Automatic system health checks every 30 seconds
- 📈 RAG Evaluation: Comprehensive evaluation system with 5 quality metrics:
  - Relevance: How well the response addresses the user's query
  - Faithfulness: Whether the response is grounded in retrieved context
  - Completeness: How thoroughly the response addresses the query
  - Clarity: How clear and well-structured the response is
  - Retrieval Quality: Quality and diversity of retrieved contexts
- 🎛️ Metrics Toggle: Enable/disable evaluation for each chat interaction
- 📊 Evaluation Dashboard: View evaluation summary and trends in the sidebar
- 📚 Document CRUD Management: Complete document lifecycle management
  - Create: Upload and process PDF documents with detailed metadata
  - Read: View comprehensive document details, statistics, and chunk previews
  - Update: Modify document metadata and properties
  - Delete: Remove documents and all associated chunks with confirmation
- 📋 Document Statistics: Real-time overview of collection statistics
- 🔍 Document Details Modal: In-depth view of document structure and chunks
- 💾 Enhanced Metadata: File size, page count, processing date, and chunk information
- Frontend: Next.js 15 with TypeScript and Tailwind CSS
- Backend: FastAPI with Python 3.13.0
- Vector Database: Qdrant (local Docker instance)
- Embeddings: OpenAI text-embedding-ada-002 (1536 dimensions)
- LLM: OpenAI GPT models
- PDF Processing: PyPDF2
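To see how these pieces fit together, here is a minimal sketch of the retrieval flow: embed the question with OpenAI, search Qdrant for similar chunks, then ask a GPT model to answer from those chunks. It is an illustration only, not the app's backend code; the payload field name (`text`) and the chat model (`gpt-4o-mini`) are assumptions.

```python
# Minimal sketch of the retrieval flow; not the app's actual backend code.
from openai import OpenAI
from qdrant_client import QdrantClient

openai_client = OpenAI()  # reads OPENAI_API_KEY from the environment
qdrant = QdrantClient(host="localhost", port=6333)

def answer(question: str, collection: str = "documents", top_k: int = 5) -> str:
    # 1. Embed the question (text-embedding-ada-002 produces a 1536-dim vector).
    vector = openai_client.embeddings.create(
        model="text-embedding-ada-002", input=question
    ).data[0].embedding

    # 2. Retrieve the most similar chunks from Qdrant.
    hits = qdrant.search(collection_name=collection, query_vector=vector, limit=top_k)
    context = "\n\n".join((hit.payload or {}).get("text", "") for hit in hits)

    # 3. Ask a GPT model to answer using only the retrieved context.
    #    The model name and payload field "text" are assumptions.
    completion = openai_client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[
            {"role": "system", "content": "Answer using only the provided context."},
            {"role": "user", "content": f"Context:\n{context}\n\nQuestion: {question}"},
        ],
    )
    return completion.choices[0].message.content or ""
```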
- Python 3.12+ (3.13.0 recommended)
- Node.js 18+
- Docker (for Qdrant)
- OpenAI API key
```bash
# Navigate to backend directory
cd backend

# Create virtual environment
python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install Python dependencies
pip install -r requirements.txt

# Configure environment variables
cp .env .env.local

# Edit .env.local and add your OpenAI API key:
# OPENAI_API_KEY=your_openai_api_key_here
```

```bash
# From the backend directory
./start_qdrant.sh
```

This will start Qdrant in a Docker container on port 6333. The web UI will be available at http://localhost:6333/dashboard.
```bash
# From the backend directory
./start_server.sh
```

The FastAPI server will start on http://localhost:8000. You can view the API documentation at http://localhost:8000/docs.
```bash
# From the root directory
npm install
npm run dev
```

The Next.js frontend will start on http://localhost:3000.
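Before moving on, it can help to confirm that the backend and Qdrant are reachable. A quick check, assuming the default local ports and that the `requests` package is installed:

```python
# Quick reachability check; default local ports assumed (pip install requests).
import requests

backend = requests.get("http://localhost:8000/health", timeout=5)
print("Backend /health:", backend.status_code, backend.text)

# Qdrant's REST API also runs on port 6333; listing collections confirms it is up.
qdrant = requests.get("http://localhost:6333/collections", timeout=5)
print("Qdrant /collections:", qdrant.status_code)
```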
- Start all services: Make sure Qdrant, the FastAPI backend, and Next.js frontend are all running
- Upload PDFs: Click the "Upload PDF" button in the top right to upload your documents
- Manage documents: Use the enhanced document sidebar to:
  - View document statistics and metadata
  - Click "View" to see detailed document information and chunk previews
  - Click "Delete" to remove documents (with confirmation)
- Enable evaluation (optional): Click the "Metrics" button to enable RAG evaluation for quality assessment
- Ask questions: Type your questions in the chat interface
- View sources: See which documents were used to answer your questions
- Review metrics: When evaluation is enabled, view quality scores for each response and overall trends in the sidebar
The enhanced document management system provides:
- Document Statistics: Real-time overview showing total documents, chunks, file sizes, and text length
- Detailed Document Cards: Each document shows file size, page count, chunk information, and processing date
- Document Details Modal: Click "View" on any document to see:
  - Complete metadata (filename, size, pages, processing date)
  - Chunk statistics and distribution
  - Preview of document chunks with text snippets
  - Individual chunk information (index, length, creation date)
- Safe Deletion: Delete documents with confirmation dialog
- Automatic Refresh: Statistics and document list update automatically after uploads/deletions
The RAG evaluation system provides detailed quality assessment:
- Individual Message Scores: Each response shows scores for all 5 metrics when evaluation is enabled
- Overall Score: Combined score across all metrics for quick quality assessment
- Evaluation Summary: Sidebar shows total evaluations, average scores, and recent trends
- Color-coded Indicators: Visual representation of score quality (red: 0-0.4, yellow: 0.4-0.7, green: 0.7-1.0)
- Historical Tracking: System maintains evaluation history for trend analysis
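The scores are easiest to interpret with a concrete example. Below is a rough sketch, assuming each metric is a 0-1 score and the overall score is their average; the real field names and aggregation rule may differ.

```python
# Illustrative sketch only; field names and the averaging rule are assumptions.
from dataclasses import dataclass

@dataclass
class EvaluationResult:
    relevance: float          # how well the response addresses the query
    faithfulness: float       # grounding in the retrieved context
    completeness: float       # how thoroughly the query is covered
    clarity: float            # structure and readability of the response
    retrieval_quality: float  # quality and diversity of retrieved contexts

    @property
    def overall(self) -> float:
        scores = (self.relevance, self.faithfulness, self.completeness,
                  self.clarity, self.retrieval_quality)
        return sum(scores) / len(scores)

    @property
    def color(self) -> str:
        # Matches the color bands used by the UI indicators.
        if self.overall < 0.4:
            return "red"
        if self.overall < 0.7:
            return "yellow"
        return "green"

result = EvaluationResult(0.9, 0.8, 0.7, 0.85, 0.75)
print(result.overall, result.color)  # 0.8 green
```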
The FastAPI backend exposes the following endpoints:
- `POST /chat` - Send a chat message and get a response with sources
  - Optional `evaluate: boolean` parameter to enable evaluation
- `POST /upload-pdf` - Upload and process a PDF file
- `GET /documents` - List all documents with detailed metadata
- `GET /documents/{document_name}` - Get detailed information about a specific document
- `PUT /documents/{document_name}/metadata` - Update metadata for a document
- `DELETE /documents/{document_name}` - Delete a document and its chunks
- `GET /documents/statistics/overview` - Get overall collection statistics
- `GET /health` - System health check endpoint
- `GET /evaluation/summary` - Get evaluation metrics summary and trends
- `GET /evaluation/history` - Get detailed evaluation history
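For instance, a chat request with evaluation enabled might look like the sketch below. The request body field names (`message`, `evaluate`) and the response shape are assumptions; check http://localhost:8000/docs for the actual schema.

```python
# Example call to the chat endpoint; body field names are assumptions.
import requests

resp = requests.post(
    "http://localhost:8000/chat",
    json={"message": "What does the uploaded report say about Q3 revenue?",
          "evaluate": True},
    timeout=60,
)
resp.raise_for_status()
data = resp.json()
# Expected to include the answer, the source documents, and (when evaluation
# is enabled) the per-metric quality scores.
print(data)
```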
You can modify the following environment variables in `backend/.env`:
- `OPENAI_API_KEY` - Your OpenAI API key
- `QDRANT_HOST` - Qdrant host (default: localhost)
- `QDRANT_PORT` - Qdrant port (default: 6333)
- `COLLECTION_NAME` - Qdrant collection name (default: documents)
- `EMBEDDING_MODEL` - Sentence transformer model (default: all-MiniLM-L6-v2)
- `CHUNK_SIZE` - Text chunk size for processing (default: 1000)
- `CHUNK_OVERLAP` - Overlap between chunks (default: 200)
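To make `CHUNK_SIZE` and `CHUNK_OVERLAP` concrete, here is a minimal sliding-window chunker using the defaults above; the app's actual splitting logic may differ (for example, it may respect sentence or page boundaries).

```python
# Minimal sliding-window chunker illustrating CHUNK_SIZE / CHUNK_OVERLAP;
# the app's real splitting logic may differ.
def chunk_text(text: str, chunk_size: int = 1000, chunk_overlap: int = 200) -> list[str]:
    chunks = []
    step = chunk_size - chunk_overlap  # each chunk starts 800 characters after the previous
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks

sample_text = "example " * 1000  # placeholder for text extracted from a PDF
print(len(chunk_text(sample_text)))
```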
For development, you can run the servers manually with auto-reload:

```bash
# Backend (with auto-reload)
cd backend
source venv/bin/activate
uvicorn main:app --host 0.0.0.0 --port 8000 --reload
```

```bash
# Frontend
npm run dev
```

Common issues:
- Qdrant connection error: Make sure Docker is running and the Qdrant container is started
- OpenAI API errors: Check that your API key is correctly set in the `.env` file
- PDF upload errors: Ensure the uploaded file is a valid PDF
- Port conflicts: Make sure ports 3000, 6333, and 8000 are available
- Backend logs: Check the terminal where the FastAPI server is running
- Qdrant logs: `docker logs qdrant-rag`
- Frontend logs: Check the browser console and Next.js terminal
To contribute:
- Fork the repository
- Create a feature branch
- Make your changes
- Test thoroughly
- Submit a pull request
This project is open source and available under the MIT License.