Multi-Agent-RAG

A document question-answering system built on Docling and LangGraph. A multi-agent workflow checks document relevance, drafts an answer from retrieved passages, and verifies that answer before returning it.

Features

Multi-Agent Workflow: Research, verification, and relevance checking agents
Document Processing: Support for PDF, DOCX, TXT, and MD files
Hybrid Retrieval: Combines dense (vector) and sparse (BM25) retrieval
Answer Verification: Built-in fact-checking and verification
Modern Chat Interface: Powered by Chainlit
Session Management: Efficient document caching and state management

Quick Start

Prerequisites

Python 3.10+ (recommended)
uv (for fast dependency management)

Installation

Clone the repository:

git clone <repository-url>
cd Multi-Agent-RAG

Install dependencies using uv:
```
uv pip install -e .
```
If you don’t have uv installed, get it with:
```
pip install uv
```

Running the Chainlit Interface

Start the Chainlit chat interface:

chainlit run chainlit.py

The interface will be available at http://localhost:8000.

Usage

Document Upload

Upload your documents (PDF, DOCX, TXT, MD) in the chat interface.
The system will process and index them automatically.
Documents are cached for efficient reuse.
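
One way to picture the caching step is to key processed chunks by a hash of the file contents, so that re-uploading an identical file skips processing. This is a minimal sketch, assuming a SHA-256 content hash and an in-memory dictionary; `DocumentCache` and `process_fn` are hypothetical names, not the project's actual API.

```python
import hashlib
from typing import Callable, Dict, List


class DocumentCache:
    """In-memory cache keyed by a SHA-256 hash of the file contents (assumed scheme)."""

    def __init__(self) -> None:
        self._store: Dict[str, List[str]] = {}

    @staticmethod
    def key_for(data: bytes) -> str:
        # Identical bytes always produce the same key, regardless of filename.
        return hashlib.sha256(data).hexdigest()

    def get_or_process(
        self, data: bytes, process_fn: Callable[[bytes], List[str]]
    ) -> List[str]:
        # Run the (hypothetical) processing function only on a cache miss.
        key = self.key_for(data)
        if key not in self._store:
            self._store[key] = process_fn(data)
        return self._store[key]
```

Hashing the contents rather than the filename means a renamed copy of the same file still hits the cache, while an edited file with the same name is reprocessed.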

Asking Questions

Type your question in the chat.
The system will:
- Check document relevance
- Research the answer using retrieved documents
- Verify the answer for accuracy
- Provide both the answer and verification report

Architecture

Multi-Agent Workflow

Relevance Checker: Determines if documents can answer the question
Research Agent: Generates initial answers from retrieved documents
Verification Agent: Checks answer accuracy and provides verification report
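
The control flow between these three agents can be sketched in plain Python. This is an illustration only: the project orchestrates the agents with LangGraph in agent/workflow.py, and the function names, result keys, and keyword-overlap stubs below are hypothetical stand-ins for the real LLM-backed agents.

```python
from typing import Dict, List


def check_relevance(question: str, docs: List[str]) -> bool:
    """Stub relevance checker: true if any document shares a word with the question."""
    words = set(question.lower().split())
    return any(words & set(doc.lower().split()) for doc in docs)


def research(question: str, docs: List[str]) -> str:
    """Stub research agent: returns the document with the most word overlap."""
    words = set(question.lower().split())
    return max(docs, key=lambda doc: len(words & set(doc.lower().split())))


def verify(answer: str, docs: List[str]) -> str:
    """Stub verification agent: checks that the draft answer appears in a source."""
    supported = any(answer in doc for doc in docs)
    return "Supported: YES" if supported else "Supported: NO"


def run_workflow(question: str, docs: List[str]) -> Dict[str, str]:
    """Relevance check first, then research, then verification."""
    if not check_relevance(question, docs):
        # Stop early instead of answering from unrelated documents.
        return {"answer": "", "verification": "Skipped: documents are not relevant"}
    answer = research(question, docs)
    return {"answer": answer, "verification": verify(answer, docs)}
```

The early exit on an irrelevant document set is the point of running the relevance checker first: the research and verification steps are only reached when the documents can plausibly answer the question.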

Document Processing

File Handler: Processes various document formats
Chunking: Splits documents into manageable chunks
Embedding: Creates vector representations for retrieval
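
As an illustration of the chunking step, the sketch below splits text into fixed-size character windows that overlap. The function name and the `chunk_size` and `overlap` defaults are assumptions; the project may instead chunk by tokens or by document structure.

```python
from typing import List


def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> List[str]:
    """Split text into character windows of chunk_size that overlap by `overlap`."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once a window reaches the end, to avoid a redundant tail chunk.
        if start + chunk_size >= len(text):
            break
    return chunks
```

The overlap keeps a sentence that straddles a boundary retrievable from at least one chunk, at the cost of storing some text twice.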

Retrieval System

Hybrid Retriever: Combines dense (vector) and sparse (BM25) retrieval
ChromaDB: Vector database for document storage
Ensemble: Merges results from multiple retrieval methods
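
One common way to merge ranked lists from a dense and a sparse retriever is weighted reciprocal rank fusion (RRF). The sketch below assumes that technique; the function name, the weights, and the constant `k = 60` are illustrative, and the project's ensemble step may combine results differently.

```python
from collections import defaultdict
from typing import Dict, List, Sequence


def reciprocal_rank_fusion(
    rankings: Sequence[List[str]],
    weights: Sequence[float],
    k: int = 60,
) -> List[str]:
    """Merge ranked lists of document IDs; a higher fused score ranks first."""
    if len(rankings) != len(weights):
        raise ValueError("one weight per ranking is required")
    scores: Dict[str, float] = defaultdict(float)
    for ranking, weight in zip(rankings, weights):
        for rank, doc_id in enumerate(ranking, start=1):
            # Each list contributes weight / (k + rank) for the documents it returns.
            scores[doc_id] += weight / (k + rank)
    # Sort by score descending; break ties by document ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))
```

Because RRF uses only rank positions, it sidesteps the problem that vector similarity scores and BM25 scores live on different scales.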

Configuration

Settings are managed through the config/ directory:

constants.py: System constants and file type definitions
settings.py: Environment-specific settings
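
Environment-specific settings are often read from environment variables with fallback defaults. The sketch below shows that pattern with a frozen dataclass; the `Settings` fields, the default values, and the `RAG_*` variable names are all hypothetical and do not reflect the actual contents of config/settings.py.

```python
import os
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class Settings:
    """Hypothetical settings object; field names are illustrative."""

    chunk_size: int
    top_k: int
    persist_dir: str


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Build Settings from environment variables, falling back to defaults."""
    return Settings(
        chunk_size=int(env.get("RAG_CHUNK_SIZE", "500")),
        top_k=int(env.get("RAG_TOP_K", "4")),
        persist_dir=env.get("RAG_PERSIST_DIR", "./chroma_db"),
    )
```

Passing the environment as a parameter keeps the loader easy to test with a plain dictionary.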

Supported File Types

PDF (.pdf)
Word documents (.docx)
Text files (.txt)
Markdown files (.md)

Development

Project Structure

Multi-Agent-RAG/
├── agent/                 # Multi-agent workflow
│   ├── workflow.py        # Main workflow orchestration
│   ├── research_agent.py  # Research agent implementation
│   ├── verification_agent.py # Verification agent
│   └── relevance_checker.py  # Relevance checking
├── retriever/             # Document retrieval system
│   ├── builder.py         # Retriever construction
│   └── file_handler.py    # Document processing
├── config/                # Configuration files
├── utils/                 # Utility functions
├── chainlit.py            # Chainlit interface (entrypoint)
├── pyproject.toml         # Project dependencies
└── uv.lock                # uv dependency lockfile

Adding New Agents

Create a new agent class in the agent/ directory
Implement the required interface methods
Add the agent to the workflow in agent/workflow.py
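
The required interface is not spelled out here, so the sketch below assumes a hypothetical one: each agent exposes a `run` method that reads the shared workflow state and returns the keys to update. `Agent`, `SummaryAgent`, `apply_agent`, and the state keys are illustrative names only; check agent/workflow.py for the real contract.

```python
from typing import Any, Dict, Protocol


class Agent(Protocol):
    """Hypothetical agent interface: read the state, return the keys to update."""

    def run(self, state: Dict[str, Any]) -> Dict[str, Any]: ...


class SummaryAgent:
    """Example agent that adds a truncated summary of the draft answer."""

    def __init__(self, max_chars: int = 80) -> None:
        self.max_chars = max_chars

    def run(self, state: Dict[str, Any]) -> Dict[str, Any]:
        answer = state.get("answer", "")
        return {"summary": answer[: self.max_chars]}


def apply_agent(agent: Agent, state: Dict[str, Any]) -> Dict[str, Any]:
    """Merge an agent's update into a copy of the state."""
    return {**state, **agent.run(state)}
```

Returning a partial update rather than mutating the state in place mirrors how graph-based orchestrators commonly merge node outputs, which should make such an agent straightforward to wire in as a workflow step.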

Extending Document Support

Add the new file type to constants.ALLOWED_TYPES (in config/constants.py)
Implement the processing logic for it in retriever/file_handler.py
Update the file-type validation in the Chainlit interface
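
The steps above amount to extending an allow-list and the check that consults it. A sketch, assuming `ALLOWED_TYPES` is a set of lowercase file extensions (its real shape in config/constants.py may differ) and using a hypothetical `is_supported` helper:

```python
from pathlib import Path

# Assumed shape of constants.ALLOWED_TYPES; the real definition may differ.
ALLOWED_TYPES = {".pdf", ".docx", ".txt", ".md"}


def is_supported(filename: str, allowed=ALLOWED_TYPES) -> bool:
    """Return True if the file's extension (case-insensitive) is allowed."""
    return Path(filename).suffix.lower() in allowed
```

Under this assumption, adding a format such as `.pptx` to the set makes the check pass, but the file still needs matching processing logic before it can be indexed.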

Troubleshooting

Common Issues

Document Processing Errors: Ensure files are not corrupted and are in a supported format (PDF, DOCX, TXT, MD)
Memory Issues: Large documents may need more memory than is available by default
Retrieval Performance: Consider adjusting the chunk size or retrieval parameters

Logging

The system uses structured logging. Check logs for detailed error information:

tail -f chainlit.log  # For Chainlit interface
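
Structured logs are typically emitted as one JSON object per line so they can be filtered with standard tools. The sketch below shows that pattern with the standard library only; the `JsonFormatter` class, the `get_logger` helper, and the field names are assumptions, not the project's actual logging setup.

```python
import json
import logging


class JsonFormatter(logging.Formatter):
    """Format each log record as a single JSON object."""

    def format(self, record: logging.LogRecord) -> str:
        payload = {
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
        }
        # Include the traceback text only when an exception was logged.
        if record.exc_info:
            payload["exception"] = self.formatException(record.exc_info)
        return json.dumps(payload)


def get_logger(name: str) -> logging.Logger:
    """Return a logger that writes JSON lines to stderr."""
    logger = logging.getLogger(name)
    if not logger.handlers:  # avoid attaching duplicate handlers on repeat calls
        handler = logging.StreamHandler()
        handler.setFormatter(JsonFormatter())
        logger.addHandler(handler)
    logger.setLevel(logging.INFO)
    return logger
```

A call such as `get_logger("rag").info("indexed %d chunks", 3)` then emits a single JSON line with the level, logger name, and formatted message.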

Acknowledgments

Built with Docling
Powered by LangGraph
Interface: Chainlit
