A LangGraph implementation of the CRAG (Corrective Retrieval-Augmented Generation) paper: a self-correcting RAG pipeline that evaluates retrieved documents before using them for generation.
Instead of blindly trusting every retrieved document, CRAG adds a quality gate that scores each document and decides whether to use it, discard it, search the web, or do both.
- Retrieve docs from FAISS vector store
- Evaluate each doc with a lightweight LLM (Llama 3.2)
- Route based on verdict:
  - Correct → Refine docs → Generate
  - Incorrect → Rewrite query → Web search → Refine → Generate
  - Ambiguous → Refine docs + Web search → Generate
- Refine by decomposing docs into sentence-level strips and filtering out irrelevant ones
- Generate the final answer using GPT-4o
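The evaluate → route → refine steps above can be sketched in plain Python. The thresholds, function names, and the `is_relevant` callable are illustrative, not taken from the notebook — there, the equivalent logic is wired into LangGraph conditional edges, and strip filtering is done by GPT-4o:

```python
import re


def aggregate_verdict(doc_scores: list[float], upper: float = 0.7, lower: float = 0.3) -> str:
    """Collapse per-document relevance scores into one CRAG verdict.

    Thresholds are illustrative: if the best document clears `upper`,
    retrieval is trusted; if even the best one falls below `lower`, the
    docs are discarded in favour of web search; in between is ambiguous.
    """
    best = max(doc_scores)
    if best >= upper:
        return "correct"
    if best <= lower:
        return "incorrect"
    return "ambiguous"


def route(verdict: str) -> list[str]:
    """Map a verdict to the sequence of downstream pipeline nodes."""
    return {
        "correct": ["refine", "generate"],
        "incorrect": ["rewrite_query", "web_search", "refine", "generate"],
        "ambiguous": ["refine", "web_search", "generate"],
    }[verdict]


def refine(doc_text: str, is_relevant) -> str:
    """Decompose a document into sentence-level strips and keep only the
    relevant ones; `is_relevant` stands in for the LLM strip filter."""
    strips = re.split(r"(?<=[.!?])\s+", doc_text.strip())
    return " ".join(s for s in strips if is_relevant(s))
```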
- LangGraph — Pipeline orchestration with conditional routing
- FAISS — Vector store for document retrieval
- OpenAI (GPT-4o) — Generation and strip filtering
- Ollama (Llama 3.2:3b) — Lightweight local model for document evaluation
- Tavily — Web search fallback
- LangChain — Chains, prompts, and document handling
```shell
uv init .
uv add faiss-cpu huggingface ipykernel langchain langchain-community langchain-ollama langchain-openai langchain-tavily langchain-text-splitters pypdf python-dotenv
```

Create a `.env` file in the root directory:

```
OPENAI_API_KEY=your_openai_key
TAVILY_API_KEY=your_tavily_key
```

Pull the local evaluation model:

```shell
ollama pull llama3.2:3b
```

Place your PDF in the `./docs/` folder. The notebook uses `harrypotter.pdf` by default.
Open the Jupyter notebook and run all cells top to bottom. The first run takes a while to load and embed the PDF; subsequent runs reuse the cached index.
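The caching behaviour can be sketched as a small helper that checks for an on-disk index before re-embedding. The helper and its callables are illustrative; in the notebook they would wrap `FAISS.from_documents`, `save_local`, and `load_local` from `langchain_community`:

```python
from pathlib import Path


def load_or_build_index(index_dir, build, save, load):
    """Return a cached index if one exists on disk, otherwise build and cache it.

    `build`, `save`, and `load` are caller-supplied callables (illustrative):
    building embeds the PDF (slow, first run only); loading is a cache hit.
    """
    path = Path(index_dir)
    if path.exists():
        return load(path)  # cache hit: skip re-embedding the PDF
    index = build()        # first run: embed the PDF
    path.mkdir(parents=True, exist_ok=True)
    save(index, path)
    return index
```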