Merged
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -235,4 +235,4 @@ This course emphasizes:

## License

This is educational material for the "Systematically Improving RAG Applications" course.
This is educational material for the "Systematically Improving RAG Applications" course.
5 changes: 0 additions & 5 deletions cohort_1/README.md
@@ -30,8 +30,3 @@ The `latest/` directory contains:

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions cohort_1/week1_bootstrap_evals/README.md
@@ -10,8 +10,3 @@ Finally, we calculate metrics on these questions in `metrics.ipynb`. This uses a

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
@@ -30,8 +30,3 @@ The `RAGClassifier` is a classifier that uses a retreival model to classify user

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
@@ -8,8 +8,3 @@ Because you'll want data when first exploring this code, the notebook uses synth

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions cohort_1/week3_seperate_indices/README.md
@@ -4,8 +4,3 @@ It continues the example of a hardware retailer building a system to answer user

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions cohort_1/week4_routing/README.md
@@ -6,8 +6,3 @@ This directory shows how to test whether we are retrieving the correct tools. It

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions cohort_1/week5_fine_tuning/README.md
@@ -10,8 +10,3 @@ The key files in this directory are:

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions cohort_2/README.md
@@ -200,8 +200,3 @@ Want to learn more about RAG? Take our free email course and get the latest news

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions cohort_2/office-hours/README.md
@@ -129,8 +129,3 @@ The script handles various transcript file formats and naming patterns, includin

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
4 changes: 0 additions & 4 deletions cohort_2/office-hours/week1-summary.md
@@ -324,8 +324,4 @@ Even with one-to-one mappings, MRR (Mean Reciprocal Rank) is still useful to see

## What really matters isn't the absolute number but whether we can move these metrics in a positive direction with our interventions. It's like weighing yourself—the absolute number may vary by scale, but if you've gained two pounds, you've definitely gained two pounds.

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions cohort_2/office-hours/week2-summary.md
@@ -332,8 +332,3 @@ This approach helps ensure reliability across different types of function callin

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
4 changes: 0 additions & 4 deletions cohort_2/office-hours/week3-summary.md
@@ -258,8 +258,4 @@ Even systems like Deep Research might evolve to pull from sources you tend to ag

## _Note: I'll continue to add resources and notes from future office hours sessions_

--8<--
"snippets/enrollment-button.md"
--8<--

---
4 changes: 0 additions & 4 deletions cohort_2/office-hours/week3/week3-summary.md
@@ -258,8 +258,4 @@ Even systems like Deep Research might evolve to pull from sources you tend to ag

## _Note: I'll continue to add resources and notes from future office hours sessions_

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions cohort_2/office-hours/week4-summary.md
@@ -347,8 +347,3 @@ The protocol is still evolving but shows promise for creating more powerful AI s

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions cohort_2/week0/README.md
@@ -62,8 +62,3 @@ These notebooks provide the essential tools and concepts you'll use throughout t

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions cohort_2/week0/uv.md
@@ -104,8 +104,3 @@ When working with virtual environments, we recommend the following best practice

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions cohort_2/week1/README.md
@@ -82,8 +82,3 @@ This approach ensures you invest in retrieval techniques that provide measurable

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
4 changes: 0 additions & 4 deletions cohort_2/week2/README.md
@@ -71,8 +71,4 @@ These notebooks demonstrate two distinct approaches to improving embedding perfo

## Both approaches show significant performance gains (15-30% in MRR and recall), highlighting that domain-specific fine-tuning is one of the most effective ways to improve RAG system performance.

--8<--
"snippets/enrollment-button.md"
--8<--

---
4 changes: 0 additions & 4 deletions cohort_2/week4/README.md
@@ -72,8 +72,4 @@ Together, these notebooks provide a systematic approach to understanding and imp

## This data-driven approach helps identify which query areas need improvement most urgently, allowing targeted enhancements rather than making random changes.

--8<--
"snippets/enrollment-button.md"
--8<--

---
4 changes: 0 additions & 4 deletions cohort_2/week5/README.md
@@ -90,8 +90,4 @@ These notebooks move beyond simple semantic search to address complex real-world

## By combining vector search with structured filtering, SQL access, and document parsing, you'll build RAG systems capable of handling sophisticated user needs that pure semantic search can't address alone.

--8<--
"snippets/enrollment-button.md"
--8<--

---
4 changes: 0 additions & 4 deletions cohort_2/week6/README.md
@@ -73,8 +73,4 @@ Modern RAG systems increasingly rely on tool orchestration beyond simple retriev

## This approach demonstrates how simple prompting changes can dramatically improve tool selection, enabling RAG systems to coordinate multiple specialized capabilities rather than relying on retrieval alone.

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions docs/blog.md
@@ -256,8 +256,3 @@ For a deeper dive into these concepts, check out the free 6-week email course on

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions docs/office-hours/AGENTS.md
@@ -60,8 +60,3 @@ topics:

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions docs/office-hours/cohort2/week1-summary.md
@@ -354,8 +354,3 @@ What really matters isn't the absolute number but whether we can move these metr

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions docs/office-hours/cohort2/week2-summary.md
@@ -360,8 +360,3 @@ This approach helps ensure reliability across different types of function callin

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions docs/office-hours/cohort2/week3-summary.md
@@ -288,8 +288,3 @@ _Note: I'll continue to add resources and notes from future office hours session

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions docs/office-hours/cohort2/week4-summary.md
@@ -372,8 +372,3 @@ The protocol is still evolving but shows promise for creating more powerful AI s

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions docs/office-hours/cohort2/week5-summary.md
@@ -186,8 +186,3 @@ Follow-up prompts at the end of interactions help maintain momentum. By asking w

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions docs/office-hours/cohort2/week6-summary.md
@@ -239,8 +239,3 @@ For most organizations building AI applications, starting with a specific domain

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions docs/office-hours/cohort3/week-4-2.md
@@ -129,8 +129,3 @@ Beyond the course, I'm organizing a speaker series with guests from OpenAI's mem

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions docs/office-hours/cohort3/week-5-1.md
@@ -270,8 +270,3 @@ Newer models with improved attention mechanisms handle long contexts better than

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions docs/office-hours/cohort3/week-5-2.md
@@ -197,8 +197,3 @@ Not entirely. Even as models improve at reasoning, having a portfolio of special

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
6 changes: 0 additions & 6 deletions docs/office-hours/faq.md
@@ -409,11 +409,6 @@ What really matters isn't the absolute number but whether we can move these metr

---

--8<--
"snippets/enrollment-button.md"
--8<--

---

## How would you evaluate the effect of different parsing strategies in RAG, notably on documents with weird layouts, tables, and charts?

@@ -3921,4 +3916,3 @@ Found an error or want to suggest improvements to these FAQs? The source files a

---

---
3 changes: 0 additions & 3 deletions docs/snippets/README.md
@@ -9,9 +9,6 @@ This directory contains reusable content snippets that can be included in any ma
Simple enrollment button without additional styling:

```markdown
--8<--
"snippets/enrollment-button.md"
--8<--
```

### 2. `enrollment-section.md`
5 changes: 0 additions & 5 deletions docs/talks/AGENTS.md
@@ -116,8 +116,3 @@ Common tags include: RAG, coding agents, embeddings, evaluation, feedback system

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
4 changes: 0 additions & 4 deletions docs/talks/chromadb-anton-chunking.md
@@ -107,8 +107,4 @@ The ChromaDB team is developing better tooling to help with this process, but in

## By focusing on these fundamentals rather than blindly applying frameworks or following defaults, you can significantly improve the performance of your RAG applications and deliver better results to your users.

--8<--
"snippets/enrollment-button.md"
--8<--

---
4 changes: 0 additions & 4 deletions docs/talks/colin-rag-agents.md
@@ -185,8 +185,4 @@ For the SWE-Bench coding evaluation, embedding models didn't significantly impro

## This doesn't mean embedding models aren't valuable—they become essential for larger codebases, less structured content, or more complex retrieval tasks.

--8<--
"snippets/enrollment-button.md"
--8<--

---
@@ -190,8 +190,3 @@ Contextual chunk rewriting involves adding context to document chunks to improve

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
5 changes: 0 additions & 5 deletions docs/talks/extend-document-automation.md
@@ -249,8 +249,3 @@ This approach delivers business value faster while building organizational comfo

---

--8<--
"snippets/enrollment-button.md"
--8<--

---
4 changes: 0 additions & 4 deletions docs/talks/fine-tuning-rerankers-embeddings-ayush-lancedb.md
@@ -231,8 +231,4 @@ Start with a small experiment using a lightweight model that you can train quick

## Multimodal retrieval is likely the next frontier. While much work has been done on text retrieval, there's still significant room for improvement in retrieving and ranking content across different modalities like images, audio, and video. Building better benchmarks and baselines for multimodal RAG systems represents an important area for future development.

--8<--
"snippets/enrollment-button.md"
--8<--

---
6 changes: 0 additions & 6 deletions docs/talks/glean-manav.md
@@ -182,10 +182,4 @@ Glean's approach is refreshingly pragmatic. They've learned that the path to hig
The emphasis on company-specific language models is particularly noteworthy - this is an area where many companies struggle when they try to apply generic embedding models to their unique terminology and document structures.

**Key Takeaway:** Successful enterprise search requires a pragmatic approach that combines custom embedding models, unified data architecture, hybrid search techniques, and continuous learning from user feedback rather than relying solely on off-the-shelf solutions.

## --8<--

"snippets/enrollment-button.md"
--8<--

---
4 changes: 0 additions & 4 deletions docs/talks/john-lexical-search.md
@@ -115,8 +115,4 @@ The most promising direction appears to be hybrid approaches that leverage the s

## **_Key Takeaway:_** The debate isn't really about lexical versus semantic search, but rather how to effectively combine them to create retrieval systems that are both precise and understanding - offering both the filtering power of traditional search and the semantic comprehension of modern embedding-based approaches.

--8<--
"snippets/enrollment-button.md"
--8<--

---