Skip to content

Commit c89af58

Browse files
committed
reorganize intro slides: improve structure, remove double headers, drop OpenAI Dev Day reference and 'What's Changed' section
1 parent 7a0117e commit c89af58

File tree

1 file changed

+65
-82
lines changed

1 file changed

+65
-82
lines changed

intro-slides/slides.md

Lines changed: 65 additions & 82 deletions
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@ slideNumber: true
2020

2121
# Systematically Improving RAG Applications
2222

23-
September 16, 2025, `maven.com/applied-llms/rag-playbook`
23+
November 19, 2025
2424

2525
<br>
2626

@@ -36,18 +36,16 @@ By Jason Liu
3636

3737
---
3838

39-
# Todays Plan
39+
# Today's Plan
4040

4141
- Introductions
4242
- About me; consulting and training
43-
- Who are you?
44-
- Format & office hours
43+
- Course format & logistics
4544
- Key insights & course outcomes
46-
- What’s changed since v1 (2024)
4745
- Syllabus (Sessions 0–3)
4846
- Syllabus (Sessions 4–6)
49-
- Resources & contributions
50-
- Q&A and next steps
47+
- Resources & next steps
48+
- Q&A
5149

5250
---
5351

@@ -63,7 +61,7 @@ By Jason Liu
6361
- CV + multimodal retrieval
6462
- VAEs/GANs for GenAI
6563
- ~$50M incremental revenue
66-
- led ~$400K/yr data curation for next‑gen models
64+
- Led ~$400K/yr data curation for next‑gen models
6765

6866
---
6967

@@ -72,40 +70,53 @@ By Jason Liu
7270
- Personal note: Hand injury (2021–2022) → shifted focus to higher‑leverage teaching and advising
7371
- Consulting (2023–present): Query understanding, prompts, embedding search, fine‑tuning, MLOps/observability; upgrading legacy workflows to agentic systems
7472
- Clients: HubSpot, Zapier, Limitless, and others across assistants, construction, research
73+
- Recently: helping with medical triage AI; advising startups that do observability
7574

7675
---
7776

78-
# Format & Office Hours
77+
# Course Format
7978

80-
- Inverted classroom: ~6 hours pre‑recorded lectures + tutorial videos + Jupyter exercises
81-
- Slack: post questions in the cohort channel for async help
82-
- Welcome to share your learnings online via linked in or twitter (please link back to us some how)
79+
**This is a 3-week accelerated version of the course.**
8380

84-
<v-click>
81+
- Inverted classroom: ~5 hours pre‑recorded lectures + tutorial videos + Jupyter exercises
82+
- I recommend watching two sessions per week
8583

86-
## Office hours:
84+
---
85+
86+
# Office Hours & Community
87+
88+
- **Office hours**: Bring your problems, introduce yourself
89+
- Treat it like a tech‑lead review of your work
90+
- Cameras on is really appreciated! Helps me a lot.
91+
- Guest lectures: 1-2 times a week, practitioners actively building in the space
92+
- **Slack**: For any questions about code or anything else, please post on the Slack
93+
- **Sharing**: Welcome to share your learnings online via LinkedIn or Twitter (please link back to us somehow)
8794

88-
- bring your problems, introduce yourself.
89-
- treat it like a tech‑lead review of your work
90-
- cameras on is really appreciated! helps me a lot.
91-
- Guest lectures: 1-2 times a week, practitioners actively building in the space.
95+
---
9296

93-
</v-click>
97+
# Logistics & Support
9498

95-
<v-click>
99+
- **Scheduling**: Occasional reschedules; advance notice
100+
- **Credits/support**: Contact Marian — [email protected]
101+
- **Fit concerns**: If you feel like it's not a really good fit for you, just message me and we can figure out how we can make this better for you
96102

97-
## Logistics
103+
---
98104

99-
- Scheduling: occasional reschedules (e.g., OpenAI Dev Day); advance notice
100-
- Credits/support: contact Marian — support at jxnl.co ([email protected])
105+
# Upcoming Talks This Season
101106

102-
</v-click>
107+
- Understanding tool called hallucinations: how LangChain themselves have been thinking about building agents
108+
- Dropbox talk: how they think about knowledge graphs, DSPy, and a bunch of new topics
109+
- Stay tuned to Slack to figure out when these events will happen
103110

104111
---
105112

106-
# Key Insights & Course Outcomes
113+
# What You'll Learn
107114

108-
This course will give you the foundations and practical skills to build, evaluate, and operate retrieval-augmented generation (RAG) systems. Here’s what to keep in mind and what you’ll learn:
115+
This course will give you the foundations and practical skills to build, evaluate, and operate retrieval-augmented generation (RAG) systems.
116+
117+
---
118+
119+
# Core Principles
109120

110121
### Keep these in mind
111122

@@ -120,104 +131,76 @@ This course will give you the foundations and practical skills to build, evaluat
120131

121132
# The Models are Good but Context Is the Bottleneck
122133

123-
<v-click>
124-
125134
- Models are already very capable for work.
126135
- Even if models hold steady, apps can still improve.
127136
- With the right context, success rates are very high.
128137
- The real challenge is getting that context — the R in retrieval.
129138

130-
</v-click>
131-
132-
---
133-
134-
# What’s Changed Since v1 (2024)
135-
136-
Lets talk more about this in the office hours!
137-
138-
- Since Claude 3.5, tools/agents are much more reliable; planning loops keep improving
139-
- Code assistants (e.g., Claude Code) excel at code RAG flows: grep + edit with validation
140-
- Prompt caching widespread; longer contexts change cost/latency tradeoffs
141-
- Better PDF/diagram extraction; richer citations in UIs (bounding boxes)
142-
- Shift from one‑off evals to continuous testing and monitoring over time
143-
- Always a tradeoff: performance × latency × cost; be mindful of pricing impacts
144-
- “Context engineering” is emerging; 2026 update of this course will focus more here
145-
146139
---
147140

148-
# Syllabus (Sessions 0–3)
141+
# Sessions 0–3: Foundations
149142

150-
- Session 0: Product mindset; RAG as a recommender; improvement flywheel
151-
- Session 1: Synthetic data and retrieval evals; precision/recall; baselines
152-
- Session 2: From evals to training data; reranking; embedding fine-tuning
153-
- Session 3: UX that collects data; streaming; chain of thought; validation
143+
- **Session 0**: Product mindset; RAG as a recommender; improvement flywheel
144+
- **Session 1**: Synthetic data and retrieval evals; precision/recall; baselines
145+
- **Session 2**: From evals to training data; reranking; embedding fine-tuning
146+
- **Session 3**: UX that collects data; streaming; chain of thought; validation
154147
- Pacing note: Week 3 is intentionally lighter—use it to catch up and get ahead
155148
- Focus on UX patterns; not the most critical week content‑wise
156149

157-
<v-click>
158-
159-
## Main takeaway
150+
---
160151

161-
- fast retrieval evals (precision/recall on key chunks)
162-
- rerank/fine‑tune to get a 10-20% improvement
163-
- deploy and collect real data via UX
152+
# Sessions 0–3: Main Takeaway
164153

165-
</v-click>
154+
- Fast retrieval evals (precision/recall on key chunks)
155+
- Rerank/fine‑tune to get a 10-20% improvement
156+
- Deploy and collect real data via UX
166157

167158
---
168159

169-
# Syllabus (Sessions 4–6)
160+
# Sessions 4–6: Advanced Topics
170161

171-
- Session 4: Topic modeling; query segmentation; prioritization frameworks
172-
- Session 5: Specialized indices; multimodal search (docs, images, tables, SQL)
173-
- Session 6: Query routing; tools-as-APIs; single vs multi-agent; measurement
162+
- **Session 4**: Topic modeling; query segmentation; prioritization frameworks
163+
- **Session 5**: Specialized indices; multimodal search (docs, images, tables, SQL)
164+
- **Session 6**: Query routing; tools-as-APIs; single vs multi-agent; measurement
174165
- Week 6 is lighter; focus on routing and preview the context‑engineering direction
175166

176-
<v-click>
177-
178-
## Main takeaway
167+
---
179168

180-
- Figure out whats important to you and your users
181-
- Build specialized indices for those usecases
182-
- Make sure the Agent is able to use the specialized indices
169+
# Sessions 4–6: Main Takeaway
183170

184-
</v-click>
171+
- Figure out what's important to you and your users
172+
- Build specialized indices for those use cases
173+
- Make sure the agent is able to use the specialized indices
185174

186175
---
187176

188-
# Resources & Contributions
189-
190-
Feel free to share this with coworkers, but don't post these links on social media. You can completely welcome to write your own notes and share them online! (Please link back to us some how)
177+
# Resources
191178

192-
<v-click>
193-
194-
## Resources
179+
Feel free to share this with coworkers, but don't post these links on social media. You're completely welcome to write your own notes and share them online! (Please link back to us somehow)
195180

196181
![QR Codes for Resources](./assets/images/codes.jpeg)
197182

198183
- Study notes (work in progress): https://567-labs.github.io/systematically-improving-rag/
199184
- Talks/"greatest hits": https://567-labs.github.io/systematically-improving-rag/talks/
200185
- Slack: https://join.slack.com/t/improvingrag/shared_invite/zt-3dkinqb3q-vknvaBLoTx5tBj4PpGOVjw
201186
- Contribute via PRs/issues; add examples; suggest edits
202-
</v-click>
203187

204-
<v-click>
188+
---
205189

206-
## Recommendations
190+
# Recommended Talks
207191

208-
- Skylars RAG anti‑patterns Talk
192+
- Skylar's RAG anti‑patterns Talk
209193
- Anton's Text Chunking Strategies Talk
210194
- Exa's Why Google Search Sucks for AI Talk
211195
- Colin's Agentic RAG Talk
212-
</v-click>
213196

214197
---
215198

216-
# Q&A
199+
# Q&A and Next Steps
217200

218201
Short Q&A (About the Class Format) and then we'll let you watch the first videos
219202

220-
1) Find 'Optional: Watch Lecture' on your Calendar as a shortcut
221-
2) Feel free to watch at 2x speed!
203+
1. Find 'Optional: Watch Lecture' on your Calendar as a shortcut
204+
2. Feel free to watch at 2x speed!
222205

223-
See you at the office hours!
206+
See you at the office hours!

0 commit comments

Comments
 (0)