## Data-Driven Product Development for AI Applications
*A systematic approach to building self-improving AI systems*
!!! abstract "About This Book"
    This book provides a structured approach to evolving Retrieval-Augmented Generation (RAG) from a technical implementation into a continuously improving product. You'll learn to combine product thinking with data science principles to create AI systems that deliver increasing value over time.

    Most teams focus on the latest models and algorithms while missing the fundamentals: understanding their data, measuring performance, and systematically improving based on user feedback.
## The RAG Improvement Flywheel
At the core of this book is the RAG improvement flywheel - a continuous cycle that transforms user interactions into product enhancements.
```mermaid
graph TD
A[Synthetic Data & Evaluation] --> B[Learning from Evaluations]
style E fill:#dfd,stroke:#333,stroke-width:2px
```
!!! tip "Beyond Technical Implementation"

    This book goes beyond teaching you how to implement RAG. It shows you how to think about RAG as a product that continuously evolves to meet user needs and deliver business value.

## Workshop Series
### [Introduction: Beyond Implementation to Improvement](workshops/chapter0.md)
Shifting from technical implementation to product-focused continuous improvement. Understanding RAG as a recommendation engine and the improvement flywheel.
### [Chapter 1: Kickstarting the Data Flywheel with Synthetic Data](workshops/chapter1.md)
Common pitfalls in AI development, leading vs. lagging metrics, precision and recall for retrieval evaluation, and synthetic data generation techniques.
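
To make the evaluation loop concrete, here is a minimal sketch of precision@k and recall@k over a synthetic evaluation set, assuming each synthetic question is generated from a known source chunk so that chunk counts as relevant by construction. The data layout and function names are illustrative, not the chapter's actual code.

```python
# Minimal sketch: precision@k and recall@k over a synthetic evaluation set.
# Assumption: each synthetic question stores the id of the chunk it was
# generated from, so that chunk is "relevant" by construction.

def precision_recall_at_k(retrieved_ids, relevant_ids, k=5):
    """Compute precision@k and recall@k for a single query."""
    top_k = retrieved_ids[:k]
    hits = sum(1 for doc_id in top_k if doc_id in relevant_ids)
    precision = hits / k
    recall = hits / len(relevant_ids) if relevant_ids else 0.0
    return precision, recall

# Hypothetical synthetic eval set.
eval_set = [
    {"question": "What is the refund window?", "relevant": {"chunk_42"}},
    {"question": "How do I rotate an API key?", "relevant": {"chunk_7"}},
]

def evaluate(retriever, eval_set, k=5):
    """retriever: any callable mapping a question to a ranked list of chunk ids."""
    scores = [
        precision_recall_at_k(retriever(item["question"]), item["relevant"], k)
        for item in eval_set
    ]
    return {
        "precision@k": sum(p for p, _ in scores) / len(scores),
        "recall@k": sum(r for _, r in scores) / len(scores),
    }
```

Because these are leading metrics, they can be recomputed on every retriever change, long before user-facing outcomes move.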
### [Chapter 2: Converting Evaluations into Training Data for Fine-Tuning](workshops/chapter2.md)
Why generic embeddings fall short, converting evaluation examples into few-shot prompts, contrastive learning, and re-rankers as cost-effective strategies.
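
As a rough illustration of turning evaluation data into training data, the sketch below builds (query, positive, negative) triplets from hypothetical evaluation records; the field names and the hard-negative heuristic are assumptions. The resulting triplets could feed a contrastive loss for embedding fine-tuning, or be flattened into labeled (query, passage) pairs to train a cross-encoder re-ranker.

```python
# Sketch: converting evaluation records into contrastive training triplets.
# Assumption: each record has the query, the chunk judged relevant, and the
# other retrieved chunks that were judged irrelevant (hard negatives).

import random

def build_triplets(eval_records, num_negatives=2, seed=42):
    rng = random.Random(seed)
    triplets = []
    for rec in eval_records:
        negatives = [c for c in rec["retrieved"] if c != rec["relevant_chunk"]]
        for neg in rng.sample(negatives, min(num_negatives, len(negatives))):
            triplets.append(
                {"query": rec["query"], "positive": rec["relevant_chunk"], "negative": neg}
            )
    return triplets

example_records = [
    {
        "query": "What is the refund window?",
        "relevant_chunk": "Refunds are accepted within 30 days of purchase.",
        "retrieved": [
            "Refunds are accepted within 30 days of purchase.",
            "Shipping usually takes 5-7 business days.",
            "Gift cards cannot be exchanged for cash.",
        ],
    }
]

print(build_triplets(example_records))
```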
### [Chapter 3: User Experience and Feedback Collection](workshops/chapter3-1.md)
Making feedback visible and engaging (increasing rates from <1% to >30%), proven copy patterns, and enterprise feedback collection through Slack integrations.
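
As one hedged sketch of what "making feedback visible" can look like in code, the endpoint below records answers to a specific question such as "Did we answer your question?" alongside the request it refers to. FastAPI and every field name here are assumptions for illustration, not the chapter's implementation.

```python
# Sketch: a minimal feedback endpoint (FastAPI and field names are assumptions).
# Key ideas: ask a specific question, and keep enough context (request_id)
# to join feedback back to the query, retrieved chunks, and generated answer.

from datetime import datetime, timezone
from typing import Optional

from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()

class Feedback(BaseModel):
    request_id: str            # links feedback to a logged query/answer pair
    answered_question: bool    # response to "Did we answer your question?"
    comment: Optional[str] = None

@app.post("/feedback")
def collect_feedback(feedback: Feedback) -> dict:
    record = feedback.model_dump()
    record["created_at"] = datetime.now(timezone.utc).isoformat()
    # In practice this would be written to a feedback store or analytics pipeline.
    print(record)
    return {"status": "ok"}
```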
#### [Chapter 3.2: Streaming and Interstitials](workshops/chapter3-2.md)
Psychology of waiting, implementing streaming responses for 30-40% higher feedback collection, and meaningful interstitials.
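
The sketch below illustrates the streaming-plus-interstitial idea with a plain async generator: the user sees a status message while retrieval runs, then tokens as they arrive. The `retrieve` and `generate` stubs are placeholders standing in for real retrieval and a streaming LLM call, not a specific framework's API.

```python
# Sketch: stream an interstitial status message first, then answer tokens.
# The retrieval and generation functions are stubs for illustration only.

import asyncio
from typing import AsyncIterator

async def retrieve(question: str) -> list[str]:
    await asyncio.sleep(0.5)          # stand-in for a real retrieval call
    return ["doc_1", "doc_2"]

async def generate(question: str, docs: list[str]) -> AsyncIterator[str]:
    for token in ["Rotate ", "the ", "key ", "under ", "Settings."]:
        await asyncio.sleep(0.1)      # stand-in for streamed LLM tokens
        yield token

async def answer_stream(question: str) -> AsyncIterator[str]:
    # Interstitial: show progress instead of a blank screen while retrieval runs.
    yield "Searching the knowledge base...\n"
    docs = await retrieve(question)
    yield f"Found {len(docs)} relevant documents. Drafting an answer...\n"
    async for token in generate(question, docs):
        yield token

if __name__ == "__main__":
    async def main() -> None:
        async for chunk in answer_stream("How do I rotate an API key?"):
            print(chunk, end="", flush=True)

    asyncio.run(main())
```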
Measuring tool selection effectiveness, dual-mode UI, user feedback as training data, and creating an improvement flywheel.
## Expert Talks

Explore insights from industry experts and practitioners through our collection of talks, lightning lessons, and presentations.

### Foundation and Evaluation

**[Text Chunking Strategies](talks/chromadb-anton-chunking.md)** - Anton (ChromaDB)

Why chunking remains critical even with infinite context windows. Default chunking strategies in popular libraries often produce terrible results.

**[Embedding Performance Evaluation](talks/embedding-performance-generative-evals-kelly-hong.md)** - Kelly Hong

Model rankings on custom benchmarks often contradict MTEB rankings - public benchmark performance doesn't guarantee real-world success.

**[Fine-tuning Re-rankers and Embedding Models for Better RAG Performance](talks/fine-tuning-rerankers-embeddings-ayush-lancedb.md)** - Ayush (LanceDB)

Practical approaches to enhancing retrieval quality.

**[RAG Anti-patterns in the Wild](talks/rag-antipatterns-skylar-payne.md)** - Skylar Payne

Common mistakes and how to fix them.

**[Semantic Search Over the Web with Exa](talks/semantic-search-exa-will-bryk.md)** - Will Bryk

Building AI-first search engines.

**[Online Evals and Production Monitoring](talks/online-evals-production-monitoring-ben-sidhant.md)** - Ben Hylak & Sidhant Bendre

Monitoring AI systems at scale. Simple UX changes increased feedback collection 4x. Key insight: specific questions like "Did this run do what you expected?" dramatically outperform generic prompts.

[View all talks →](talks/index.md)
-**"How do we handle time-based queries?"** Use PostgreSQL with pgvector-scale. [Learn more →](office-hours/faq.md#how-do-we-introduce-a-concept-of-time-and-vector-search-to-answer-questions-like-whats-the-latest-news-without-needing-to-move-to-a-graph-database)
164
+
-**"Should we use DSPy for prompt optimization?"** It depends. [See when →](office-hours/faq.md#what-is-your-take-on-dspy-should-we-use-it)
165
+
-**"Would you recommend ColBERT models?"** Test against your baseline first. [See approach →](office-hours/faq.md#would-you-recommend-using-colbert-models-or-other-specialized-retrieval-approaches)
166
+
167
+
[Browse All FAQ](office-hours/faq.md){ .md-button } [View Office Hours](office-hours/index.md){ .md-button }
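
For the time-based FAQ above, one possible shape of the answer is a plain SQL recency filter combined with pgvector similarity, as in this hedged sketch; the table and column names (`documents`, `embedding`, `published_at`) are hypothetical.

```python
# Sketch: "latest news" style retrieval with Postgres + pgvector.
# Recency is a WHERE clause on a timestamp column; relevance is the vector
# distance ordering. Table and column names are hypothetical.

import psycopg

QUERY = """
    SELECT id, title, published_at
    FROM documents
    WHERE published_at > now() - interval '7 days'
    ORDER BY embedding <=> %s::vector   -- cosine distance to the query embedding
    LIMIT 10;
"""

def latest_relevant(conninfo: str, query_embedding: list[float]) -> list[tuple]:
    # pgvector accepts a bracketed text literal cast to ::vector.
    embedding_literal = "[" + ",".join(str(x) for x in query_embedding) + "]"
    with psycopg.connect(conninfo) as conn:
        return conn.execute(QUERY, (embedding_literal,)).fetchall()
```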
## For Product Leaders, Engineers, and Data Scientists
!!! info "What You'll Learn"
    **For Product Leaders**: Establish metrics that align with business outcomes, prioritization frameworks, and roadmapping approaches

    **For Engineers**: Implementation patterns for rapid iteration, architectural decisions, and modular capabilities

    **For Data Scientists**: Synthetic evaluation datasets, query segmentation techniques, and continuous learning approaches

## About the Author

Jason Liu brings practical experience from his work at Facebook and Stitch Fix, and from consulting for companies like HubSpot, Zapier, and many others. His background spans computer vision, recommendation systems, and RAG applications across diverse domains.

!!! quote "Author's Philosophy"
    "The most successful AI products aren't the ones with the most sophisticated models, but those built on disciplined processes for understanding users, measuring performance, and systematically improving."

---
## Getting Started
Begin your journey by reading the [Introduction](workshops/chapter0.md) or jump directly to [Chapter 1](workshops/chapter1.md) to start building your evaluation framework and data foundation.
If you want to get discounts and a 6-day email course on the topic, make sure to subscribe to