No sidebar. No autoplay. No attention traps. Just learning.

Read ~5m
6 terms · 5 segments

Layering every technique in RAG, one query at a time - David Karam, Pi Labs (fmr. Google Search)

5chapters with key takeaways — read first, then watch
1

RAG Quality Engineering & Foundational Retrieval

0:15-6:105m 55sIntro
2

Enhancing Ranking with Cross-Encoders & Custom Semantics

6:11-11:004m 49sConcept
3

Integrating User Preferences & Orchestrating LLM Queries

11:02-14:233m 21sArchitecture
4

Supplementary Retrieval & Model Distillation for Scale

14:25-17:102m 45sTraining
5

The Role of Product Design & Empirical RAG Development

17:13-20:173m 4sUse Case

Video Details & AI Summary

Published Jul 29, 2025
Analyzed Feb 1, 2026

AI Analysis Summary

This talk provides a practical framework for improving Retrieval Augmented Generation (RAG) systems by focusing on outcomes, analyzing failures, and applying techniques based on complexity-adjusted impact. It covers foundational retrieval methods like BM25 and embeddings, advanced ranking with cross-encoders, incorporating non-relevance signals and user preferences, and strategies for LLM query orchestration. The speaker also discusses cost optimization through model distillation and highlights the critical role of product design and an empirical approach in building robust RAG applications.

Title Accuracy Score
10/10Excellent
36.6s processing
Model:gemini-2.5-flash