Advanced RAG tutorial with Llamaindex & OpenAI GPT: Sentence Window Retrieval vs Basic Chunking

Correction: at 1:53, I said that an embedding is an x-digit string. That is not correct; it should be a list of x numbers.
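To make the correction concrete, here is a toy sketch of what an embedding actually looks like (plain Python; the values are made up, and only the shape matters):

```python
# An embedding is a vector: a list of floating-point numbers, not a digit
# string. OpenAI's text-embedding-ada-002 (used in the video) returns 1536
# of them; this toy example uses 4 made-up values to show the shape.
embedding = [0.021, -0.334, 0.107, 0.456]

assert isinstance(embedding, list)
assert all(isinstance(x, float) for x in embedding)
```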

Why build your own retrieval-augmented generation (RAG) pipeline when OpenAI's custom GPTs can do it out of the box? As of the making of this video, the OpenAI solutions do not scale to large knowledge bases. Having your own pipeline also gives you much more control over the design, which you will need if you are building an enterprise-grade, top-notch system.

In this tutorial, we will walk through a number of advanced techniques, such as sentence window retrieval, hierarchical auto-merging retrieval, returning top-K results vs. greedy search, and reranking.

We will also work through some code and do a real comparison between basic chunking and sentence window retrieval strategies.
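The core idea of sentence window retrieval can be sketched in a few lines: rank individual sentences against the query, then return each hit together with its surrounding sentences. This is an illustration only, not the LlamaIndex implementation; the `score` function is a keyword-overlap stand-in for real embedding similarity:

```python
# Minimal sketch of sentence window retrieval: retrieve by sentence,
# but return the sentence plus a window of its neighbors as context.

def split_sentences(text):
    return [s.strip() for s in text.split(".") if s.strip()]

def score(query, sentence):
    # Stand-in for cosine similarity between embeddings.
    q = set(query.lower().split())
    s = set(sentence.lower().split())
    return len(q & s) / (len(q | s) or 1)

def retrieve_with_window(text, query, top_k=1, window=1):
    sentences = split_sentences(text)
    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(query, sentences[i]),
                    reverse=True)
    results = []
    for i in ranked[:top_k]:
        lo, hi = max(0, i - window), min(len(sentences), i + window + 1)
        results.append(". ".join(sentences[lo:hi]) + ".")
    return results
```

With `window=0` this degrades to plain sentence retrieval; basic chunking, by contrast, embeds and returns fixed-size chunks with no notion of which sentence matched.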
Comments

This is currently the best RAG tutorial on the internet.

mikestaub

As always, a well-prepared, easy-to-follow video that delivers a lot of information and value. Thank you!

nazihfattal

Your explanations and delivery are on point. Thank you for the excellent content and relaxed narration style.

MaliRasko

Great video with an awesome, easy-to-follow explanation of RAG. Reminds me of a recent Andrej Karpathy video.

sitedev

Wow, that’s a wonderful piece of advice from such a talented professional in the field. Thank you 😊

arjoai

Fascinating! Your approach to teaching and presenting is poetic. It is well organized, well explained, and well illustrated. Indeed, kudos to you. If I could, I would subscribe to your channel twice!

unclecode

Clear, effective explanations. Thank you!

victorfeight

Great job explaining the window, how the vector store and doc store relate, and where the window lives. I've been trying to understand this aspect of LlamaIndex, and you made it very clear!

danielvalentine

LlamaIndex has a new version, 0.10. I will migrate your code and learn at the same time.
Thanks for introducing sentence window retrieval. Basic straight-split chunking with retrieve/chat doesn't produce very meaningful results on our docs.

ginisksam

Awesome video explained very clearly! Thanks a ton!
If I may ask, what tool do you use for those visual flows? Love it!

sivi

Really cool video! Is there an "ideal" or "recommended" value of window_size?

sayanbhattacharyya

Excellent video. Liked the workflow you showed in the beginning. What software are you using to create this workflow?

Work_Pavan-muye

I'm a bit confused about the use case for reranking. Doesn't that defeat the purpose of the top-K search, in that we include all chunks, significantly increasing the number of tokens we use? Is the idea to do reranking with a smaller, cheaper LLM before sending the resultant top-K chunks to a more robust LLM?

peteredmonds
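For readers with the same question, the pattern usually discussed is cheap retrieval of a generous top-K, then reranking of only those K candidates, with just the best few forwarded to the expensive LLM. A hypothetical sketch (the scoring functions are made-up stand-ins for embedding similarity and a cross-encoder reranker):

```python
# Retrieve-then-rerank: cheap_score mimics fast embedding similarity over
# the whole corpus; rerank_score mimics a slower, more accurate reranker
# that only ever sees the top_k candidates. Only top_n chunks reach the LLM.

def cheap_score(query, doc):
    # Stand-in for vector similarity: shared-word count.
    return len(set(query.split()) & set(doc.split()))

def rerank_score(query, doc):
    # Stand-in for a cross-encoder: also rewards an exact phrase match.
    return cheap_score(query, doc) + (1 if query in doc else 0)

def retrieve_then_rerank(query, docs, top_k=3, top_n=2):
    candidates = sorted(docs, key=lambda d: cheap_score(query, d),
                        reverse=True)[:top_k]
    return sorted(candidates, key=lambda d: rerank_score(query, d),
                  reverse=True)[:top_n]
```

The reranker's cost scales with K, not with the corpus, and the LLM only sees the final top-n chunks, so total token usage stays bounded.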

Seeing your explanation at around 09:30, it seems like we can only use K windows as the knowledge base to answer a prompt. What if the prompt asks for information that is contained in more than K windows? For example, if I have several documents, each containing a bio of a person, and the user asks to sort those 10 people by age, how can it figure that out? I guess we can use a big value for K, if the cosine similarity engine can take it, but I am guessing providing too much context to the LLM will cost a lot of money?

goonymiami

Small correction: an embedding is not a 1536-digit number but a vector of size 1536.

peteredmonds

Great video! Is the diagram anywhere to reference?

chiggly

It is 1784 in teahistory.txt and 1794 in chinahistory.txt, so that's a bit confusing.
But anyway, great tutorial. Thanks!

jannessantoso