Retrieval-Augmented Generation (RAG)

This video explains the Retrieval-Augmented Generation (RAG) model! The approach combines Dense Passage Retrieval with a seq2seq BART generator and is tested on knowledge-intensive tasks such as open-domain QA, Jeopardy question generation, and FEVER fact verification. This looks like a really interesting paradigm for building language models that produce factually accurate generations!

Thanks for watching! Please Subscribe!

Paper Links:


Time Stamps
0:00 Introduction
2:05 Limitations of Language Models
4:10 Algorithm Walkthrough
5:48 Dense Passage Retrieval
7:44 RAG-Token vs. RAG-Sequence
10:47 Off-the-Shelf Models
11:54 Experiment Datasets
15:03 Results vs. T5
16:16 BART vs. RAG - Jeopardy Questions
17:20 Impact of Retrieved Documents z_i
18:53 Ablation Study
20:25 Retrieval Collapse
21:10 Knowledge Graphs as Non-Parametric Memory
21:45 Can we learn better representations for the Document Index?
22:12 How will Efficient Transformers impact this?
Comments

Welcome back! It's been a long time since you posted on YouTube.
We were waiting for you. :)
Thank you for sharing your knowledge.

AbdennacerAyeb

New background, who dis??
Great to see you back making videos!

whatsinthepapers

Glad to see you again. I was getting worried. Your videos are great, thanks so much for the content.

arielf

Welcome back, thanks for the great work you are doing.

xgezmji

Really great paper. To some extent all of NLP can be treated as a QA task.

DistortedV

Very nice video! Really excited to try these techniques out

imranq

Can you explain why this RAG model seems popular? It seems like all they've done is take a pre-trained retrieval model, connect it to a pre-trained seq2seq model, and train them jointly. They also just did a simple concatenation of the retrieved passages with the initial input before feeding it to the seq2seq model. This all seems like really basic stuff. So am I just missing something here? Because if you already knew which documents you wanted the seq2seq model to use, you could get rid of the retrieval model and just directly concatenate those with the original input.

himatammineedi
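On the question above about what RAG adds over simple concatenation: besides training the query encoder and generator jointly, RAG treats the retrieved document as a latent variable and marginalizes over the top-k candidates rather than committing to a single one. A toy sketch of the RAG-Sequence marginalization (the numbers are invented for illustration, not real model outputs):

```python
import numpy as np

# RAG retrieves the top-k documents z, scores each with the retriever,
# and marginalizes the generator's likelihood over them:
#     p(y | x) = sum_z  p(z | x) * p(y | x, z)

retrieval_scores = np.array([2.0, 0.5, -1.0])   # inner products q(x) . d(z)
p_z_given_x = np.exp(retrieval_scores) / np.exp(retrieval_scores).sum()

# Generator likelihood of the same answer y given each retrieved document.
p_y_given_xz = np.array([0.70, 0.20, 0.05])

# Final answer probability: a retrieval-weighted average over documents.
p_y_given_x = float(p_z_given_x @ p_y_given_xz)
print(p_y_given_x)
```

Because the document probabilities flow through this sum, the retriever receives gradient signal during training, which a fixed hand-picked concatenation would not provide.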

Is it possible for you to add a link to the slide deck used for this video in the description?

sandeepunnikrishnan

Can someone expand on the snippet at 4:45 explaining how the query works with the encoded samples?

In the video, the speaker states, "And then when we ask a query, like we have this new x sequence with a mask at the end of it, we're going to treat that like a query, encode that query, and then use this maximum inner product search...".

My understanding is that we encode the masked x (where x is the input) with the same query encoder as what encodes the context information, then use MIPS to find essentially the most similar context to x, which is then processed by the generator to append to x.

Any help clarifying would be much appreciated.

thefourthbrotherkaramazov
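The maximum inner product search step described in the comment above can be sketched with NumPy (random toy embeddings standing in for the DPR encoder outputs; the real model uses BERT-based query and passage encoders):

```python
import numpy as np

# Toy stand-ins for DPR outputs: the passage encoder produces one dense
# vector per passage, and the query encoder maps the (masked) input x
# into the same vector space.
rng = np.random.default_rng(0)
passage_embeddings = rng.normal(size=(100, 16))  # 100 passages, dim 16

def mips_top_k(query_embedding, k=5):
    """Maximum Inner Product Search over the passage index.

    Scores every passage by its inner product with the query and returns
    the indices of the k highest-scoring (most similar) passages, which
    the generator then conditions on alongside x.
    """
    scores = passage_embeddings @ query_embedding
    top_k = np.argsort(scores)[::-1][:k]
    return top_k, scores[top_k]

query_embedding = rng.normal(size=16)
indices, scores = mips_top_k(query_embedding)
print(indices)
```

In practice the paper uses a FAISS index over the full passage collection instead of this brute-force dot product, but the ranking principle is the same.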

Any ideas on methodologies to evaluate the performance of the retrieval mechanism within the RAG model? Thanks.

TheAmyShows

Amazing video. What about fine-tuning this for different tasks? The authors say we do not need to fine-tune the document encoder, but what about the other components? Any comments on that?

shaz

I want to know: is RAG a model, a framework, or just an approach? The question might be dumb to ask, but I really want to know.

bivasbisht