RAG from scratch: Part 9 (Query Translation -- HyDE)

preview_player
Показать описание
HyDE (Hypothetical Document Embeddings) is an approach to improve retrieval that generates hypothetical documents that could be used to answer the user input question. These documents, drawn from the LLMs knowledge, are embedded and used to retrieve documents from an index. The idea is that hypothetical documents may be better aligned with the indexes documents than the raw user question.

Slides:

Code:

Reference:
Рекомендации по теме
Комментарии
Автор

the worst thing about this playlist is it's ending...
thanks for the amazing explanation, your effort is really appreciated,

wish to update it ASAP

ahmed_hefnawy
Автор

I love the lack of b.s. that these videos have... genius simple explanation, very pragmatic, well structured and super practical... I'm Lance huge fan!!

MrIsaacbabsky
Автор

I like to become a Software Engineer like Lance Martin, the way he explains everything swiftly inspires me🙏

sarveshshirude
Автор

I love the creativity in this approach!

freemindsupplements
Автор

Thank you .. all great videos. Learned about RAG and LangChain from scratch through this playlist

ncde
Автор

This is really neat. Thanks for the awesome series.

AppsWithCode
Автор

Lance, we can't wait for the updates :(

sylap
Автор

there's this part that confuses me, if you are asking a question that you have no knowledge of, say regarding private documents that are only stored in the vector db, then how would you be able to generate hypothetical document? Or in other words, if you already have the information to generate hypothetical document, why bother doing the query? doesn't this defeat the purpose? I know I must be missing something here, thanks ahead if someone can help clarify a bit.

drizzle
Автор

Would be nice to see the differences in distance for one technique versus another.

CostaMichailidis
Автор

I am a little confused with this technique, isn't it making the RAG app prone to hallucinations?

rutambhagat
Автор

Dear Lance, when is the next set of videos from this series coming up (i.e., routing and query construction)? Thanks!

elyxavier
Автор

I really like the videos. I just to understand how can we evaluate the performance (quality of response) of different query translation techniques for a specific use case or domain.

manideep
Автор

Nice video! But I think that is needed to use a diferent documents, the retrieval for back propagating docs was the same as que original question

guillermogaete
Автор

What scinario is the best to use HyDE?

bingyaoli
Автор

I can see this being problematic in domains that the model is not trained on. If it produces a hypothetical document that is way off the rails, that hypothetical document won't return good results.

aidan_kang