Vector Search RAG Tutorial – Combine Your Data with LLMs with Advanced Search

preview_player
Показать описание
Learn how to use vector search and embeddings to easily combine your data with large language models like GPT-4. You will first learn the concepts and then create three projects.

✏️ Course developed by Beau Carnes.

🏗️ MongoDB provided a grant to make this course possible.

⭐️ Contents ⭐️
⌨️ (00:00) Introduction
⌨️ (01:18) What are vector embeddings?
⌨️ (02:39) What is vector search?
⌨️ (03:40) MongoDB Atlas vector search
⌨️ (04:30) Project 1: Semantic search for movie database
⌨️ (32:55) Project 2: RAG with Atlas Vector Search, LangChain, OpenAI
⌨️ (54:36) Project 3: Chatbot connected to your documentation

🎉 Thanks to our Champion and Sponsor supporters:
👾 davthecoder
👾 jedi-or-sith
👾 南宮千影
👾 Agustín Kussrow
👾 Nattira Maneerat
👾 Heather Wcislo
👾 Serhiy Kalinets
👾 Justin Hual
👾 Otis Morgan
👾 Oscar Rahnama

--

Рекомендации по теме
Комментарии
Автор

What kinds of projects do you plan to make with Vector Search?

beau
Автор

Woah you're teaching this is the first time I've ever seen one from you

psikosen
Автор

அண்ணனுக்கு வணக்கம்🙏! சிறப்பா செஞ்சிருக்கீங்க ரொம்ப நல்லா இருந்துச்சு! 😄

umeshkumarasamy
Автор

I understand MongoDB sponsored this but I’d really have appreciated WHY someone should choose MongoDB vs other options. Same with embedding model. WHY use the hugging face model vs OpenAI Ada. There are so many different options for vector store and model, so a tutorial that deep dives into this decision is super important.

JeremyJanzen
Автор

Hi, thanks for video!

What about a follow-up questions in RAG?
Example
Q: Suggest some movie with Johny Depp
A: <some title of the movie>
Q: What year was it filmed?
A: ...

voloUA
Автор

This is brilliant. Thanks so much from a grateful student at the School Of Code

peterfaretra
Автор

There’s a lot missing. I get this is basic, but the metadata is crucial.. and 90% of people will be using cosine similarly, especially in RAG systems. Great video by the way. It’s awesome that you take time out to help others…

Canna_Science_and_Technology
Автор

people having trouble with loading sample data: be on the main screen and click project drop down menu on the top place to see "view all projects", next will be Overview screen, there is right pointing arrow close to it "view database deployments", there you will see your Cluster0, click it, next screen right side you will see buttons "connect", "configuration", and " ...", click the dots button to see "Load sample dataset".

ummnine
Автор

Thanks for the video tutorial. Helped me to understand the core ideas used in this technology!

muttdev
Автор

You @beau are a much better teacher, I wish you created most of the tutorials! .. but then I don't want you to burn out! take care of yourself Sir.

MrStargazer
Автор

It would really help everyone if you followed the best practices of using your tokens/logins safely. The old practice what you preach. Many of your viewers might not really know how to do that. They NEED to do it. I appreciate it makes your video less expository and is a burden in terms of prep.

andyhenrie
Автор

Where code for project two is available ? in github repository it is different, thanks

ugoceruti
Автор

The files for project two in the Github repository do not match this video. Could you kindly verify the files please? Thanks

pvqvsqd
Автор

Guys please make a video with opensource llms API, like palm or hugging face. Please..

out-of-sight
Автор

Where is the sample_data used in project 2? Doesn't seem to be in the repository that is linked

lawful_neutral
Автор

Fantastic source of information! Learnt a lot 🤓

carl-w
Автор

You can generate vector embeddings by calling rest api exposed by Vendors like HuggingFace, OpenAI etc. One thing to note that, these vendors employ rate limiting at their ending basically throttling the no of request that you can make to theirs apis within second. You need to buy subscription accordingly depending on your requirement

vinitsunita
Автор

Great video! I really enjoyed your introduction to Rag. Your explanation was clear and informative. I noticed you broke the text into segments instead of using the whole text. Could you explain the reasoning behind this approach? Thank you in advance!

niedland
Автор

Can you please upload these 3 files in the git repo? aerodynamics.txt, chat_conversation.txt and log_example.txt.

vadirajabhat
Автор

Hi. Could you be so kind to add the three TXT files mentioned in project#2?. The are mandatory for completing the example... thanks.

mtalamona