Pinecone LangChain - Questions/Answer on Your Own TXT/PDF Files - Code in 9 Minutes!

preview_player
Показать описание
This video guides you through the basics of loading a custom TXT and a PDF file externally into Pinecone as embeddings(vectors). It also guides you on the basics of querying your custom TXT/PDF file to get answers back (semantic search) from the Pinecone vector database, via the OpenAI LLM API. Using LLMs to query your own data is a powerful application to become operationally efficient for various tasks requiring looking up large documents.

Thanks for watching! 🙏

😃 SUBSCRIBE 🌟 👍 LIKE 🌟 💬 COMMENT 🌟 SHARE

▬▬▬▬▬▬▬▬▬▬▬▬▬▬ Resources and Support ▬▬▬▬▬▬▬▬▬▬▬

Thanks for watching! 😃
Рекомендации по теме
Комментарии
Автор

Watch LangChain Tutorials: :

Git Repo:

goodaitechnology
Автор

Hi! Thanks for the video.
What the embed_model = 'text-embedding-ada-002' variable is used for?

Daniel-flsi
Автор

In the second example with PDF, you are getting the PDF content in the `book_texts` variable, but after that, you are not updating the Pinecone with this `book_texts` data. (i.e.: book_docsearch = for t in book_texts], embeddings, index_name = index_name))

The `book_docsearch` contains the older pinecone object for the TXT file. So, how it's giving the results from the PDF?
Please explain/clarify.

rajatagrawal
Автор

Great video, one problem I'm facing while creating book_docsearch. If I'm running the cell again it's generating embedding again. Is there a way to use the earlier embedding already stored, if I just want to do inference

antiquechariot
Автор

Thanks! You upsert txt docs, is it the same for uploading pdf, and instead of using .txt use .pdf in the script?

thespiritualmindset
Автор

Very interesting, im eager to learn about this but I only have the very basic understanding of Python lol! What videos do you recommend I should watch before this? it would be of great help! :)

renderllama
Автор

Great Video! Even though I had no clue about this it helped me and I didn't quite understand fully but i'm sure this would help ppl in need. music i a bit too loud tho.

AbsoluteGT.
Автор

Thank you so much for this video! So I’m having a problem, every time I run my script it says the token size is too large. But I see it at like 1500 tokens with 50 token overlap. And I uploaded a small PDF and asked a very simple question. Yet every time I run the script the token size gets bigger and bigger. Does pinecone and/or open ai embeddings need to be cleared or initialized somehow? I’m a beginner so any help you can offer would be great!

brrocks
Автор

Can you suggest which vector DB has best results for pdf Question Answering?

NayanAkhade-rq
Автор

Can we have a meet call to discuss potential collaboration opportunities?

UsamaMansoor-yw
Автор

Can you please make a video on how to deploy it on website so that customer can chat with my pdf data and get answers on time.

topacademy
Автор

Will there be any false replies or wrong replies? Suppose, If I am asking like, "how many times a name "asdfsdfsadf" used in the story"

vrynstudios
Автор

Obviously this video starts somewhere after certain knowledge has already been delivered. A terminology primer would be useful.

rayfellers
Автор

very informational, but it is very unfortunate that you have that annoying background music.

fluffykitties