Nvidia's Free RAG Chatbot Supports Documents and YouTube Videos (Zero Coding - Chat With RTX)

Chat With RTX is a free chatbot released by Nvidia. It can be used as a general AI chatbot, for RAG over documents, and for RAG over YouTube videos. In this video, I show how to install and use the chatbot. I also test its inference time, accuracy, and hallucination with both the Mistral 7B and Llama2 13B large language models (LLMs).

00:00 Intro to Chat with RTX
01:14 Installation guide
02:12 UI walk-through
03:08 Testing the AI chatbot (testing response accuracy, inference time, memory, and hallucination)
07:12 RAG with documents
11:10 RAG with YouTube videos

#rag #chatbot #nvidia #llm #openai #gpt #chatgpt
Comments

Interesting and informative video. Considering the plethora of LLMs and tools popping up, it's a matter of saturation in architecture and speed.

Interestingly, all the RAG tools support PDF files, but not one supports databases. What is your view?

navanshukhare

Very interesting video, thank you Farzad! Looking forward to your future work.

hadi-yeg

Can we learn C++ or other programming languages with Chat with RTX? Is it worth it? Is it better than ChatGPT with GPT-4 for learning programming languages? What do you say?

omicron

Thanks for sharing! When I downloaded and installed it, there was no Llama2 13B INT4 to choose from.
There were only Chat With RTX 0.2 and Mistral 7B INT4 1.0. My graphics card is an NVIDIA GeForce RTX 4060 with 8 GB; is it possible that the video memory is too small? Thanks.

jxm
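
As a rough aid for the video-memory question above, here is a minimal back-of-the-envelope sketch of how much memory the model weights alone would occupy at INT4 precision. The nominal parameter counts (7e9 and 13e9) and the helper name `weight_memory_gib` are assumptions for illustration. The estimate ignores the KV cache, activations, and runtime overhead, and it says nothing about the threshold the installer actually checks.

```python
def weight_memory_gib(num_params: float, bits_per_weight: int) -> float:
    """Approximate memory needed for the model weights only, in GiB."""
    bytes_total = num_params * bits_per_weight / 8
    return bytes_total / 2**30


# Nominal parameter counts (assumed); real checkpoints differ slightly.
models = [("Mistral 7B", 7e9), ("Llama2 13B", 13e9)]

for name, params in models:
    gib = weight_memory_gib(params, bits_per_weight=4)
    print(f"{name} INT4: roughly {gib:.1f} GiB for weights alone")
```

Even when the weights fit on paper, the remaining headroom on an 8 GB card has to cover everything else the runtime needs, so the larger model may still be impractical.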

For the "Mistral 7B INT4" model, something interesting I noticed was what the model was or was not allowed to say. For example: I "trained" it on my own dataset, which was a single text file that simply said "cat" a hundred times.

I asked the model what a dog was: it did not know.
I asked the model what a horse was: it did not know.
I asked the model what a cat was: it gave a detailed explanation of what a cat was.

My conclusion is this: when the model works from our data, it seems to fill gaps in the information using its default AI model, without being explicit that it is doing so. I want to test the capabilities, advantages, and disadvantages in the coming days and share my results.

darkmatter

Your tutorials are amazing! They've been incredibly helpful. I have a request: could you create a tutorial on building a chatbot using Node.js, React, and Next.js that can upload data to a Vector database and interact with it? I believe a tutorial on this topic would be incredibly valuable and interesting.

godfreyogbeide

Could you make a comparison video on which is better: Chat with RTX or RAG?

omicron

Llama isn't included with the download. Only Mistral.

Araphex

I don't know. I installed it, but I can't pass in any YouTube video URL, and when it displays the reference doc, it doesn't make it a clickable link.

adriangpuiu

Why do you pronounce your 'Th' sounds as 'D' sounds? Example: The word is 'this' not 'dis'.

Also, much of your information is incomplete and assumes a certain level of knowledge from all of your viewers. Example: You said "this chatboard is only for users with access to Series 30 or 40". 30 or 40 series of what? Cats? Cars? Nvidia GPUs?

nathan_sweet