Nvidia's Free RAG Chatbot supports documents and youtube videos (Zero Coding - Chat With RTX)

Показать описание

Chat With RTX is a free chatbot released by Nvidia. This chatbot can be used as an AI chatbot, RAG with documents, and RAG with YouTube videos. In this video, I show how to install and use the chatbot. I also test its inference time, accuracy, and hallucination with both Mistral 7B and Llama2 13B parameter Large Language models (LLMs).

00:00 Intro to Chat with RTX
01:14 Installation guide
02:12 UI walk-through
03:08 Testing the AI chatbot (testing response accuracy, inference time, memory, and hallucination)
07:12 RAG with documents
11:10 RAG with Youtube videos

#rag #chatbot #nvidia #llm #openai #gpt #chatgpt

Farzad Roozitalab

Рекомендации по теме

Комментарии

Interesting and informative video, Considering the plethora of LLMs and tools popping up, it's a matter of saturation in architecture and speed.

Interestingly all the RAG support PDF files, but not one supports Databases. What is your view?

navanshukhare

Very interesting video thank you Farzad! Looking forward to your works.

hadi-yeg

Can we learn C++ or other programming languages on Chat with RTX, is it worth it? Like better than chat gpt 4 in learning programming languages, what do you say?

omicron

Thanks for sharing! When I downloaded and installed, there was no Llama2 13B INT4 to choose from?
There were only Chat With RTX 0.2, Mistral 7B INT4 1.0. Because my graphics card is NVIDIA GeForce GTX 4060 8g, is it possible that the video memory is too small? Thanks.

jxm

For the model "Mistral 7B int4"
Something interesting I noticed was what the model was or was not allowed to say. For example: I "trained" it my own dataset that was a single text file that simply said "cat" a hundred times.

I asked the model what a dog was. It did not know.
I asked the model what a horse was: It did not know.
I asked the model what a cat was: and it gave an in detail explanation of what a cat was.

My conclusion is this. When the model trains off of our training data, it seems to supplant gaps of information with its default AI model, despite not being explicit that it is doing so. I want to test the capabilities and advantages / disadvantages in the coming days, and share my results.

darkmatter

Your tutorials are amazing! They've been incredibly helpful. I have a request: could you create a tutorial on building a chatbot using Node.js, React, and Next.js that can upload data to a Vector database and interact with it? I believe a tutorial on this topic would be incredibly valuable and interesting.

godfreyogbeide

Make a video on comparison and which is better chat with RTX vs RAG?

omicron

Llama isn't included with the download. Only Mistral.

Araphex

i dont lnow, i installed it but i cant pass any youtube video URL and also when it;s displaying the reffrence doc dosent make it linkable

adriangpuiu

Why do you pronounce your 'Th' sounds as 'D' sounds? Example: The word is 'this' not 'dis'.

Also, much of your information is incomplete, and expects a certain knowledge level of all of your viewers. Example: You said "this chatboard is only for users with access to Series 30 or 40"- 30 or 40 series of what? Cats? Cars? Nvidia GPUs?

nathan_sweet

Nvidia's Free RAG Chatbot supports documents and youtube videos (Zero Coding - Chat With RTX)

Build a Large Language Model AI Chatbot using Retrieval Augmented Generation

NVIDIA NIM: The Game-Changer in Gen AI Deployment (Build a RAG)

How RAG Turns AI Chatbots Into Something Practical

Build your own RAG (retrieval augmented generation) AI Chatbot using Python | Simple walkthrough

All You Need To Know About Running LLMs Locally

RAG Implementation Medical Chatbot with Mistral 7B LLM LlamaIndex GTE Colab Demo

Deploy AI Models to Production with NVIDIA NIM

Graph RAG UI: Powerful Chat with your Docs!

Local Retrieval Augmented Generation (RAG) from Scratch (step by step tutorial)

'I want Llama3 to perform 10x with my private knowledge' - Local Agentic RAG w/ llama3

Which nVidia GPU is BEST for Local Generative AI and LLMs in 2024?

Chatbots with RAG: LangChain Full Walkthrough

How To Install PrivateGPT - Chat With PDF, TXT, and CSV Files Privately! (Quick Setup Guide)

How to chat with your PDFs using local Large Language Models [Ollama RAG]

Guiding Chatbots / AI with Actions in NeMo Guardrails

Retrieval-Augmented Generation chatbot, part 1: LangChain, Hugging Face, FAISS, AWS

All-In-One Chatbot: RAG, Generate/analyze image, Web Access, Summarize web/doc, and more...

How Large Language Models Work

Real time RAG App using Llama 3.2 and Open Source Stack on CPU

Build a Modern AI Chatbot in Next.js 14 (2024)

Vector databases are so hot right now. WTF are they?

Build Anything with AI Agents, Here's How

Run your Own Private Chat GPT, Free and Uncensored, with Ollama + Open WebUI

Run a GOOD ChatGPT Alternative Locally! - LM Studio Overview