LlamaIndex Webinar: PrivateGPT - Production RAG with Local Models

Want to learn how to build a production-ready RAG stack using purely local models?

In this video we're excited to host Ivan and Daniel, creators of PrivateGPT, the leading project for privacy-aware RAG.

It's powered by LlamaIndex, and you can also learn how to use LlamaIndex modules at the right level of abstraction to build your own production RAG system 🔥

Besides this, we will also have an open Q&A session about proper UX, productionization, and privacy challenges.
Comments

Great webinar! Very informative.
I have a few questions:
1) Does anyone know NVIDIA's equivalent (4090, T400...) to Apple's M3 mentioned at the beginning of the webinar in terms of LLM inference use cases?
2) Still on hardware: can a single instance of an LLM running on a single GPU answer multiple requests at the same time? How does it work for the GPU when 300 users are querying the pipeline at the same time?
3) On the question of attacks and people trying to poison the application, what do you think of the NeMo Guardrails framework?
4) Concerning PrivateGPT's option of manually choosing between querying the documents and querying the LLM directly, are there any solutions that would automatically detect whether the query requires the domain-specific RAG pipeline or whether it can be sent directly to the LLM? (See the sketch after this comment.)
Thank you very much for your time!

ripeer-dqbj
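For question 4, routing can be automated with an LLM-based selector. Below is a minimal sketch (not PrivateGPT's actual implementation, as far as I know) using LlamaIndex's RouterQueryEngine: the selector reads each engine's description and picks either the RAG engine or a direct LLM pass-through per query. Import paths assume llama-index >= 0.10, "./docs" is a placeholder path, and whatever LLM/embedding models are configured in Settings are used.

```python
# Hypothetical sketch: route each query either to the document (RAG) engine
# or straight to the LLM, using an LLM-based selector.
from llama_index.core import Settings, SimpleDirectoryReader, VectorStoreIndex
from llama_index.core.query_engine import CustomQueryEngine, RouterQueryEngine
from llama_index.core.selectors import LLMSingleSelector
from llama_index.core.tools import QueryEngineTool


class DirectLLMQueryEngine(CustomQueryEngine):
    """Sends the query straight to the configured LLM, skipping retrieval."""

    def custom_query(self, query_str: str) -> str:
        return str(Settings.llm.complete(query_str))


# RAG engine over the ingested documents ("./docs" is a placeholder).
docs = SimpleDirectoryReader("./docs").load_data()
rag_engine = VectorStoreIndex.from_documents(docs).as_query_engine()

tools = [
    QueryEngineTool.from_defaults(
        query_engine=rag_engine,
        description="Questions that need the ingested, domain-specific documents.",
    ),
    QueryEngineTool.from_defaults(
        query_engine=DirectLLMQueryEngine(),
        description="General questions answerable without any documents.",
    ),
]

# The selector prompts the LLM with both descriptions and picks one per query.
router = RouterQueryEngine(
    selector=LLMSingleSelector.from_defaults(),
    query_engine_tools=tools,
)
print(router.query("Summarize the refund policy in our contracts."))
```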

I'm interested in using PrivateGPT. Can I use it if I host the local model on a different server and access it through the oobabooga API?

vgtgoat
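That setup should work in principle: oobabooga's text-generation-webui can expose an OpenAI-compatible API (started with its --api flag, serving /v1 on port 5000 by default, if I recall correctly), and anything speaking that protocol can sit behind the pipeline; recent PrivateGPT versions also ship an "openailike" LLM mode in settings.yaml for this case. A hedged sketch at the LlamaIndex level, with a hypothetical host and model name:

```python
# Sketch: point a LlamaIndex pipeline at a remote oobabooga server through
# its OpenAI-compatible API. Host, port, and model name are hypothetical.
from llama_index.llms.openai_like import OpenAILike

llm = OpenAILike(
    api_base="http://my-llm-server:5000/v1",  # oobabooga started with --api
    api_key="not-needed",  # the key is ignored, but the field is required
    model="loaded-model-name",  # whatever model the server has loaded
    is_chat_model=True,
)
print(llm.complete("Hello from a model on another machine."))
```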

This question never gets answered on any platform: how do you ensure it supports multiple users simultaneously? It crashes.

imadahmad
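One way to check this on your own deployment is to fire concurrent requests at the server and watch what fails. Note that a single llama.cpp-backed instance typically processes generations one at a time, so heavy multi-user traffic usually needs a serving layer with batching (e.g. vLLM) behind the same API. A small load-test sketch, assuming PrivateGPT's FastAPI server is running on its default port 8001 and exposing the OpenAI-style /v1/chat/completions route; adjust URL and payload to your deployment:

```python
import asyncio

import httpx

URL = "http://localhost:8001/v1/chat/completions"  # PrivateGPT's default port
PAYLOAD = {"messages": [{"role": "user", "content": "ping"}]}


async def one_request(client: httpx.AsyncClient, i: int) -> None:
    try:
        r = await client.post(URL, json=PAYLOAD, timeout=120)
        print(f"request {i}: HTTP {r.status_code}")
    except httpx.HTTPError as exc:
        print(f"request {i}: failed ({exc!r})")


async def main(n: int = 10) -> None:
    # Launch n requests concurrently to see how the server copes.
    async with httpx.AsyncClient() as client:
        await asyncio.gather(*(one_request(client, i) for i in range(n)))


asyncio.run(main())
```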

Can I run this on the Google Colab free version? That is a Linux environment.

scitechtalktv
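The free Colab tier is a Linux VM, so the local-model stack itself runs there; whether the full PrivateGPT app fits in free-tier RAM is another matter. A minimal sketch of running a local GGUF model in Colab via LlamaIndex's llama.cpp integration (after pip install llama-index-llms-llama-cpp llama-cpp-python; the model URL is illustrative, substitute any checkpoint that fits the free tier):

```python
# Sketch: run a small local model inside Colab's Linux runtime.
# The model URL is illustrative; pick any GGUF that fits free-tier RAM/VRAM.
from llama_index.llms.llama_cpp import LlamaCPP

llm = LlamaCPP(
    model_url=(
        "https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GGUF/"
        "resolve/main/mistral-7b-instruct-v0.2.Q4_K_M.gguf"
    ),
    temperature=0.1,
    max_new_tokens=256,
    context_window=4096,
    model_kwargs={"n_gpu_layers": -1},  # offload all layers if a GPU is attached
)
print(llm.complete("Say hello from Colab."))
```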

What is the GitHub repo of the project you are presenting here? How can we build the same system as you?

MrAlket