Shipitcon: Retrieval Augmented Generation in Practice - Scalable GenAI with LangChain and VectorDBs

preview_player
Показать описание

Shipitcon 2023 Talk: Lessons learned building Retrieval Augmented Generation, or “Chat with Documents” platforms and APIs that scale, and deploy on Kubernetes.

This talk will cover use cases for Generative AI, limitations of Large Language Models, use of RAG, Vector Databases and Fine Tuning to overcome model limitations and build solutions that connect to your data and provide content grounding, limit hallucinations and form the basis of explainable AI.

It covers use of LLAMA2, HuggingFace TGIS, SentenceTransformers embedding models using Python, LangChain, and Weaviate and ChromaDB vector databases.
Рекомендации по теме