filmov
tv
Pinecone Workshop: LLM Size Doesn't Matter — Context Does
![preview_player](https://i.ytimg.com/vi/GkQ52svNUhM/maxresdefault.jpg)
Показать описание
This discussion is critical for every AI, Engineering, Analytics, and Product leader interested in deploying AI solutions. Pinecone and Prolego share the surprising insights from their independent research studies on optimizing LLM RAG applications.
Although state-of-the-art LLMs like GPT-4 perform best on general benchmarks, small and open-source LLMs perform just as well when given the right context. These results are critical for: (1) overcoming policy constraints preventing you from sending data to model providers, (2) reducing the costs and increasing the ROI of your RAG, and (3) giving you more control over your models and infrastructure.
Although state-of-the-art LLMs like GPT-4 perform best on general benchmarks, small and open-source LLMs perform just as well when given the right context. These results are critical for: (1) overcoming policy constraints preventing you from sending data to model providers, (2) reducing the costs and increasing the ROI of your RAG, and (3) giving you more control over your models and infrastructure.
Pinecone Workshop: LLM Size Doesn't Matter — Context Does
Don't naive RAG do hybrid search instead (Pinecone Weaviate or pgvector + full text search &...
Building the Future with LLMs, LangChain, & Pinecone
Building Production-Ready RAG Applications: Jerry Liu
Build enterprise-grade Q&A at scale with Open LLMs on AWS
Hugging Face LLMs with SageMaker + RAG with Pinecone
LlamaIndex Webinar: Build/Break/Test LLM Apps Showcase (co-hosted with TrueEra, Pinecone)
Building Production-Grade LLM Apps
Optimizing query performance with Pinecone's Rerank API and Fleak
The ULTIMATE Local AI Setup: LLMs, Qdrant, n8n (NO CODE!!)
RAG Workshop with Langchain and LlamaIndex
How to build Multimodal Retrieval-Augmented Generation (RAG) with Gemini
The Future of Multi-Modal Search
Harnessing RAG with Open-Source LLMs — @matthew_berman, @jamesbriggs
Boosting Your Chatbot's Intelligence with Retrieval
PGVector vs Pinecone vs Redis - how should you choose a vector database?
Troubleshooting Search and Retrieval: Enhance Performance with Powerful AI Solutions
Generative AI | Jobs with GPT 3, GPT 4, ChatGPT Knowledge and Skills | Career Talk With Anand
Vector Database Workshop - Scaling LLMs by Yujian Tang
LlamaIndex: Building RAG Applications with Multiple Data Sources
ClippyGPT - How I Built Supabase’s OpenAI Doc Search (Embeddings)
The missing pieces to your AI app (pgvector + RAG in prod)
Dan Gibson - An Introduction to Retrieval Augmented Generation - PyData London 2024
LlamaIndex Webinar: Document Metadata and Local Models for Better, Faster Retrieval
Комментарии