LlamaIndex Workshop: Multimodal + Advanced RAG Workshop with Gemini
The Google Gemini release included both exciting multi-modal capabilities and semantic retrieval.
In this workshop, we cover two cool LLM + RAG use cases with Google Gemini:
1️⃣ Multi-modal RAG: Use the Gemini model to extract structured outputs from images. Then learn how to index the extracted text and images and build a QA system over them (also using Gemini).
2️⃣ Advanced RAG: Learn how to use the brand-new Semantic Retrieval API. You can decompose it into separate components: custom embedding-based retrieval and custom response synthesis.
We had the pleasure of co-hosting this with folks from the Google Labs team (Cher Hu, Lawrence Tsang, Michael Chen).
Timeline:
00:00-27:20 Advanced RAG
27:20-52:59 Multimodal