filmov
tv
Multimodal AI

Показать описание
In this episode Lan and Ryan show Martin a multimodal AI application that solves a real business problem. “Multimodal” means that the application processes video, audio, and text to create output. Lan and Ryan demo the finished application and also dig into the code to show how to call Google’s Vertex AI.
Chapters:
0:00 Intro
1:01 The business problem
2:04 Demo
2:45 Code walkthrough
5:15 Takeaways
6:07 Wrap-up
Resources:
#serverless #ai #vertexai #cloudfunctions
Speakers:
Lan Tran, Customer Solution Engineer
Ryan Sibbaluca, Customer Solution Engineer
Martin Omander, Developer Advocate
Products Mentioned: Vertex AI, Cloud Functions
Chapters:
0:00 Intro
1:01 The business problem
2:04 Demo
2:45 Code walkthrough
5:15 Takeaways
6:07 Wrap-up
Resources:
#serverless #ai #vertexai #cloudfunctions
Speakers:
Lan Tran, Customer Solution Engineer
Ryan Sibbaluca, Customer Solution Engineer
Martin Omander, Developer Advocate
Products Mentioned: Vertex AI, Cloud Functions
How do Multimodal AI models work? Simple explanation
Multimodal AI
AI Explained - Multimodal AI
Multimodal AI in action
Brilliant Labs launches Frame, the first multimodal AI glasses 👓
What Is Multimodal AI? | AI Tutorials For Beginners | Gemini | ChatGPT | Gemma | Simplilearn
Google Unveils Project Astra:Your Multimodal AI Assistant #adkeeda #ai #projectastra #google #aiart
FRAME: multimodal AI Smart Glasses #shorts
How a Purpose-Built Database for Multimodal AI Can Save You Time and Money?
What is Multi Modal AI - An Easy Explanation For Anyone
Apple Unveils MM1 A Multimodal AI Model Family | #AppleAI #AITrends
What is Multi-modal AI? | What is by Digit EP9 | #multimodalai #multimodal #AI
Multimodal AI: LLMs that can see (and hear)
How to build Multimodal Retrieval-Augmented Generation (RAG) with Gemini
what is so unique about multimodal AI
The most important AI trends in 2024
Ray-Ban Meta Multimodal AI - Early Access!
Multimodal AI from First Principles - Neural Nets that can see, hear, AND write.
Gemini AI MultiModal Model Course
Get Started with Multimodal AI Disease Prediction | Intel Software
GPT-4o, AI overviews and our multimodal future
Building Multimodal AI RAG with LlamaIndex, NVIDIA NIM, and Milvus | LLM App Development
Multimodal AI-Deep Learning-What is Multimodal AI-How it works-Applications
The Ray-Ban Meta Smart Glasses have multimodal AI now! #rayban #ai #artificialintelligence
Комментарии