Multimodal AI

preview_player
Показать описание
In this episode Lan and Ryan show Martin a multimodal AI application that solves a real business problem. “Multimodal” means that the application processes video, audio, and text to create output. Lan and Ryan demo the finished application and also dig into the code to show how to call Google’s Vertex AI.

Chapters:
0:00 Intro
1:01 The business problem
2:04 Demo
2:45 Code walkthrough
5:15 Takeaways
6:07 Wrap-up

Resources:

#serverless #ai #vertexai #cloudfunctions

Speakers:
Lan Tran, Customer Solution Engineer
Ryan Sibbaluca, Customer Solution Engineer
Martin Omander, Developer Advocate

Products Mentioned: Vertex AI, Cloud Functions
Рекомендации по теме
Комментарии
Автор

Thanks for sharing. It was cool to see the root prompt in the code informing the LLM on what persona to take on.

go.ryanpie
Автор

Thanks for this informative video. I just completed the L400 Gen AI partner training, and its time to start thinking about applications - so this was very inspiring.

robinyoulton
Автор

Thanks x a billion. Much respect, much love!

TheyCanceledhim
Автор

Thank you 👏
Sometimes, it’s fun to run these locally (SQLite), thanks a bunch!

banzai
Автор

I want to deploy Gemini multimodel Libraries to extend my business and professional career development. ❤

amediarts
Автор

Thanks for sharing!
I'm new to Firebase. Could you show step-by-step how to setup the projects and run locally please.

ttwoosta
Автор

Its been decades since we are exploring the horizon without any outcome? Kindly furnish the time line

DishimokDishimok
Автор

Lewis Laura Thompson Barbara Martinez Sharon

MichaelNeumann-nv
visit shbcf.ru