Gemini 2.0 Live API SHOCKS! + More AI Tools

preview_player
Показать описание

First up, Google's new Gemini 2.0 multimodal live API is a incredible, allowing you to talk, show, and share your screen with AI. Then, I'll show you how I used Gemini to help create an iOS widget in Xcode and even translate Vietnamese street audio! Finally, we'll dive into a deep comparison of different AI language models (ChatGPT, Claude, and Gemini Flash) to see which is the best for summarization and long-form content. Plus, a bonus look at my AI Advent Calendar content in the community for December!

0:00 Intro to Gemini 2.0 Live API
0:24 Talking to Gemini (accent & emotion analysis)
0:40 Showing Gemini (dragon fruit recognition)
0:52 Screen Sharing with Gemini (Minecraft gameplay)
1:32 Gemini Helps with Xcode and iOSWidget Creation
3:34 Gemini Translates Vietnamese Street Audio
4:08 The Future of AI Agents
4:47 AI Models Comparison
8:14 Transcript Analysis Demo
11:20 Why I Switched to Gemini Flash for Content Summaries
12:07 Outro
Рекомендации по теме
Комментарии
Автор

Great update Mike. Thank you. I’m going to have bad dreams about that Avatar though. 😂

lesf
Автор

Just came across your site....amazing videos. Can you tell me if I am piecing this together properly....would it be possible to create an app that could audio record between two or more people in a room and then automatically transcriber the audio into a transcript? Or would it be better to simply audio record the whole conversation, much like the Vietnamese street recording, and convert it?

harbukshsekhon
Автор

context why Gemini is so good is its always been multi model. so under the hood its passing the video to googles vision AI, then back to the LLM. I've been using Gemini pro since May 2024 for multi vision its been pretty good since then for dealing with Pdfs, images, video etc

Hawkfrygroup
Автор

The AI avatar is *mostly* believable, but the head motion is a little exaggerated.

JohnFrazier
Автор

Please tell us how to download apk file from bolt ai app builder after creating an complete app to test in android phone iphone and to launch in Play Store and apple store please

Kvn
Автор

Is there anyway for the listener to return your voice to its original sound? I guess I’m in the minority but I find the voice enhancement creepy.

rjperkins
Автор


What's your go to AI model? Share below! 👇

CreatorMagicAI