Gemini 2.0 for developers

Discover Gemini 2.0, the latest of Google’s multimodal AI models. The model can generate native image and audio output and features enhanced spatial understanding and tool use (Google Search, code execution, and function calling). Explore the new Multimodal Live API, which lets developers build real-time multimodal applications with audio and video streaming inputs from cameras or screens. Try Gemini 2.0 Flash (experimental) in the Gemini API, Google AI Studio, and Vertex AI.
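The Multimodal Live API runs over a websocket: the client opens a connection and sends a "setup" frame naming the model and the response modalities it wants before streaming audio or video. The sketch below builds such a frame as JSON; the exact field names and the model identifier follow the publicly documented protocol as I understand it, so treat them as assumptions rather than a definitive client.

```python
import json

def build_setup_message(model: str = "models/gemini-2.0-flash-exp") -> str:
    """Build the first frame a Live API client sends after connecting.

    Field names are assumptions based on the documented bidirectional
    streaming protocol; verify against the current API reference.
    """
    setup = {
        "setup": {
            "model": model,
            "generation_config": {
                # Ask for spoken responses; "TEXT" is the other option.
                "response_modalities": ["AUDIO"],
            },
        }
    }
    return json.dumps(setup)

print(build_setup_message())
```

In a real session this string would be sent as the first websocket message, after which the client streams chunked audio/video input and receives streamed output frames.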

Chapters:
0:00 - Gemini 2.0 Developer Announcement
0:41 - Multimodal Live API
3:58 - Multilingual native audio out
5:25 - Gemini 2.0 features
6:11 - Start building with Gemini 2.0

Resources:
Try Gemini 2.0 Flash (experimental)

Try code examples for Gemini 2.0 Flash

Watch videos on Gemini 2.0:
Building with Gemini 2.0:
Multimodal Live API:

Follow the new “Google AI for Developers” social channels

#GoogleAI #Gemini

Products Mentioned: Gemini 2.0, Google AI Studio, Gemini API, Vertex AI
Comments:

I just used it to analyze gameplay almost in real time. It blew my mind, but it crashed every few minutes. Incredible model.

mrbananapsychooo

Great improvement! Good job to all involved 👏

banzai

These interactions won't last more than 3 minutes; I get "something went wrong" and then have to start all over.

bitcode_

I tried to share screen with Gemini while watching a movie, but it just kept responding to the movie every two or three seconds. It seems it cannot tell the difference between my voice and sound in the video.

kekewan-ll

I feel like I can make an assistant for.... anything... with these APIs. Multi-lingual assistants, even though I am not. Insane.

NewNerdInTown

Wait a second, if this is out.. Astra can't be far off!! 😊

Maybe by the end of this year?!

EchoYoutube

It can listen to you while you interrupt it. So crazy! 😮

aigriffin

I'm so disappointed. There's no file access.

dr.mikeybee

This is extraordinary; it opens up an impressive range of possibilities.

stanleyillidge

Google AI Studio always returns the error "An internal error has occurred".

orikla

What about user voice isolation? Without that, the voice experience is very limited.

tijendersingh

Would be nice… if Studio didn't throw an error every minute or two.

Jason_vinion

That's very impressive, nice work Google! One remark at 3:08: wouldn't it be better if, when a user interrupts the AI while it is speaking, the AI stopped immediately and listened to what the user wants, rather than waiting until the user finishes the new request?

Richard_GIS

When it started to speak English, French, and Korean, it was mind-blowing 😮

Lruiz

Can TTS output the word timings, too? We'd like to have closed captions for accessibility requirements.

DangRenBo

I can't reproduce the car-to-convertible demo; no image is generated. Anyone else?

jackquiver

How do I embed voice-enabled help in my website using Gemini? For example, I own a bank and I want my customers to learn how to use the website, say, connecting another bank to transfer money.

ArunKumar-jkpq

Hi, is it possible to share my screen with Gemini, show my visual trading setup across some days and trades, and explain the setup with audio at the same time, in order to code these human trading decisions in C#, for example for the Quantower API? Thanks for your answer.

evanbassmusic

We could put this Multimodal Live API in a robot and interact with it more naturally.

checkoverstripes

Websocket technology is simply not designed for such a high volume of data transfer and concurrency; that's why we see so many complaints about errors. Before, we were using SSE (Server-Sent Events) and it was much more robust. We need to improve websockets in order to be able to establish long, stable, multimodal conversations.

hoomansedghamiz
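Several commenters above report sessions dropping with "something went wrong" after a few minutes. Until long-lived websocket streams are hardened server-side, a client can mask transient disconnects by reconnecting with capped exponential backoff and resending its setup frame. The sketch below shows that pattern; all names are illustrative, not part of any official SDK.

```python
def backoff_delays(attempts: int, base: float = 0.5, cap: float = 30.0):
    """Return capped exponential backoff delays in seconds (no jitter)."""
    return [min(cap, base * (2 ** i)) for i in range(attempts)]

def run_with_reconnect(connect, max_attempts: int = 5, sleep=lambda s: None):
    """Call `connect()` until it succeeds or attempts are exhausted.

    `connect` would open the websocket and replay the setup message;
    here it is any callable that raises ConnectionError on failure.
    `sleep` is injectable for testing; in real code pass time.sleep.
    """
    for delay in backoff_delays(max_attempts):
        try:
            return connect()
        except ConnectionError:
            sleep(delay)  # wait before the next reconnect attempt
    raise ConnectionError(f"gave up after {max_attempts} attempts")

print(backoff_delays(7))  # delays grow 0.5, 1.0, 2.0, ... capped at 30.0
```

In a real Live API client, jitter would normally be added to the delays so many clients do not reconnect in lockstep after a server-side outage.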