Never Browse Alone? Gemini 2 Live and ChatGPT Vision

preview_player
Показать описание
The ‘Gemini 2 Era’ begins … with screen-sharing? But really, it’s a great free tool, for curiosity satisfying rather than bleeding-edge intelligence. I give you the benchmarks, the highlights and of course, the latest from OpenAI Advanced Voice Mode with Vision.

Plus Deep Research in Gemini Advanced, Simple Bench updates, Santa and what might be for some of you Google’s deflating admission.

Chapters:
00:00 - Introduction
00:38 - Live Interaction
03:43 - Gemini 2.0 Flash Benchmarks
05:10 - Audio and Image Output
06:38 - Project Mariner (+ WebVoyager Bench)
08:49 - But Progress Slowing Down?
10:43 - OpenAI Announcements + Games

Рекомендации по теме
Комментарии
Автор

This is the only channel I still trust to get my Tic Tac Toe news

slgnssp
Автор

The real shipmas is the frequency of these AI Explained Videos.

markopolio
Автор

The debate over slowing progress in LLMs overlooks a key point: while model advancement rates may be debatable, we're nowhere near realizing the potential of existing capabilities. Emergence isn't just about unexpected model capabilities appearing; it's also about practitioners discovering unexpected possibilities through creative applications of current systems.

robkline
Автор

the most impressive thing for me is that they actually have the capacity to roll this out. we've come a long way since google got caught flat-footed and had nothing more than poor old lamda-based bard prototypes because everything else was too heavy to serve

autingo
Автор

So far this new Gemini is the only amazing thing to come out during OpenAI’s 12 days

Creepaminer
Автор

Thank you so much for your videos! Quick uploads, high quality, intelligent, and yet still fun to watch. In the past weeks, the amount of time I have decreased drastically. I stopped watching a lot of different AI YouTube channels. But let me tell you this: I did not miss a single video of yours, and I don't plan to ever miss one!

pareak
Автор

At first I thought Sam Altman was a hero but the more time passes and the more he speaks the more I realise he's just a hypeman. I don't blame him, it's his job, but it does reduce how much I trust him.

Amazing video as usual Mr Explained!

SirQuantization
Автор

“There isn’t really a wall per se, but there is a bit of a hill that we need to hike.” - Sundar Pichai

MaJetiGizzle
Автор

Pichai was saying that it gets really steep, but when the "competition" was mentioned, he changed tune.(investors are

georgesos
Автор

30 minutes ago I opened Gemini, just setting up basic parameters I want carried out as part of the background, to checkout every couple of weeks, as an experiment. As always thank you Phillip

williamjmccartan
Автор

Most surprising fact from today’s video is that your name is Phillip :D

ibonitog
Автор

the tic-tac-toe part was gold 🤣 Amazing video as always! Thank you for the laugh and the great info 👏

turner-tune
Автор

Major progress, I suspect, will shift from scaling giant general models to assembling smaller, narrower-domain specialized models -- along with memory storage and management components, and some kind of domain identification/routing element -- into a sort of modular system that's smarter than the sum of its parts.

cacogenicist
Автор

Ahh, blessed voice of reason :). And yep, even long "pre-AI" HUDs (that essentially calculate the odds, advise "textbook best play" etc. during hands) were a big issue in games like online Poker (sites initially tried to stop their use but now some at least actually bill it as a "feature").

AI will just expand that to, well, potentially _every_ game (and _that_ actually seems a sane use case to me, even with hallucinations, because it gives you an edge BUT - unless you've got a gambling problem - the stakes aren't _that_ high, unlike e.g. medical diagnosis, driving etc.).

anonymes
Автор

i've tried Gemini 2 Flash in my native language (french) and the results were HILAROUSLY bad. i asked it, "hey! can you hear me okay?" and it wrote me, i kid you not, *an essay about the meaning of what the phrase "hey! can you hear me okay?"*, instead of just replying. it did that for anything i asked. like i would literally just say "hello!" and instead of saying hello back to me it would offer translations, suggestions, explanations, of... "hello", instead of talking. i've never seen a language model do that before.

TheYoxiz
Автор

6:20 the reference to the mistake on the previous video is hilarious

Likou_
Автор

I was and still am supporting OpenAI yet this last year they have been hit hard with a lot of the key developers leaving. One of the biggest issues I think as well was that 01 was to be ChatGPT 5.0 yet it wasn't what they were hoping for. The only answer they have to fix the current issue is to simply put more compute power and why it has started to cost so much as the compute power should only be needed at the training level.

OZtwo
Автор

I used the studio to help me solve a puzzle live in a videogame. Who even needs game guides anymore :)

Raulikien
Автор

Proud of OAI shipping Gemini Flash 2.0 and all those amazing tools for their shipmas lol

HAL.
Автор

This is definitely big for their data collection program😯🔥

julianzurn