How Google Is Building Jarvis In Real Life

preview_player
Показать описание
Is Google back in the AI game or is this all hype? Google recently unveiled Gemini, a multi-modal AI model that can process text, images, video, and audio seamlessly. This ground-up model with a massive 1 million token context window could enable a real-world Jarvis-style assistant that helps with tutorials, content creation, redecorating, and more. I discuss key highlights from Gemini's capabilities compared to models like GPT-4, the vast data Google is leveraging from Search, Maps, and YouTube, and why multi-modality represents the next frontier in the quest for AGI.

Video Chapters:
00:00 Introduction
00:31 Anything-to-Anything
01:42 Google's Insane Data Moat
02:42 Why Multimodality Matters
04:46 Gemini's Three Variants
06:03 Is Google back in the game?

Connect on socials:

#ai #googlegemini #chatgpt #gpt4 #openai #geminiultra #gemininano #s24ultra
Рекомендации по теме
Комментарии
Автор

Mind boggling-- need a tutorial to learn how to use Gemini. any online recommendations?

kbssidhuex-ias
Автор

Great video and overview. YouTube is definitely a massive trove of data

vp-land
Автор

I don't know. They always seem to find a way to trip over their own shoelaces. Maybe ask this question again if they don't have another Sundar-snap in january... Can't keep up with advancement if they keep prioritizing firing people for that stock price bump.

bryanwoods