Gemini-1.5 Pro Experiment (0801): NEW Updates to Gemini BEATS Claude & GPT-4O (Fully Tested)


In this video, I'll be talking about the new Gemini-1.5 Pro Experiment (0801). This is a brand-new update by Google to Gemini 1.5 Pro, and it makes the Gemini model beat Claude-3.5-Sonnet and GPT-4o. The model is now fully on par with SOTA models and ranks above every other model in the LMSys Arena. It can be used in Google AI Studio completely free. It is even better at coding tasks and is also really good at Text-To-Application, Text-To-Frontend, and more. I'll be testing it to find out whether it can really beat other LLMs, and I'll also show you how you can use it.
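For anyone who wants to try it outside the AI Studio UI, here is a minimal sketch using the google-generativeai Python SDK; the model identifier gemini-1.5-pro-exp-0801 and the placeholder API key are assumptions on my part, so check the model picker in Google AI Studio for the exact name available to your account.

import google.generativeai as genai  # pip install google-generativeai

genai.configure(api_key="YOUR_API_KEY")  # placeholder: paste a free key from Google AI Studio
model = genai.GenerativeModel("gemini-1.5-pro-exp-0801")  # assumed id for this experimental release
response = model.generate_content("Write a Python function that merges two sorted lists.")
print(response.text)

If that id isn't accepted, the same call works with whichever Gemini 1.5 Pro model name your key lists.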

-------
Key Takeaways:

📈 Google's New Gemini 1.5 Pro Experiment Model: Discover the latest AI model from Google, surpassing Claude 3.5 Sonnet and GPT-4o in the LMSys Arena.

🔍 Free Access to Google AI Studio: Test out the Gemini 1.5 Pro Experiment model for free and experience cutting-edge AI technology firsthand.

🤔 No Benchmarks Yet: Explore the capabilities of this experimental model yourself, as official benchmarks haven't been released.

🔜 Possible Gemini-2 Preview: This might be an early look at the Gemini-2 model, rumored to keep the same 2-million-token context window as the current Gemini 1.5 Pro.

✅ Impressive Performance: From math problems to coding tasks, Gemini 1.5 Pro excels in various challenges, proving its robust AI capabilities.

🎨 Multimodal Abilities: Beyond text, this AI can handle images, video, and more, making it a versatile tool for content creators and developers.

💡 AI Innovation from Google: Stay ahead of the curve with one of the best models from Google, potentially surpassing competitors like Claude 3.5 Sonnet.

---------
Timestamps:

00:00 - Introduction
00:55 - About Gemini 1.5 Pro Experiment 0801
01:54 - Testing (Textual)
06:37 - Textual Question Final Results
07:02 - Multimodal Testing
08:14 - Final Conclusion
08:53 - Ending
Comments

It answers like a politician. Google upsets me with its insane policy guard. I swear Nuns built and trained it.

hope

Just tested it. I like it. I did a kids' story. Lots of humour and whimsy. Absolutely brilliant. Ten times better than any other LLM, and it beats ChatGPT by miles.

Dystopia

Great, can you make a video with Aider and this?

bestcinemaonline

Make a video building a full-stack app using Next.js and Supabase with Aider and this ❤

search-bd

Use Aider with it bro, please. Love your content, wish I could support more!

hamzaIVX

I'm interested whether you designed questions 7 and 9 like that on purpose, because they throw me off too. I know what the long and short diagonals are because I've studied them, so for me and you they're defined and we know what they mean; that is their name. But someone who hasn't studied them, even if they are better at maths than us, will struggle greatly with this question. For them, the question is more complicated. They would think along these lines: a long diagonal goes from one corner to another corner, giving the line the greatest length possible, and then there is a diagonal within the hexagon giving the shortest length. Interesting points here are that the two lines don't have to start from the same place (but can), and that they don't have to be at a vertex/corner of the shape (if you know what a short diagonal is then yes they do, but if you don't know what it refers to then no, they don't). This question will take anyone a great deal of thinking and time unless they're just repeating it from before.

Earlier, when I said the greatest length is the long diagonal, we could say it doesn't have to be the longest diagonal possible, just the longer of the two. Is it the longest diagonal of the shape, or the longer of the two? It depends on how the long diagonal is defined (for those who haven't studied it and don't know what it refers to). Therefore it's undefined, so either one is fine; it's just another point that comes up. It's a good test question, though, to see what the model says.

Typically, the models I have tested so far are not great at reviewing or performing multi-stage operations. For example, if a program has an issue, the models will try to apply what they know to fix the error so that there is no error, rather than taking a second look at everything else and realising that this error might suggest a mistake elsewhere, one that would keep the program from achieving its objective even if this error were resolved. They will often get stuck on a problem and continue to suggest the same solution, or fail to provide the correct solution in those scenarios.

Interesting questions; if someone asked me those two questions in person, I would walk away. Currently the models have a great advantage in speed and memory but lack intelligence, so they need a lot of guidance and prompting. Then there are the token limits that get in the way. Gemini's context window makes it really interesting, for programming for instance.
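For reference, the geometry the question leans on, using the standard definitions for a regular hexagon with side length $s$ (the long diagonal joins opposite vertices, the short diagonal joins vertices two apart):

d_{\text{long}} = 2s, \qquad d_{\text{short}} = \sqrt{3}\,s, \qquad \frac{d_{\text{long}}}{d_{\text{short}}} = \frac{2}{\sqrt{3}} \approx 1.15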

ZeerakImran

Honestly, it doesn't beat 3.5 Sonnet in my tests.

juliovac

Would love to see an aider video with this. The insane context should be really good for that.

fuba

Please please please make an Aider video with this.

Unifactyt

Of course we want an aider video with it

ganian

Seeing it working with Aider would be a great video to watch. Big fan of your channel. Thank you ❤

mr.arshed

Since your benchmark is uploaded to YouTube and Google has the transcripts, I wonder if that means that this new version is trained... on your benchmark?

supercurioTube

More videos with Aider + Gemini-1.5 Pro Experiment (0801), please 🙏

marma

No. It's not equal to Claude 3.5 Sonnet, let alone better.

ilyass-alami

Always showing me the news in AI. Good job!

TsillALevi

I am tired of testing Gemini and seeing it fail every single time. Maybe after a few years I will test it.

haydar_kir

I don’t see a join button on your channel

stonedoubt

Have any of the tested AIs succeeded on the SVG generation test?

alissonprimo

Let's ask it whether the US government has been using this kind of AI (and better) since 2012, and whether you are an AI as well, because Google's AlphaProof proved it's good at math by achieving silver last week 😅

RealLexable