Claude 3.5 is the new KING of AI 👑 Beats GPT4o

preview_player
Показать описание
Claude 3.5 Sonnet is the new benchmark for AI. Here's my full test!

Join My Newsletter for Regular AI Updates 👇🏼

Need AI Consulting? 📈

My Links 🔗

Media/Sponsorship Inquiries ✅

Links:
Рекомендации по теме
Комментарии
Автор

I need harder tests, reply to this comment with your suggestions!

matthew_berman
Автор

I was using Claude 3.5 earlier today to help me with a caching issue on a WordPress site and it showed me WordPress PHP I didn't even know existed and its result was spot on ... and so fast too.

amj
Автор

I coded an entire project with Claude 3.5, and even includes API and queuing. I was able to work with it for about 5 hours before I hit my limit for the night, I also almost maxed out the context window.

RM-xsci
Автор

I made a 3D fps with Claude in python but good to see It can make snake lol, I think you're really gonna have to start upping the complexity of some of your tests.

countofst.germain
Автор

*Having multiple streams of income is a game-changer for stability. Relying solely on a job may not provide enough financial security due to high rates of tax, it is important to explore additional investment opportunities to surpass one"s expectation*

LucindaJohnson-pyhi
Автор

How long does it take 50 people to dig a 10 foot hole .. that depends if they're on a salary or an hourly rate...

karlwest
Автор

Tougher questions should be used for new models

aymandonia
Автор

I used it to build out and entire video streaming platform, from planning with PUML to api with yii2, to web services with AWS, to mobile with flutter and Web app with ReactJS, it was literally did all that in 2-3 weeks. Its insane

DinitoThompson
Автор

I can't wait until the day where "one" is the answer to how many words are in the response to this prompt. Just, "one".

jonberrydotnet
Автор

The hype isn't just hype. It's INCREDIBLE

OriginalRaveParty
Автор

Wow! Thank you for your test videos. So helpful, and fascinating to boot! Great channel!

joyflowmonger
Автор

Thanks for your video, once again you did it great! Very understandable even for non native speakers. I watch each of your videos, continue like this 😎

thibaultwislez
Автор

Would love to see a follow up to this video where you explore advanced data analysis use cases for this model. Thanks for the video, Matt!

ryguy
Автор

Since Claude 3.5 knowledge cut off is Feb 2024, wouldn't it have the answers provided by its training?

crippsuniverse
Автор

I watched all of your videos. Keep up the amazing work!

jabak
Автор

first time i heard of a LLM answering the upside down glass problem.

dafunkyzee
Автор

Finally, the moment I waited for since Claude 1, , , , web access, iOS app, incredible logic & multimodality…

fahadxxdbl
Автор

It seems like reasoning about its own output is an important step for any model that is hoping to get to AGI.

johnbollenbacher
Автор

I wonder if it even makes sense to test AI with a puzzle for which there exists an answer in internet.

The same for new test questions in a popular channel. It will only work once. The next model will have it in their training set.

So it won't be a test for reasoning, it would be a test for copy-pasta.

centurn
Автор

TBH, pleasantly surprised how really good Sonnet is, real ChatGPT competitor

dmistclesgee