OpenAI’s new “deep-thinking” o1 model crushes coding benchmarks

preview_player
Показать описание
Let's take a first look at the new ChatGPT o1 model - a state-of-the-art reasoning AI model from OpenAI that shows unmatched abilities in math, science, and coding.

#programming #ai #thecodereport

💬 Chat with Me on Discord

🔗 Resources

📚 Chapters

🔥 Get More Content - Upgrade to PRO

Use code YT25 for 25% off PRO access

🎨 My Editor Settings

- Atom One Dark
- vscode-icons
- Fira Code Font

🔖 Topics Covered

- What is o1?
- Update on Devin AI programmer
- How does OpenAI o1 work?
- GPT-4o vs o1 benchmarks
- o1 vs Claude
- What is the best LLM in 2024?
- Trends in Artificial Intelligence
Рекомендации по теме
Комментарии
Автор

"Or maybe I'm just a horse influencer saying, a car won't take your job, but a horse driving a car will"...deep stuff man

divineaghulor
Автор

If my job was coding solutions to problems with rigorously-defined requirements, this would be concerning.

SpontaneouslyDeliberate
Автор

I like how Turing test now is how many r's are there in Strawberry.

arunkennedy
Автор

PHD student here, the key to beat any LLM is to use a stick

last_fanboy_of_golb
Автор

O1 is a hilarious name for a program which has an exponential energy bill

Beknown
Автор

Thanks to fireship for almost giving me a heart attack at the beginning and then relieving me at the end lol

Trait
Автор

Most of my job as a software engineer is meetings, design, documentation, and watching Fireship. Sitting down to code probably only accounts for 20%. I'm either totally safe or I'm doing it wrong and I'm in imminent danger.

AwesomeDwarves
Автор

5:40 *"Ai won't take your job, but another man using Ai will.."*

naeemulhoque
Автор

I think it’s pretty amazing they managed to build the equivalent of an all knowing but also friendly and helpful person on stackoverflow considering the lack of real training data.

Автор

If only a PhD were about skills like programming and solving equations. Literally every PhD student uses solvers for anything more complex than basic calculus anyways. The challenge of a PhD is learning how to think about things in unique ways and pushing boundaries and exploring new possibilities.

richbaird
Автор

I've been seeing people freaking out about this new model, "it's better than PHD humans at X, Y, Z!" where X, Y, Z basically amounts to data processing... like oh my god??? A computer can process data faster than a person???? WHAT???? lmao

bengrzybowski
Автор

“It’s basically just like GPT4 with the ability to recursively prompt itself”. Exactly. We are in the parlor tricks phase of this hype cycle.

andrewcampbell
Автор

0:19 - it is now 100% proven that English is the hardest subject.

ThisIsNotAUsername-vo
Автор

Impressive it can beat PhD students. But remember a PhD in breakdancing is not the same as being a breakdancer.
This one could be called GPT-Raygun.

marc-io
Автор

It still can't count how many r's in strawberry.
I think we good for a while...

romangeneral
Автор

Is it just me who feels so sad that words are disappearing from the internet ? In this video, the word drug is censored just to please an algorithm. The other day I even saw someone who censored the word hate in «she hates being called wifey» smh

TheGrandChelem
Автор

all this energy to just not pay employees properly, it's crazy

waltersumofan
Автор

The potential of AI is indeed vast yet it falls short at times. In the end, it's a tool, at least for now.

RILDIGITAL
Автор

This is concerning, it took the AI over 10, 000 attempts with access to every relevant example on the internet during a contest to get gold lmao

joshroberts
Автор

Fuck it, I’m becoming a plumber.

I’m also tired of these “snake game” examples. It’s just a glorified google at that point. Tons of snake examples on the web.

midicine