OpenAI’s new “deep-thinking” o1 model crushes coding benchmarks

Let's take a first look at the new ChatGPT o1 model - a state-of-the-art reasoning AI model from OpenAI that shows unmatched abilities in math, science, and coding.

#programming #ai #thecodereport

💬 Chat with Me on Discord

🔗 Resources

📚 Chapters

🔥 Get More Content - Upgrade to PRO

Use code YT25 for 25% off PRO access

🎨 My Editor Settings

- Atom One Dark
- vscode-icons
- Fira Code Font

🔖 Topics Covered

- What is o1?
- Update on Devin AI programmer
- How does OpenAI o1 work?
- GPT-4o vs o1 benchmarks
- o1 vs Claude
- What is the best LLM in 2024?
- Trends in Artificial Intelligence

Related recommendations

Comments

"Or maybe I'm just a horse influencer saying, a car won't take your job, but a horse driving a car will"...deep stuff man

divineaghulor

If my job was coding solutions to problems with rigorously-defined requirements, this would be concerning.

SpontaneouslyDeliberate

I like how Turing test now is how many r's are there in Strawberry.

arunkennedy

PhD student here, the key to beating any LLM is to use a stick

last_fanboy_of_golb

O1 is a hilarious name for a program which has an exponential energy bill

Beknown

Thanks to fireship for almost giving me a heart attack at the beginning and then relieving me at the end lol

Trait

Most of my job as a software engineer is meetings, design, documentation, and watching Fireship. Sitting down to code probably only accounts for 20%. I'm either totally safe or I'm doing it wrong and I'm in imminent danger.

AwesomeDwarves

5:40 *"AI won't take your job, but another man using AI will.."*

naeemulhoque

I think it’s pretty amazing they managed to build the equivalent of an all knowing but also friendly and helpful person on stackoverflow considering the lack of real training data.

If only a PhD were about skills like programming and solving equations. Literally every PhD student uses solvers for anything more complex than basic calculus anyways. The challenge of a PhD is learning how to think about things in unique ways and pushing boundaries and exploring new possibilities.

richbaird

I've been seeing people freaking out about this new model, "it's better than PhD humans at X, Y, Z!" where X, Y, Z basically amounts to data processing... like oh my god??? A computer can process data faster than a person???? WHAT???? lmao

bengrzybowski

“It’s basically just like GPT4 with the ability to recursively prompt itself”. Exactly. We are in the parlor tricks phase of this hype cycle.

andrewcampbell

0:19 - it is now 100% proven that English is the hardest subject.

ThisIsNotAUsername-vo

Impressive it can beat PhD students. But remember a PhD in breakdancing is not the same as being a breakdancer.
This one could be called GPT-Raygun.

marc-io

It still can't count how many r's in strawberry.
I think we good for a while...

romangeneral

Is it just me who feels so sad that words are disappearing from the internet? In this video, the word drug is censored just to please an algorithm. The other day I even saw someone who censored the word hate in «she hates being called wifey» smh

TheGrandChelem

all this energy to just not pay employees properly, it's crazy

waltersumofan

The potential of AI is indeed vast yet it falls short at times. In the end, it's a tool, at least for now.

RILDIGITAL

This is concerning, it took the AI over 10,000 attempts with access to every relevant example on the internet during a contest to get gold lmao

joshroberts

Fuck it, I’m becoming a plumber.

I’m also tired of these “snake game” examples. It’s just a glorified google at that point. Tons of snake examples on the web.

midicine

This New AI Model Is Genius - DESTROYS OpenAI o1 in REASONING

China STUNS OpenAI With NEW Model That BEATS o1 (DeepSeek-R1-Lite)

OpenAI’s new “deep thinking” o1 Model crushes coding benchmarks

OpenAI's new deep thinking Model o1, Best for Developers

Newest OpenAI's 'Deep-thinking' o1 Model - Tested with Real Examples

DeepSeek AI can Reason now! Beating OpenAI o1?

Open AI's o1 Model Uses 'Deep Thinking' To CRUSH Coding, Science, And Math Benchmarks

Microsoft’s AI Agents Take the Lead: 4 Key Insights from Ignite

OpenAI’s o1 Models: The Future of Deep Thinking AI 🤖 #shorts

Open-Source Q-Star! The First OPEN 'Thinking' Model (DeepSeek r1)

Beginners guide: OpenAI’s new “deep-thinking” o1 model with NotebookLM.google #AI #Notebooklm #o1...

Deepseek-r1 vs OpenAI-o1 who is the best reasoning model?

How Can AI THINK? (OpenAI's GPT-o1)

What Is OpenAI Hiding in Their 'Deep-Thinking' AI Model?

Deepseek-R1-Lite (Tested): This OPENSOURCE Model BEATS O1 & CLAUDE 3.5 SONNET!?

The NEW REASONING AI you shouldn't ignore!!

Q STAR 2.0 - new MIT breakthrough AI model IMPROVES ITSELF in REAL TIME (new Strawberry?)

The New AI Model Will Blow Your Mind: OBLITERATES OpenAI o1 in REASONING #ai #chatgpt #technology

Deepseek-R1-Lite: BEST Opensource LLM EVER! Beats Claude 3.5 Sonnet + O1! - (Fully Tested)

Ex-OpenAI CTO Murati’s New Startup is Revealed!

DeepSeek-R1-Lite: Open Source Reasoning LLMs are HERE!

New Google Model Ranked ‘No. 1 LLM’, But There’s a Problem

AI News: Musk Says AGI 2026, Open-Source Q*, Flux.1 Updates, Quantum AI, and more!