OpenAI's O3 and O3-Mini in 12 Minutes


OpenAI Unveils O3 and O3 Mini: Next-Gen Reasoning Models!

In this video, we dive into OpenAI's major announcement of their next-generation reasoning models, O3 and O3 Mini, made during their '12 Days of OpenAI' event. 🎉 The models open for public safety testing today, with a full release expected by the end of January. Notable highlights include O3’s 71.7% accuracy on the SWE-bench Verified coding benchmark and state-of-the-art performance on mathematical and scientific benchmarks. O3 Mini offers cost-effective, adjustable reasoning with low, medium, and high reasoning-effort options. Watch the video for live demos and evaluations, and learn how these advancements could shape coding and software development in 2025. 🚀 If you enjoyed this deep dive, don’t forget to like, comment, share, and subscribe!

00:00 OpenAI Unveils O3 and O3 Mini
00:21 Introduction to O3 and O3 Mini
00:42 O3's Benchmark Performance
02:25 Epoch AI's FrontierMath Benchmark
03:06 ARC Prize Foundation Announcement
04:36 O3 Mini: Cost-Efficient Reasoning
05:29 Live Demo of O3 Mini
09:26 O3 Mini's Math and Latency Performance
10:26 API Features and Future Plans
11:28 Conclusion and Call to Action
Comments

OpenAI was so freaked out by Google that they skipped o2 and went straight to o3 😂😂

jaysonp

This is great, we should try it out. Merry Christmas!!🎉🎊🎇🎅😄

nht-sk

The golden question here is: will O3 have the absolutely essential Web Search function that only GPT-4o has now? Otherwise it’s semi-useless…

ArianeQube

Pretty impressive, especially on those closed benchmarks. They need to work on their PR though...

stonedizzleful

Thanks for doing this! Pretty awesome to see Greg in there!

gitmaxd

We launched this new model. But it's not available 😮

haroldpierre

O2 wasn't an option as it's already a legal name, easy to overlook until you go to actually register something.

Just hope they don't prioritise Crapple with its desktop capabilities, and that it has web search (even better, MCP).

tomgreen

They presented something that is not really released, showing random charts. I don't care about their charts until the model is available and testable.
Just look at how they presented Sora and how much it actually hallucinates. I don't care about words.

ukervwc

Nice to see the numbers… but in the real world I wasn’t impressed by o1 for coding compared to Claude Sonnet.

Dennis_Troeger

Nobestudy intelligence is quite interesting

NoelSebastian-qm

Hey Google, announce another model plz .. bury the o3 "AGI" 😂

matkeyboard

AGI… A Google Investment? OpenAI has to win this… I am worn out by the hype.

calvingrondahl