ChatGPT O1 Explained

I reverse-engineered OpenAI's o1-preview model using o1-preview! I asked it to generate a full research paper with code, giving it dozens of related research papers from the past few years as context. It recreated a working version of the o1 model to the best of its ability. In this video, we'll go over all the details of the model, the code, and the research techniques that make the o1 model series state of the art across so many benchmarks. LET'S SPREAD THIS AI POWER. I can't wait to see what you think, enjoy!

Code & paper for this video:

Deploy your own AI trading bot (no code):

Want more AI/ML education? Connect with me here:

⏱️ **Chapters:**
0:00 - Introduction: Reproducing OpenAI's o1 Model Series
1:30 - Generating a Research Paper Using o1 Preview
2:30 - Overview of 'o1-nano': An Open Source, Explainable Model
3:30 - Understanding Chain-of-Thought Reasoning in o1 Models
4:30 - How Reinforcement Learning is Used in Training and Inference
5:30 - Exploring Reasoning Paths and Subtasks During Inference
6:30 - Unpacking OpenAI's Reasoning Tokens
7:30 - Overview of the Model Architecture
8:30 - Core Components: Transformer, Chain-of-Thought Module, Reasoning Token Generator
9:30 - Training the Model to Reason Better Using Reinforcement Learning
13:30 - Historical Papers Leading to o1: Chain-of-Thought and 'Let's Verify Step by Step'
15:30 - The New Scaling Law: Inference Time Scaling
16:30 - The Usage of Reinforcement Learning
17:30 - Demo of the Code: Running the Test
18:30 - Conclusion: Open Source Code and Research Paper as a Starting Point
19:00 - Closing Remarks and Encouragement to Explore the GitHub Repository

Don't forget to like, share, and subscribe for more deep dives into AI advancements!

I Built a Sports Betting Bot with ChatGPT:

I Built a Trading Bot with ChatGPT:

Watch ChatGPT Build an AI Startup:

Watch ChatGPT Build a Finance Startup:

Watch Me Build a Startup Playlist:

🔔 Subscribe and hit the notification bell to join the AI revolution!
Comments

Haven't seen or kept up with your channel in a long while, but glad to see you're still creating awesome and well-explained content!

Rob-Herrera

Give this man the compute and he will give you an open-source o1

anaskhan-lzhk

It feels good to have you back, Siraj Raval. I mean the REAL you: the you that talks about the stuff in AI that really matters and dares to take on the deep, deep R&D side of AI. I always considered you one of the original visionaries and pioneers in teaching, promoting, and leading enthusiasts and professionals forward with inspiration and ideals. Good to have you back. I hope the stigma and condemnation of the past stays in the past and you're up there with other AI channels like "yannic kilcher (by Yannic Kilcher)", "machine learning street talk (w/ Tim Scarfe and Keith Duggar)", and so many others. Never forget you are one of the earliest originals.

HD-Grand-Scheme-Unfolds

Ha, this is great. I was thinking a few days ago about what would happen if we used o1 to document itself and its paper chain, and you went and did it!

mootytootyfrooty

Awesome man, great to see you back into AI.

alexiades

Evaluations and benchmarks are missing. The current evaluation is a simple check of the model's performance on arithmetic problems: it generates a batch of arithmetic problems, runs them through the model, and computes an average reward based on whether the model's output matches the expected result. How does it compare to the actual o1 models, GPT-4o, Claude Sonnet, etc.?

ChronicleContent
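(The evaluation loop the comment above describes — generate a batch of arithmetic problems, run them through the model, average a binary reward on exact answers — can be sketched roughly like this. The `model` callable and the answer-extraction regex are assumptions for illustration, not the repository's actual interface.)

```python
import random
import re

def make_problems(n=32):
    """Generate simple addition prompts paired with their known answers."""
    problems = []
    for _ in range(n):
        a, b = random.randint(1, 99), random.randint(1, 99)
        problems.append((f"What is {a} + {b}?", a + b))
    return problems

def evaluate(model, n=32):
    """Average binary reward: 1 if the last integer in the model's
    output equals the expected answer, else 0."""
    problems = make_problems(n)
    total = 0.0
    for prompt, answer in problems:
        output = model(prompt)  # hypothetical: model maps a prompt string to an output string
        numbers = re.findall(r"-?\d+", output)
        if numbers and int(numbers[-1]) == answer:
            total += 1.0
    return total / n
```

A check like this only yields a single scalar on one narrow task family, which is exactly the commenter's point: it says nothing about how the model compares to o1, GPT-4o, or Claude Sonnet on standard benchmarks.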

I’m sorry to be “that negative guy” in the comments, but some of your claims here are overstretched, and the concepts you’re throwing around are at a surface level. You made little reference to the importance of reward models in PPO and did not distinguish between per-step and global evaluation (a critical aspect of creating the tree structure you made reference to). There’s also no evidence that reasoning models require special tokens. Finally, the applicability of your method here is super constrained, whereas other MCTS-based methods with language models manage to generalize to non-math based tasks.

You’ve produced excellent videos in the past, but this one unfortunately falls short

zacharybamberger

I have been waiting for so long for you to make AI tech videos again.
Your videos were my introduction to the AI domain. Thanks!

ShivamPradhan-cx

Interesting conjecture! I've been chasing down a lot of the same papers and have the same pet project; I'll have to take a closer look at your implementation.

yikesawjeez

Few people teach AI as well as Siraj, ❤😊

pankajdesai

Back again!! Keep publishing the type of videos you were known for!! 😊

MrMyjanusn

But why? How is this video benefiting anyone? Nothing learned, nothing interesting. I used to like your videos, but unfortunately your content isn't exciting anymore.

_OKEANOUS_