OpenAI O1 models probably trained gpt-4o and turbo in chain of thought

did openai train gpt-4o with strawberry using reinforcement learning on chain of thought? openai's new orion o1-preview models have made a step change in logic and reasoning over older models. however, many are claiming it's easily replicated just by using chain of thought, but for this to work the models have to be good at chain of thought in the first place. in this video, chris looks under the hood at the generated chain of thought for the orion o1 models and compares it with the cot of gpt-4o, claude 3.5 sonnet, and llama 3. he does this using games such as sudoku and tic-tac-toe. by the end of this video you'll have a better idea of how this works.
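The video does not publish its evaluation code, but the kind of comparison it describes can be sketched in a few lines. The snippet below is a hypothetical scoring harness (all names are my own, not from the video): given a tic-tac-toe board and the move a model's chain of thought arrives at, it scores the answer as illegal, merely legal, or immediately winning. Running the same rubric over each model's moves is one simple way to compare cot quality.

```python
# Hypothetical rubric for scoring a model's tic-tac-toe move, as a way to
# compare chain-of-thought quality across models. Boards are 9-char strings,
# indices 0-8 left-to-right, top-to-bottom; '.' marks an empty square.

WIN_LINES = [
    (0, 1, 2), (3, 4, 5), (6, 7, 8),  # rows
    (0, 3, 6), (1, 4, 7), (2, 5, 8),  # columns
    (0, 4, 8), (2, 4, 6),             # diagonals
]

def legal_moves(board: str) -> list[int]:
    """Indices of empty squares."""
    return [i for i, c in enumerate(board) if c == "."]

def is_winning_move(board: str, move: int, player: str) -> bool:
    """True if placing `player` at `move` completes a line."""
    b = board[:move] + player + board[move + 1:]
    return any(all(b[i] == player for i in line) for line in WIN_LINES)

def score_answer(board: str, move: int, player: str) -> int:
    """Crude rubric: 0 = illegal move, 1 = legal, 2 = winning."""
    if move not in legal_moves(board):
        return 0
    return 2 if is_winning_move(board, move, player) else 1

# Example: X has two in the top row, so square 2 wins on the spot.
board = "XX.OO...."[:9]
print(score_answer(board, 2, "X"))  # winning move -> 2
print(score_answer(board, 0, "X"))  # square occupied -> 0
```

In practice one would parse the move out of each model's generated chain of thought and feed it to `score_answer`; the parsing step is model-specific and omitted here.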
Comments

This is actually a really interesting vid, subscribed!

spoony

Nicely done. Very clearly described. Thank you.

calebweintraub

Bro.. gpt-4 was already trained on CoT.. here with o1 you're looking at a more complex prompt strategy with multiple recalls or something

TheTruthOfAI

why would we expect a consistent answer about the time a human endeavour takes, when it involves a variety of human actions, each with its own range of durations? there isn't a "correct" answer. it'd be more like a normal distribution, with a dice toss deciding where any given guess lands in that distribution. I would be suspicious if the answer was always the same to a question with that much imprecision.
Ask me three times how long it'd take me to go into town and shop for half a dozen items .. you'll get three different answers.

mijmijrm