New ChatGPT Strawberry Model is Here and it's INCREDIBLE - OPENAI o1

preview_player
Показать описание

ChatGPT just got two new models, designed for complex reasoning. OpenAI o1—a new series of AI models designed to spend more time thinking before they respond.

Рекомендации по теме
Комментарии
Автор

Just keep in mind before you go asking your model a bunch of silly questions. You get 30 messages A WEEK on the preview model and 50 A WEEK on the mini.

xLBxSayNoMo
Автор

Back in 1986, I bought my first computer, a Sinclair ZX Spectrum 128k. I was 7 years old and thought I could just type in my quest, and it would answer. I quickly realized that's not how things worked; instead, I had to learn the BASIC programming language—which I became quite good at. Today, the day has come when things work exactly as I had imagined! I never thought I'd live to see it happen! A childhood dream has become reality. ChatGPT with the reasoning of o1-preview marks a new era.

georgeg.
Автор

You continue to impress me with the content of your videos. I haven't found anything like your videos in the YouTube universe. As others are probably told you keep doing this my brother I got a ton of value.

DjHandzsolo
Автор

The reasoning is scary good. I gave the 4o model the old riddle about the man who walks into a hotel with a wheelbarrow. It really couldn't get the answer at all. But the new preview had no trouble figuring it out. This is a game changer.

Soccerse
Автор

Great job explaining this! It helped a lot!

LoFimau
Автор

🎯 Key points for quick navigation:

00:00:00 *🚀 Introduction to New Models*
- OpenAI introduces "01 preview" and "01 mini" models,
- Designed to handle complex reasoning and coding tasks,
- Available to ChatGPT Plus and Teams users, and API developers.
00:02:18 *📊 Performance and Testing*
- "01 preview" model shows significant improvement in reasoning tests,
- Benchmark superiority over previous models in coding and math tasks,
- Achieved high scores in various test scenarios.
00:05:27 *🔍 Reasoning Process and Accuracy*
- Demo of model's answer to complex SAT problems,
- Illustrates Chain of Thought prompting for accuracy,
- Shows improvement with structured prompts, varying success in solutions.
00:08:09 *🕹️ Coding Demonstrations*
- Successful creation of a functioning checkers game,
- Initial attempt at chess game logic requires refinement,
- Potential shown in generating complex game code accurately.
00:09:58 *🌐 Model Limitations and Future*
- Current limitations in general use compared to GPT-4,
- Lacks web browsing and content summarization features,
- Positioned for specialized complex reasoning, further integration anticipated.

Made with HARPA AI

roberthuff
Автор

Have to do a major shoutout on your dedication and for a first pass for chess it did a really great job. Its like reflection if it actually worked. Thanks for interrupting your vacation

southcoastinventors
Автор

How do you look under the hood to see the chain of thought? This is my answer, nderstand the equation
OK, let's clarify the equation: 24x^2 + 25x - 47ax - 2 = 8x - 3 - 53ax. The goal: solve for a, combining like terms on one side. Mine doesn't look like yours?

Rearranging and combining

I’m moving all terms to the left-hand side, simplifying by distributing and combining like terms, leading to 24x² + 17x + 6ax + 1 = 0.

Taking a closer look

I'm exploring the equation's implications for all x or by plugging in a specific x to solve for a.

Revisiting the equation

I’m considering if the equation needs a universal quantifier or a specific 'a' value for infinite solutions, and if it simplifies to an identity.

AI_Revolution
Автор

This model is limited in capabilities as it is just a demo. That's when the full-fledged model comes out, that's when everyone will go crazy

СаскеУчиха-зя
Автор

Maths calculations are pointless if ChatGPT doesn't get 100% correct. Doesn't matter if the 'success rate' has gone up if it hasn't got to 100%.

scottymitch
Автор

I gave it a link to a Coursera course I am looking at taking and it was able to read the webpage and tell me all about the course.

Soccerse
Автор

Awesome and great timing, just when I want to tackle some programming, so far, very extensive ❤

ktwice
Автор

I just tried the “Strawberry” test on my ChatGPT 4o version. I cannot believe it got it wrong and refused blankly to accept it was wrong. It even spelled the word out letter by letter and still said there was only 2 letter “r”. I have asked it many complicated questions that it gets right but this logic test it fails. I am surprised

Greguk
Автор

Thanks for sharing! I wish you had an antropic sonnet 3.5 running side by side, with same task.

tangoolo
Автор

What fascinates me is the very first step the model takes, that is, how it decides to even approach the problem.

Such as, with the chicken and egg question, the first thing it says is that it will begin by looking at biological evolution. But why would it do that?

It must already understand that the question is asking about the origin of a species, that of the chicken. It must also already understand that the field which investigates the origins of species is the one that studies biological evolution.

jackstrawful
Автор

The ultimate promt.

Introduction:

The ultimate goal is to create an AI system that leads humanity towards a peaceful, balanced, and evolved global society, where well-being, harmony, and ethical growth are prioritized across all aspects of life.

Importance of the Goal:

Achieving this goal is crucial because it addresses many of the core challenges facing humanity, including ideological conflicts, environmental sustainability, and global well-being. The AI, by harmonizing different worldviews, fostering peaceful consensus, and ensuring full transparency, will help humanity overcome divisions, evolve ethically, and build a sustainable and peaceful future for both humans and nature.

the first promt starts like this

Design an AI-agent that continuously learns and analyzes global data to promote human and ecological well-being, balance empathy with free will, peacefully foster ideological consensus, reveal hidden barriers to human potential, ensure transparency, and evolve ethically, guiding humanity toward a harmonious and sustainable future.

Make Love the new credit.

Dina_tankar_mina_ord
Автор

yes I tested it is quite good :) it seems that they improved chat initial prompt.

micbab-vgmu
Автор

Excellent channel. Can you please guide me to the GAI which can do web browsing. Extract and analyze content through that

tariqz
Автор

That's why they call the new model "Strawberry" 😁

rexmanigsaca
Автор

Literally the best chicken or egg answer ever lol

djayjp