Is OpenAI o1 Actually Better Than ChatGPT-4o? OpenAI's Newest Flagship Model and Its Capabilities

preview_player
Показать описание
OpenAI has unveiled "OpenAI o1-preview," a new series of AI models engineered to enhance complex reasoning by allocating more time to think before responding. These models significantly outperform previous iterations like GPT-4o in challenging tasks across science, coding, and mathematics. The first model in this series is now available in ChatGPT and via API as a preview.

Introducing OpenAI o1-preview

👨‍💻 Ask Me Anything about AI -- Access Exclusive Content ☕

-------------------------------------------------
➤ Follow @webcafeai

-------------------------------------------------

Key Takeaways:

✩ Enhanced Reasoning Abilities: OpenAI o1 models are designed to spend more time deliberating, enabling them to tackle complex problems and reason through intricate tasks more effectively than previous models.
✩ Technical Fields: In testing, the new model scored 83% on an International Mathematics Olympiad qualifier (compared to GPT-4o's 13%) and reached the 89th percentile in Codeforces coding competitions, matching the capabilities of PhD students in physics, chemistry, and biology.

▼ Extra Links of Interest:

automate everything. 👇

🌲 Do You Create Content?

My Setup To Record Content (amazon storefront) 📷

Become an Early Adopter 🍻

Introducing OpenAI o1

I build things for fun 🤠
Рекомендации по теме
Комментарии
Автор

While most cases will be perfectly fine with 4o, the times you'll need o1's prowess will be so important the extra time shouldn't matter. The only problem is, the quota right now of 30 messages for o1-preview (50 for 01-mini), but hopefully this will be temporary.

funmeister
Автор

nice, thanks for the quick comparison! Just found out about o1. I've been using 4o at work for C# coding, so I'll be really curious this week to use o1 all week and see how it does. However, I often give 4o little screenshots of my app so I might have to switch back and forth for those, hmm.. either way, interesting options OpenAI is given us lately. Also, I do use Free Claude for GPT4o completely fails me and Claude often finds me a better answer quicker when 4o just can't do it. Thanks!

CleoCat
Автор

Thanks Corbin. But I did not understand your advice at the end, in terms of when to use each model, I thought it was a little confusing.

scottgreg
Автор

I've been doing some advanced mathematics with an assist with from 4o versus o1 preview. There's a substantial difference. I would compare it to looking at something with a magnifying glass versus a microscope. The answers are way deeper with 01 preview.

memeoshorts
Автор

Corbin, my take in a very quick test today (as a business user not a coder) is the following:

It seems to be a step in the right direction. But isn’t quite the leaps and bounds it was “advertised” to be. As someone who works in marketing. I get the hype. But ultimately they pushed it too early.

Some major flaws (that you didn’t discuss):
- Lack of memory, it has no context at all about what we were working on. It’s like I just hired a new employee day 1.
- Lack of customization, I spent most of my time “retraining” it to understand the context of me project we’d been working on (silly new employees)

Topic you covered:
- Man. The no file upload. That was bigger than I thought. Even just during the “retrain” session I had. It made the process longer to put all of it in the chat box vs uploading my files.

I’m hopeful for what it brings, but it feels like a rushed to market product to get VC funding vs a full release.

hedgeghogcn
Автор

I found o1 better at coding advanced Python scripts, but the cap seems a bit low. They might consider raising the prices to access this model, but I believe the competition will catch up soon. Is it really a smart move to keep such a low cap?

Ytmois
Автор

O1 was gonna be better in most high reasoning cases, but for the rest they would work same
I also did a review of O1
would love your feedback as im just a beginner at this!

AinHab
Автор

Love your work so much Corbin.

Thank you 🫶

Such a rad idea to compare the two models.

I have to admit I use the attachment feature loads as well as screenshots, so that’s a huge bonus for me on traditional ChatGPT 4

That’s said, I love that the new model slows down as it thinks.

Feel blessed to have access to them both I guess 😂❤

nickygood
Автор

Here's the main differences between ChatGPT 4o and o1

OpenAI o1-preview model does not have access to the following advanced tools and features:

Memory
Custom instructions
Data analysis
File uploads
Web browsing
Discovering and using GPTs
Vision
Voice

You will need to switch over to GPT-4o to access these tools.

paulwilliamgoyeneche
Автор

Hey Everyone 🤠
Find the parts that interest you:

0:00 - Introduction to OpenAI's new model
0:38 - Key difference: attachments feature
2:10 - Coding capabilities comparison
4:25 - Quality of outputs: 01 preview vs 40

Recap by Bumpups ✏️

bumpupsapp
Автор

o1 -preview seems waybetter but its limited per wweek.

actionnew
Автор

Why aren't there millions of comments? You are so entertaining 😅

faustprivate
Автор

Its much better at coding, much better than Claude 3.5 as well

Island_Algorithms
Автор

It's a "preview" after all.

ozkmtnaffmkt
Автор

Perez Ruth Moore Richard Davis Shirley

GaryJackson-qw
Автор

Johnson Ronald Rodriguez Thomas Lopez Carol

StevensonAries-zs
Автор

Harris Betty Jackson Donna Moore Deborah

IngersollMaria-zs
Автор

White Donna Martin Linda Anderson Jeffrey

SonmerfieldWendell-eq