Counting with OpenAI o1

Показать описание

OpenAI

Рекомендации по теме

Комментарии

Is THIS the reason they named the model Strawberry? Because it can solve the strawberry r problem!

entrepreneerit

I had fun recording this demo with quite a trivial question: “how many r’s in strawberry?”

Why did I ask this to showcase one of the most intelligent AI models? LLMs process text at a subword level. A question that requires understanding the notion of both character and word confuses them. GPT-4o can't get this.

Our new model o1-preview "thinks harder" to avoid mistakes and gets it correct

hyungwonchung

Marvelous progress, we are getting closer to the AI singularity.

DragonFly

As a math major, I am now preparing psychologically to become unemployed

romaingerard

Holy moly this is it we are finally getting a new model

crisrampante

you put us on hold again for the new sound mode.

ugurergun

it fails if you misspell it with an extra 'r'.

Q: how many r's in strawberrry
O1 : There are three letter "r"s in "strawberrry".

Sree-xf

Still waiting for the Voice feature after a year of announcing it

issiewizzie

I see the same reasoning mistake happening in o1, how can I reach out to your team for feedback?

arashterooni

Amazing model as it seems... We'll have to wait for more benchmarks, though.

justkoru

I have a strong feeling this is simple embedded XML tags, nested tags, identifiers, attributes, and a structured tag-based processing instruction, as I've a custom GPT that gets that answer right every time because of the "thinking" XML tags integrations.

I know that sounds extremely simple for something like a new model from openai but I can't help but think about all the researchers that had left.... Just prior to this o1 announcement🤔

MrAbkejoe

I queried GPT-4 and GPT-4. o about ten times, asking each how many 'r's are in the word "strawberry." Nine times, both versions correctly responded that there are three 'r's. However, one time, GPT-4. o incorrectly stated that there are only two 'r's.

Robocafe

No real progress in this specific task. You can ask o1 how many "t"s in strawberry and it will tell you that there is 0 "t"s. It gets "r"s correctly only because it's learning dataset is contaminated with this meme question and answer, but there is no true reasoning happening in this class of tasks.

outeast

I do not get it. So are they using a different tokenization method to do this? Classic tokenizations does not preserve this kind of textual information very well if at all. Are they adding some "metadata" to the vectors to preserve details about the textual representation (that may even be split into subwords?)? 🤔

lorenzoblz

It’s crazy how the errors of generative AI are being quickly resolved.

Kolstee

It would be awesome if chatgpt could be trained to play tabletop games, the RPG - dungeon crawling ones with cards and involved evolving gamestate. I figure this would also be a good way to study how to make the language model navigate a specific domain requiring human interaction. I imagine a hidden prompt tailored to each game would be required, either by hardcoding it or maybe the LM can be trained to generate and update the hidden prompt by itself? my2c

DamianReloaded

I like how they praise every model when it comes out and then they trash it to shit when a new one is made.

Logan-hhme

so they have a longer loading times for it to do better

CameronLestagez

Why is the prompt how many r’s IN are in strawberry? Why is there an extra IN? What is this English?

stonechen

GPT4o can answer this question too, just add "think step by step" in prompt

AerisRG

Counting with OpenAI o1

Counting with OpenAI o1

OpenAI o1 Strawberry X Counting Problem ⚡️

Open Source 'Thinking' Models Are Catching Up To OpenAI o1 Already...

Math problems with GPT-4o

GPT-4o talking to GPT-4o

GPT-o1: The Best Model I've Ever Tested 🍓 I Need New Tests!

Coding using ChatGPT AI broke me

What GPT-4 Can Really Do

YouTube Summarizer by Ollama x LangChain, Bonus: Gemini and OpenAI | Case Done Ep 19

Getting started with Sora

How to count OpenAI Tokens from OpenAI platform?

o1 - What is Going On? Why o1 is a 3rd Paradigm of Model + 10 Things You Might Not Know

OpenAI Just Revealed They ACHIEVED AGI (OpenAI o3 Explained)

STUDENT GETS EXPOSED-ChatGPT! #chatgpt #ai

NEW DeepSeek-V3 is INSANE (FREE): RIP 3.5 Sonnet & O1?

OpenAI o3 and o3-mini—12 Days of OpenAI: Day 12

How to Trick ChatGPT in 15 Seconds - Fooling AI #ai #chatbot #chatgpt #gpt

O3 Just BROKE the AI Ceiling 🤯 (AGI is HERE - This Changes Everything)

OpenAI Unveils o3! AGI ACHIEVED!

BREAKING: OpenAI's new O3 model changes everything

How to count OpenAI GPT Tokens before API Call in NodeJS

Character voices with GPT-4o voice

TECH NEWS! OpenAI's ChatGPT hits 200 MILLION weekly users and counting

OpenAI o3 model: AGI or Cheating?