GPT-4: MIT Exams w/ 100% score for Mathematics? No MIT!


MIT published a pre-print claiming that GPT-4 scores a perfect 100% on MIT Mathematics final exams (the MIT Mathematics major), and exactly 90% as plain vanilla GPT-4 with no prompt engineering at all. We prove this claim wrong.

Is an autoregressive transformer architecture the perfect mathematical reasoning machine, as this pre-print by MIT, Harvard, Stanford and Boston Univ. claims? A pure vanilla GPT-4 without (!) any prompt engineering receives a 90% MIT exam score? No way, MIT!

All rights remain with the authors of this arXiv pre-print (not a peer-reviewed publication):
Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models
Sarah J. Zhang, Samuel Florin, Ariel N. Lee, Eamon Niknafs, Andrei Marginean, Annie Wang, Keith Tyser, Zad Chin, Yann Hicke, Nikhil Singh, Madeleine Udell, Yoon Kim, Tonio Buonassisi, Armando Solar-Lezama, Iddo Drori

#gpt4
#massachusettsinstituteoftechnology
#harvarduniversity
#stanforduniversity

Discover AI


Comments

I don't understand the statement that "they didn't show the data". Didn't they use GPT-4? That has whatever data was used in its training, right? ANYONE using GPT-4 has the same limitation of not being able to show the data that was used in their study.

kevinboles

Awe inspiring! I was hooked and drawn in to the end and I matched your excitement… thank you

Pure_Science_and_Technology

On point 1: Using Plugins (Wolfram), GPT can perform calculations? I assumed the GPT-4 entry was allowed plugins

OM-ynpt

Hello,
Great video, and thank you for covering topics like this! I'd like to ask you for some advice; where can I contact you?

jalalelzein

Was this published as a warning about preprints? Some papers that have passed peer review seem to me to be using questionable performance metrics.

densonsmith

I have a feeling a major backtrack on this is coming

fkxfkx

When I viewed your first video on this subject (when the word "Hoax" still appeared in the title), it was obvious, given GPT-4's inability to deliver anything but trivial mathematical calculations, that the MIT math tests were purely symbolic manipulations.
Long ago when I encountered professional mathematicians at university (a limited set, to be sure), they were contemptuous of numerical calculations which destroy the precision of a pure mathematical vision. If I wanted to talk about numbers, they would direct me to the engineering side of the campus, in much the same way that a fancy hotel desk would direct the Roto-rooter (sewer cleaner) man to the tradesman's entrance at the back of the building.
It is with that block of salt that I understood the claims of GPT-4 mathematical perfection.

johnbrisbin

I genuinely hope that this is just some sort of misunderstanding.

NeuroScientician
