First Look At GPT-4 With Vision

Показать описание

Making this video was quite a rollercoaster! From Dall-e 3 not yet been releaed, to confirmed multi-modal GPT-4 release, I cannot believe I have hijacked such a funny timing.

Special thanks to bruhmoment for providing me the Bard results, and Raphael for BeMyEyes access

This video is supported by the kind Patrons & YouTube Members:
🙏Andrew Lescelius, alex j, Chris LeDoux, Alex Maurice, Miguilim, Deagan, FiFaŁ, Daddy Wen, Tony Jimenez, Panther Modern, Jake Disco, Demilson Quintao, Shuhong Chen, Hongbo Men, happi nyuu nyaa, Carol Lo, Mose Sakashita, Miguel, Bandera, Gennaro Schiano, gunwoo, Ravid Freedman, Mert Seftali, Mrityunjay, Richárd Nagyfi, Timo Steiner, Henrik G Sundt, projectAnthony, Brigham Hall, Kyle Hudson, Kalila, Jef Come, Jvari Williams, Tien Tien, BIll Mangrum, owned, Janne Kytölä, SO

Рекомендации по теме

Комментарии

Making this video was quite a rollercoaster! From Dall-e 3 not yet been releaed, to confirmed multi-modal GPT-4 release, I cannot believe I have hijacked such a funny timing.

bycloudAI

Just wanted to say, you're like the only AI 'tuber I've seen who isn't full of "THIS IS SO HYPE" and scammy vibes, or overly simplified tutorials. Awesome stuff man, good editing as well.

Clybius

Future Image captioning for datasets is going to be absolutely insane!

nilaier

Finally a life changing innovation that comes from using AI

L_QTx

I wonder if it can help out with electrical circuits

itsbalanse

I think OpenAI started rolling out the image feature already on it's own platform for plus users

Taireyn

when i see how accurate it can describe random peoples rooms, i cant help but thinking:
with this we finally solved the problem of how to automatically transform our vacuum robot enabled mass surveillance data into an easily searchable format 😅

LostMekka

if this could be fitted into specs it will become Jarvis level technology, we all could become Iron man

ChandravijayAgrawal

OpenAi just tweeted about vision coming to chatgpt

amallukose

I've just discovered this channel today after searching for a good AI news coverage channel. Great content overall.

My suggestion would be to slow down a bit and maybe provide more in-depth as well as simple explanations for some of the concepts. You go through a lot of details quickly and it's kind of hard to follow at times(maybe not this video specifically, but previous ones definitely suffer from information overload), more background information and context would be helpful for viewers who are new to the topic. Other than that, keep up the good work. Looking forward to more.

acousticdoodling

I've been researching the multimodal LLM's field for a while, and I have an idea why opensource models perform poorly compared to GPT-4. Most of the models are based on augmenting LLM's with vision transformers, such as CLIP (EVA) or pure VIT and they are very simple models that can operate only with 336x336 images at max. So i think that they aren't able to distinguish text and labels because the letters are compressed to just a blob of pixels that even human cannot recognize

why_we_still_here-wq

AI oops. At 3:25 the "assistant" wrongly says, "When about to land, pull the brake on right." But the brake is on the left under the pilot's left hand. Specifically this is the speed brake, which at constant airspeed controls the angle of descent. (Also, while rolling out pulling fully against the backstop at varying pressure applies the wheel brake to that amount.)

lonlipscomb

You're a legend man, keep on uploading

TopCuby

on one hand, it is super impressive how much can be done within the current paradigm and with what level of precision, but on the other - don't you also feel like the promises of AGI and something that transcends 'use huge datasets to train transformer models to imitate said datasets and then further finetune and modify them to make them perform specific tasks that fall within the logic of those datasets' seem just as far off as they did 8 months ago? or do you think that the exponential curve is real after all?

YUTPIA

What about audio? Have any of the LLM been pointed towards automatically translating speech-recording to other languages?

bennguyen

Wow. if only this API was released to the public...

le

I'd like gpt 4 to be prompted to create a randomised infinite sequence of visual prompts that are fed into dall-e 3 so that there is a constant output of random images in high resolution.

TheAkdzyn

That cool and but can it tag correctly those NH and danbooru works compared to some of those lazy posters :v ?

sharpcircle

"Serval boxes of computer parts sitting on a table" seems pretty satisfying for me.
I'm pretty tech oriented and I still had to squint to know what half of those boxes were all about lol :v
Their quite niche items so I don't blame an AI if he's at least able to at minimum figure out what is represented in general.

sharpcircle

imagine being one of the patreons shouted out at the end of the video...

tatacraft

First Look At GPT-4 With Vision

First Look At GPT-4 With Vision

GPT-4 First Impression - A New Era Begins?

GPT-4.5 shocks the world with its lack of intelligence...

How To Use Chat GPT 4 - First Look At Updates

GPT-4 is Here! A First Look and Summary of the OpenAI Developer Demo | ChatGPT Version 4

GPT-4o talking to GPT-4o

OpenAI's Cancels GPT-4.5!? First Look At GPT-4.1 with 1M Context

GPT-4 First Impressions: A Major Improvement over ChatGPT - AI Adventures

How I made a REAL Full Stack Chat App in 2hr with Cursor

GPT-4 vs GPT-3.5 - First Impression! (Unraveling Minified React Code)

This is why OpenAI's charging 30x more for GPT-4.5 (First Look!)

4 ChatGPT hacks that will save you a ton of time!

ChatGPT Voices can now BREATHE! Realistic AI Voices on phone #ai #ailearning #openai #chatgpt

Chat gpt Vs Meta Ai Image generator || which is the Best #chatgpt #meta #best #image

Google's Gemini just made GPT-4 look like a baby’s toy?

MetaGPT: The Next-Gen AI Chatbot OUTSHINES GPT-4! 🤩 (FIRST LOOK)

First Look of Chat GPT 4 Advanced Tips and Tricks to Maximize Your Chatting Experience!

Hands-on with GPT-4 - Impressive!

The ChatGPT app for iPhone is finally here!

AutoGPT Revealed: First Glimpse at AGI Shocks the Industry! (Autonomous GPT-4)

How to hack ChatGPT: The ‘Grandma Hack’

How to Trick ChatGPT in 15 Seconds - Fooling AI #ai #chatbot #chatgpt #gpt

CHAT GPT 4 : FIRST LOOK!! #chatgpt #openai #chatgpt4

Getting Started with GPT-4o: First Impressions and Tips