But what is a GPT? Visual intro to transformers | Chapter 5, Deep Learning

Breaking down how Large Language Models work

---

Here are a few other relevant resources

Build a GPT from scratch, by Andrej Karpathy

If you want a conceptual understanding of language models from the ground up, @vcubingx just started a short series of videos on the topic:

If you're interested in the herculean task of interpreting what these large networks might actually be doing, the Transformer Circuits posts by Anthropic are great. In particular, it was only after reading one of these that I started thinking of the combination of the value and output matrices as being a combined low-rank map from the embedding space to itself, which, at least in my mind, made things much clearer than other sources.
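
To make the low-rank remark above a bit more concrete, here is a minimal sketch in plain Python. The sizes `D_MODEL` and `D_HEAD`, the random integer entries, and the names `W_V`, `W_O`, and `W_OV` are illustrative assumptions, not values from the video or from the Transformer Circuits posts. It only shows that composing a map down into a small head space with a map back up gives a single square matrix on the embedding space whose rank is at most the head dimension.

```python
from fractions import Fraction
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def rank(m):
    """Matrix rank via exact Gaussian elimination over fractions."""
    rows = [[Fraction(x) for x in row] for row in m]
    r = 0
    for c in range(len(rows[0])):
        # Find a row at or below position r with a nonzero entry in column c.
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r


D_MODEL, D_HEAD = 6, 2  # toy sizes, assumed purely for illustration
rng = random.Random(0)


def random_matrix(n_rows, n_cols):
    """Small random integer matrix (hypothetical stand-in for learned weights)."""
    return [[rng.randint(-3, 3) for _ in range(n_cols)] for _ in range(n_rows)]


W_V = random_matrix(D_HEAD, D_MODEL)  # value map: embedding space -> head space
W_O = random_matrix(D_MODEL, D_HEAD)  # output map: head space -> embedding space
W_OV = matmul(W_O, W_V)               # combined map: embedding space -> itself

x = random_matrix(D_MODEL, 1)         # one embedding as a column vector
two_step = matmul(W_O, matmul(W_V, x))
one_step = matmul(W_OV, x)

print("combined shape:", len(W_OV), "x", len(W_OV[0]))
print("rank is at most D_HEAD:", rank(W_OV) <= D_HEAD)
print("same result either way:", two_step == one_step)
```

Applying the two matrices in sequence and applying their product once give the same vector, which is why it can be convenient to think of the pair as one map.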

Site with exercises related to ML programming and GPTs

History of language models by Brit Cruise, @ArtOfTheProblem

An early paper on how directions in embedding spaces have meaning:

---

Timestamps

0:00 - Predict, sample, repeat
3:03 - Inside a transformer
6:36 - Chapter layout
7:20 - The premise of Deep Learning
12:27 - Word embeddings
18:25 - Embeddings beyond words
20:22 - Unembedding
22:22 - Softmax with temperature
26:03 - Up next

Comments

I graduated from Computer Science in 2017. Back then, the cutting edge of ML was Recurrent Neural Networks, on which I based my thesis. This video (and I'm sure the rest of this series) just allowed me to catch up on years of advancements in so little time.

I cannot describe how important your teaching style is to the world. I've been reading articles, blogs, papers on embeddings and these topics for years now and I never got it quite like I got it today. In less than 30 minutes.

Imagine a world in which every teacher taught like you. We would save millions and millions of man hours every hour.

You truly have something special with this channel and I can only wish more people started imitating you with the same level of quality and care. If only this became the standard. You'd deserve a Nobel Prize for propelling the next thousand Nobel Prizes.

iau

Grant casually uploading the best video on Transformers on YouTube

DynestiGTI

The fact that the meaning behind tokens is embedded into this roughly 12,000-dimensional space, and that you get relationships in terms of coordinates and direction that exist across topics, is mind blowing. Like, Japan -> sushi being similar to Germany -> bratwurst is just so darn neat

tempo

This is heaven for visual learners. Animations are correlated smoothly with the intended learning point ...

lewebusl

I was trying to understand ChatGPT through videos and texts on the Internet. I always said: I wish 3b1b would release a video about it, it's the only way for someone inexperienced to understand, and here it is. Thank you very much for your contributions to YouTube!!

billbill

The return of the legend! This series is continuing, that is the best surprise of YouTube, thanks Grant, you have no idea how much the young population of academia is indebted to you.

Silent_Knife

2 years ago I started studying transformers, backpropagation and the attention mechanism. Your videos were a cornerstone for my understanding of those concepts!
And now, partially thanks to you, I can say: “yeah, relatively smooth to understand”

lucasamadsen

I wish I had a friend as passionate as this channel is. It's like finding the family I've always wanted to have

Kargalagan

I don't even know how many times I'm going to rewatch this.

parenchyma

I have been working on transformers for the past few years and this is the greatest visualization of the underlying computation that I have seen. Your videos never disappoint!!

nicholaitukanov

Thank you! You're so late, 3Blue1Brown, it took me 10 hours of videos + blogs last year to understand what a transformer is! This is the long-awaited video! I'm sending this to all my friends.

jerryanyu

You are such an AMAZING teacher. I feel like you've really given thought to the learner's perception and are kind enough to take the time to address asides and gotchas while you meticulously build components and piece them together, all with a very natural progression that's moving towards "something" (hopefully comprehension). Thank you so much for your time, effort, and the quality of your work.

ogginger

Here's to hoping this is not an April Fools' joke

JustinLe

You *must* turn the linguistic vector math bit into a short. It is pure gold.

chase_like_the_bank

It's absolutely ridiculous how many aspects of this topic finally clicked for me in this intro video already. This was incredibly well explained and I'm so thrilled for the next chapters. Thank you very much, Grant!

tielessin

Man! You never fail to enlighten, entertain, and inspire us, nor do we get enough of your high-quality, yet very digestible, content! Thank you, Grant!

jaafars.mahdawi

It's astonishing, amazing that this kind of info and explanation quality is available for free; this is way better than a university would explain it

yashizuko

Grant shows just how creative you can get with linear algebra. Who would have guessed language (?!) was within its reach?

Mutual_Information

The genius in what you do is taking complicated concepts and making them easy to digest. That's truly impressive!

mahdimoradkhani

Blown away by the elegance - both visual and conceptual - with which this extremely complicated topic was taught! I never comment but was moved to express my sincerest gratitude! Thank you for all the time put into these beautiful videos.

alyssachen