What are Transformer Neural Networks?

preview_player
Показать описание
This short tutorial covers the basics of the Transformer, a neural network architecture designed for handling sequential data in machine learning.

Timestamps:
0:00 - Intro
1:18 - Motivation for developing the Transformer
2:44 - Input embeddings (start of encoder walk-through)
3:29 - Attention
6:29 - Multi-head attention
7:55 - Positional encodings
9:59 - Add & norm, feedforward, & stacking encoder layers
11:14 - Masked multi-head attention (start of decoder walk-through)
12:35 - Cross-attention
13:38 - Decoder output & prediction probabilities
14:46 - Complexity analysis
16:00 - Transformers as graph neural networks

Original Transformers paper:

Other papers mentioned:

Video style inspired by 3Blue1Brown

Music: Trinkets by Vincent Rubinetti

Links:

If you'd like to help support the channel (completely optional), you can donate a cup of coffee via the following:
Рекомендации по теме
Комментарии
Автор

Love the 3blue1brown esque style. Keep em coming

CodeEmporium
Автор

I think I speak for everyone when I say.. please keep posting! This is excellent stuff!

Mutual_Information
Автор

Your videos exhibit exceptional quality. Thank you for this outstanding contribution.

jwine
Автор

I put 2 months into studying this. I’m a Cultural Science major and I can finally say that I understood every part of this video.
I’m really proud of myself and very thankful for your excellent didactic style.

PaperTigerLive
Автор

The best help in understanding transformers while reading „Attention is all you need“ I have found.

justusmzb
Автор

i thought i was smart - but then i started getting my head around all this amazing machine learning and then i was humbled. thanks for sharing

anmusic
Автор

This video is a mix of actual explanation about the nature of transformers and long tangents about implementation details that are mixed together so perfectly it ensures that it's impossible follow or even know if the author of this video understands how transformers work.

googleyoutubechannel
Автор

Thanks for the nice content. I am not a beginner, but unfortunately it was hard for me to follow this video. There were many concepts/terms mentioned without a brief explanation, and the pace was rather fast. If you could publish the same video, with additional examples and clarifications would be much appreciated. I understand that one would need to look up some topics and references while watching the video, but in this case it felt like I have to look up things very often. Thanks again for your effort!

konna
Автор

Thank YOU! this is the most precise and straight forward thing about Attention that I ever have. When you said the word compatibility, I finally understood why do I need to take dot product between Queries and Keys.

pradiptahafid
Автор

Thank you for the clear and concise explanation! I understood this more than any other video I've seen yet (although I'm still learning). Looking forward to more videos like this!

theJeet
Автор

Our community is the best!!! 💪💪 Thank you very much for the amazing review!!!

adrewkin
Автор

i am at 1/3rd of the video, stopped to come here express gratitude before getting back to it! 3B1B is a proven style, no harm in using it, later on adding to it.

samirelzein
Автор

These are great! I can see this channel easily becoming as popular as Yannic Kilcher’s. Thank you for all your work, your explanations give a lot of clarity without sacrificing depth!

rmac
Автор

Transformers are incredibly complicated but you tried to simplify it for us any way it does need another look
Thank you very much indeed

AIdevel
Автор

incredible, insightful, delightful, enlightening

bijan
Автор

Exceptionally explained! Please make more content! This stuff is worth paying for ;)

SaheelGodhane_TheTramp
Автор

Dude I was looking for a channel like yours for weeks !

nprm
Автор

I watched a bunch of videos on this and I feel like after this, I actually understand it. Thank you!

jcorey
Автор

Thanks for the vid. This is exactly level of the details and explanation style I needed. Many other explanations are either too vague and miss important details or too hardcore and hard to follow. This is ideal: most of the technical details are here, but still easy to follow.

alexandrsavochkin
Автор

Maybe the best transformer explanation out there

andreasgian
visit shbcf.ru