What are Transformer Neural Networks?

This short tutorial covers the basics of the Transformer, a neural network architecture designed for handling sequential data in machine learning.
Timestamps:
0:00 - Intro
1:18 - Motivation for developing the Transformer
2:44 - Input embeddings (start of encoder walk-through)
3:29 - Attention
6:29 - Multi-head attention
7:55 - Positional encodings
9:59 - Add & norm, feedforward, & stacking encoder layers
11:14 - Masked multi-head attention (start of decoder walk-through)
12:35 - Cross-attention
13:38 - Decoder output & prediction probabilities
14:46 - Complexity analysis
16:00 - Transformers as graph neural networks
Original Transformers paper:
Other papers mentioned:
Video style inspired by 3Blue1Brown
Music: Trinkets by Vincent Rubinetti
Links:
If you'd like to help support the channel (completely optional), you can donate a cup of coffee via the following: