Transformer Decoder Architecture | Deep Learning | CampusX

The decoder in a transformer generates the output sequence by attending both to the tokens it has already produced (via masked self-attention) and to the encoder's output (via cross-attention). Each decoder layer stacks three sub-layers: masked multi-head self-attention, multi-head cross-attention, and a position-wise feed-forward network, each wrapped with a residual connection and layer normalization. This structure lets the model generate coherent sequences by considering both past outputs and the relevant input context, making it effective for tasks like text generation and translation.
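
For reference, here is a minimal sketch of one decoder layer in PyTorch, following the structure described above. The class name DecoderLayer and the hyperparameter values (d_model=512, num_heads=8, d_ff=2048) are illustrative assumptions, not taken from the video.

# Minimal sketch of a single transformer decoder layer (assumed hyperparameters).
import torch
import torch.nn as nn

class DecoderLayer(nn.Module):
    def __init__(self, d_model=512, num_heads=8, d_ff=2048, dropout=0.1):
        super().__init__()
        # Masked self-attention over previously generated tokens
        self.self_attn = nn.MultiheadAttention(d_model, num_heads, dropout=dropout, batch_first=True)
        # Cross-attention over the encoder's output
        self.cross_attn = nn.MultiheadAttention(d_model, num_heads, dropout=dropout, batch_first=True)
        # Position-wise feed-forward network
        self.ff = nn.Sequential(
            nn.Linear(d_model, d_ff),
            nn.ReLU(),
            nn.Linear(d_ff, d_model),
        )
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)
        self.norm3 = nn.LayerNorm(d_model)
        self.dropout = nn.Dropout(dropout)

    def forward(self, tgt, memory):
        # Causal mask: position i may only attend to positions <= i
        seq_len = tgt.size(1)
        causal_mask = torch.triu(
            torch.ones(seq_len, seq_len, dtype=torch.bool, device=tgt.device), diagonal=1
        )

        # 1) Masked multi-head self-attention + residual + layer norm
        attn_out, _ = self.self_attn(tgt, tgt, tgt, attn_mask=causal_mask)
        x = self.norm1(tgt + self.dropout(attn_out))

        # 2) Cross-attention: queries from the decoder, keys/values from the encoder output
        cross_out, _ = self.cross_attn(x, memory, memory)
        x = self.norm2(x + self.dropout(cross_out))

        # 3) Feed-forward + residual + layer norm
        x = self.norm3(x + self.dropout(self.ff(x)))
        return x

# Usage: 2 target sequences of length 5 attending to encoder output of length 7
layer = DecoderLayer()
tgt = torch.randn(2, 5, 512)
memory = torch.randn(2, 7, 512)
out = layer(tgt, memory)  # shape: (2, 5, 512)

A full decoder stacks several such layers and feeds the final hidden states through a linear projection and softmax to predict the next token.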
============================
Did you like my teaching style?
============================
📱 Grow with us:
⌚Time Stamps⌚
00:00 - Plan of Attack
02:22 - Simplified View
10:10 - Deep Dive into Architecture