Coding Self Attention in Transformer Neural Networks

#shorts #machinelearning #deeplearning
Comments

This is great! I wish I had discovered your channel earlier. Thank you!

MikeMm-nn

Is the formula scaled = scaled + mask correct?

luisvizcaya
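
For context on the question above, here is a minimal sketch of additive masking. It assumes that `scaled` holds the scaled dot-product scores and that `mask` is an additive causal mask with 0 where attention is allowed and negative infinity above the diagonal. Those meanings, and the helper names `softmax`, `causal_mask`, and `masked_attention_weights`, are assumptions for illustration and are not taken from the video. Under those assumptions the sum does what is intended: allowed scores are unchanged, and masked scores become negative infinity, which softmax maps to a weight of exactly 0.

```python
import math

def softmax(row):
    # Numerically stable softmax over one row; math.exp(-inf) evaluates to 0.0.
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def causal_mask(n):
    # Hypothetical helper: 0.0 on and below the diagonal, -inf above it.
    return [[0.0 if j <= i else -math.inf for j in range(n)] for i in range(n)]

def masked_attention_weights(scaled, mask):
    # Element-wise `scaled = scaled + mask`, followed by a row-wise softmax.
    summed = [[s + m for s, m in zip(s_row, m_row)]
              for s_row, m_row in zip(scaled, mask)]
    return [softmax(row) for row in summed]

# Made-up 3x3 score matrix, used only to exercise the functions.
scaled = [[0.5, 1.0, -0.3],
          [0.2, 0.1, 0.9],
          [1.5, -0.7, 0.4]]
weights = masked_attention_weights(scaled, causal_mask(3))

for row in weights:
    print([round(w, 3) for w in row])
```

Each row of `weights` still sums to 1, and every entry above the diagonal is 0, so a position never attends to later positions. Multiplying by a 0/1 mask before the softmax would not give the same result, because a score of 0 still receives a nonzero weight after exponentiation.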

You need something for your mic. The popping is uncomfortable when listening to you with headphones on

BlayneOliver