Visualize the Transformers Multi-Head Attention in Action

Attention networks are used in modern AI technologies such as BERT, GPTx, and ChatGPT because they learn relationships between the different parts of the data they encounter. The video provides conceptual depictions of what happens 'under the hood' as abstract concepts in multi-dimensional space are manipulated during training and at inference time.
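
As a rough illustration of how attention relates the different parts of its input, here is a minimal sketch in plain Python. It is not the implementation referred to in the video. To stay short and dependency-free, it assumes self-attention and omits the learned query, key, value, and output projections (in effect treating them as identity matrices), so each head simply attends over its own slice of the embedding. The function names and the toy `tokens` input are hypothetical.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_head(q, k, v):
    """Scaled dot-product attention for one head.

    q, k, v are lists of vectors (one vector per token).
    Returns the attended vectors and the attention weights.
    """
    d = len(q[0])
    outputs, weights = [], []
    for qi in q:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [sum(a * b for a, b in zip(qi, kj)) / math.sqrt(d) for kj in k]
        w = softmax(scores)
        weights.append(w)
        # Weighted average of the value vectors.
        outputs.append(
            [sum(wj * vj[c] for wj, vj in zip(w, v)) for c in range(len(v[0]))]
        )
    return outputs, weights


def multi_head_self_attention(x, num_heads):
    """Split each embedding into num_heads slices and attend per slice.

    Assumption: learned projections are omitted, so q = k = v = the slice.
    """
    d_model = len(x[0])
    assert d_model % num_heads == 0, "embedding size must divide evenly"
    d_head = d_model // num_heads
    head_outputs, head_weights = [], []
    for h in range(num_heads):
        part = [tok[h * d_head:(h + 1) * d_head] for tok in x]
        out, w = attention_head(part, part, part)
        head_outputs.append(out)
        head_weights.append(w)
    # Concatenate the per-head outputs back into one vector per token.
    combined = [
        sum((head_outputs[h][i] for h in range(num_heads)), [])
        for i in range(len(x))
    ]
    return combined, head_weights


# Toy input: 3 tokens, each with a 4-dimensional embedding.
tokens = [
    [1.0, 0.0, 0.0, 1.0],
    [0.0, 1.0, 1.0, 0.0],
    [1.0, 1.0, 0.0, 0.0],
]
output, weights = multi_head_self_attention(tokens, num_heads=2)
```

Here `weights[h][i][j]` is how strongly token `i` attends to token `j` in head `h`. Because each head sees a different slice of the embedding, the heads can assign different weights to the same pair of tokens. In a trained model, the omitted projections are what let each head learn which relationships to focus on.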

Python / PyTorch implementation referred to in this video: