filmov
tv
The moment we stopped understanding AI [AlexNet]

Показать описание
Activation Atlas Posters!
Special thanks to the Patrons:
Juan Benet, Ross Hanson, Yan Babitski, AJ Englehardt, Alvin Khaled, Eduardo Barraza, Hitoshi Yamauchi, Jaewon Jung, Mrgoodlight, Shinichi Hayashi, Sid Sarasvati, Dominic Beaumont, Shannon Prater, Ubiquity Ventures, Matias Forti
Welch Labs
References
AlexNet Paper
Carter, et al., "Activation Atlas", Distill, 2019.
`Olah, et al., "Feature Visualization", Distill, 2017.`
Templeton, et al., "Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet", Transformer Circuits Thread, 2024.
“Deep Visualization Toolbox" by Jason Yosinski video inspired many visuals:
Great LLM/GPT Intro paper
3B1Bs GPT Videos are excellent, as always:
Andrej Kerpathy's walkthrough is amazing:
Goodfellow’s Deep Learning Book
GPT-3 size, etc: Language Models are Few-Shot Learners, Brown et al, 2020.
GPT-4 training size etc, speculative:
Historical Neural Network Videos
Errata
1:40 should be: "word fragment is appended to the end of the original input". Thanks for Chris A for finding this one.
Комментарии