filmov
tv
Reading AI's Mind - Mechanistic Interpretability Explained [Anthropic Research]
![preview_player](https://i.ytimg.com/vi/Mhp8vpOksWw/maxresdefault.jpg)
Показать описание
Solving AI Doomerism: Anthropic's Research On AI Mechanistic Interpretability. This is a big first step into understanding what the underlying nodes within an AI model are actually "thinking".
Towards Monosemanticity: Decomposing Language Models With Dictionary Learning
This video is supported by the kind Patrons & YouTube Members:
🙏Andrew Lescelius, alex j, Chris LeDoux, Alex Maurice, Miguilim, Deagan, FiFaŁ, Daddy Wen, Tony Jimenez, Panther Modern, Jake Disco, Demilson Quintao, Shuhong Chen, Hongbo Men, happi nyuu nyaa, Carol Lo, Mose Sakashita, Miguel, Bandera, Gennaro Schiano, gunwoo, Ravid Freedman, Mert Seftali, Mrityunjay, Richárd Nagyfi, Timo Steiner, Henrik G Sundt, projectAnthony, Brigham Hall, Kyle Hudson, Kalila, Jef Come, Jvari Williams, Tien Tien, BIll Mangrum, owned, Janne Kytölä, SO
[Music] massobeats - warmth
[Video Editor] @askejm
Комментарии