[MERL Seminar Series Fall 2024] Tools from cognitive science to understand the behavior of large ...

Показать описание

[MERL Seminar Series Fall 2024] Tools from cognitive science to understand the behavior of large language models

Tom Griffiths, Princeton, presented a talk in the MERL Seminar Series on September 18, 2024.

Abstract:
Large language models have been found to have surprising capabilities, even what have been called “sparks of artificial general intelligence.” However, understanding these models involves some significant challenges: their internal structure is extremely complicated, their training data is often opaque, and getting access to the underlying mechanisms is becoming increasingly difficult. As a consequence, researchers often have to resort to studying these systems based on their behavior. This situation is, of course, one that cognitive scientists are very familiar with — human brains are complicated systems trained on opaque data and typically difficult to study mechanistically. In this talk I will summarize some of the tools of cognitive science that are useful for understanding the behavior of large language models. Specifically, I will talk about how thinking about different levels of analysis (and Bayesian inference) can help us understand some behaviors that don’t seem particularly intelligent, how tasks like similarity judgment can be used to probe internal representations, how axiom violations can reveal interesting mechanisms, and how associations can reveal biases in systems that have been trained to be unbiased.

Mitsubishi Electric Research Labs (MERL)

Рекомендации по теме

[MERL Seminar Series Fall 2024] Tools from cognitive science to understand the behavior of large ...

[MERL Seminar Series Fall 2024] Tools from cognitive science to understand the behavior of large ...

[MERL Seminar Series Spring 2024] Neural Certificates and LLMs in Large-Scale Autonomy Design

[MERL Seminar Series Spring 2024] The Debate Over 'Understanding' in AI's Large Langu...

[MERL Seminar Series Spring 2024] Enhancing the Efficiency and Robustness of Human-Robot Interaction

[MERL Seminar Series Spring 2024] Are Emergent Abilities of Large Language Models a Mirage?

[MERL Seminar Series Fall 2023] Robust and Physics-informed machine learning for low light imaging

[MERL Seminar Series Spring 2024] Decoding Hidden Worlds: Unprecedented Sensing and Connectivity...

[MERL Seminar Series Spring 2024] Computational models of human auditory and language processing

[MERL Seminar Series Spring 2022] Hybrid robotics and implicit learning

[MERL Seminar Series 2021] Reconfigurable Intelligent Surfaces for Wireless Communications

[TRO 2024] Safe Multiagent Motion Planning Under Uncertainty for Drones Using Filtered Reinforcem...

Climate Resiliency Nuts and Bolts

[ICASSP XAI-SA 2024] Why does music source separation benefit from cacophony?

Localizing MERL in Burundi, Colombia, and Malawi (English Audio)

Fall Policy Workshop - 9/26/24

Data Science For MERL Training Course | Interactive Dashboards | AppComs Institute 2024

Decentralized, Safe, Multi-agent Motion Planning for Drones Under Uncertainty via Filtered Reinfo...

Creativity in STEM

Philip Phillips (UIUC) Beyond BCS: An Exact Model for Superconductivity& Mottness@Harvard 02/03/...

XAI-SA 2024 - Invited talk: Gordon Wichern Investigations of memorization of audio generative models

Tuesday Night Rheumatology: Mastering SLE

Positive Invariant Sets for Safe Integrated Vehicle Motion Planning and Control

JSALT 2024 - Final Presentations

Toshiaki Koike-Akino Gives Seminar Talk at IEEE Boston Photonics