Physics of Language Models: Part 3.1 + 3.2, Knowledge Storage, Extraction and Manipulation

Timecodes
0:00 - Prelude
6:59 - Toy Example and Motivation
12:07 - Definitions
16:07 - Result 1: Mixed Training
21:38 - Result 2: Pretrain and Finetune
23:37 - Result 3: Knowledge Augmentation
28:21 - Result 4: P-Probing
33:29 - Result 5: Q-Probing
36:25 - Result 6: Celebrity can help Minority
41:00 - Result 7: Bidirectional Model + MLM
46:02 - Start of Knowledge Manipulation
46:57 - Result 8: Knowledge Partial/Dual Retrieval
51:47 - Result 9: Knowledge Classification and Comparison
1:04:44 - Result 10: Knowledge Inverse Search (Reversal Curse)
1:15:37 - Conclusion
This is an expanded version of the talk I gave about the following two papers.
(Results 1-7)
(Results 8-10)
Physics of Language Models: Part 1, Context-Free Grammar
ICML 2024 Tutorial: Physics of Language Models
Physics of Language Models: Part 3.1 + 3.2, Knowledge Storage, Extraction and Manipulation
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process
Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems
Physics of Language Models: Part 1
Physics of Language Models | Large Language Models (LLMs)
Physics of Language Models - Extracting Knowledge
You Can Control How Well AI Models Learn By Controlling Physics
Physics of Language Models: Understanding the Fundamentals
Yuanzhi Li | Physics of Language Models: Knowledge Storage, Extraction, and Manipulation
LLM Explained | What is LLM
Transformers (how LLMs work) explained visually | DL5
Centripetal or Centrifugal Force Demo? #physics
On large language models and transformers: perspectives from physics, neuroscience, and theory
Nature Reviews Physics: Science in the age of Large Language Models
2024's Biggest Breakthroughs in Computer Science
The Biggest Physics News of 2024
Deep Dive: Quantizing Large Language Models, part 1
Large language models for problems in Physics
👀 Asking GCSE Students (Hamdi) How Much They Physics They Know - Part 1 #Shorts
To Strawberry and Beyond: Insights from 'Language Model Physics' Paper
Not all language model features are linear | Josh Engels | BITS Physics of Intelligence
Ethan Dyer - “Lessons from scale for large language models and quantitative reasoning”