GPT-2 Teaches GPT-4: Weak-to-Strong Generalization



Patreon Supporters:
- Tsubasa Kato
- Mike Wolf
- Paiva
- Tassilo Neubauer
- MonikerEpsilon
- Alexey Malafeev
- Jack Seroy
- JJ Hepburn
- Max Chiswick
- William Freire
- Edward Huff
- Gunnar Höglund
- Ryan Coppolo
- Cameron Holmes
- Emil Wallner
- Jesse Hoogland
- Jacques Thibodeau
- Vincent Weisser
Comments

TheInsideView (author)

some comments I received that didn't make it into the final cut:

- "for imitation saliency: there are some results in appendix E.3 figure 27 that show that if the strong model could actually imitate the weak model the generalization basically goes away"
- "I would emphasize more that the pretraining leakage problem is about the pretraining data leaking implicit supervision from humans, which sort of breaks the analogy where the only supervision the strong models are supposed to have is from the weak models"
- "I think the zero-shot baseline is an important caveat in terms of these techniques actually being useful"
- "I know you sort of mention this near the end, but when watching it the first time I thought you were implying that the chess puzzles and RMs also used the confidence aux loss when they don't"
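For reference, the confidence aux loss mentioned in that last comment is, roughly, a mix of cross-entropy against the weak supervisor's labels and cross-entropy against the strong model's own hardened (thresholded) predictions. A minimal sketch for a binary task — the paper's actual version uses an adaptive threshold and warms up the mixing weight alpha over training, which is omitted here:

```python
import math

def confidence_aux_loss(strong_prob, weak_label, alpha=0.75):
    """Sketch of the auxiliary confidence loss (binary case).

    Mixes cross-entropy to the weak label with cross-entropy to the
    strong model's own hardened prediction, so the strong model is
    rewarded for staying confident even when it disagrees with the
    weak supervisor.
    """
    eps = 1e-12  # avoid log(0)
    # Hardened self-prediction: threshold the strong model's probability.
    hard_self = 1.0 if strong_prob > 0.5 else 0.0

    def ce(p, target):
        return -(target * math.log(p + eps) + (1 - target) * math.log(1 - p + eps))

    return (1 - alpha) * ce(strong_prob, weak_label) + alpha * ce(strong_prob, hard_self)
```

With alpha > 0, a strong model that confidently disagrees with a weak label is penalized far less than under plain cross-entropy to the weak label, which is what lets it override weak-supervisor mistakes.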

TheInsideView (author)

I made this video because a subscriber called Christopher emailed me saying the Collin Burns episode was one of his favorites and that he wanted a video on the weak-to-strong generalization paper.

TheInsideView (author)

Feel like the results in the bar charts are missing a "base strong performance" bar, i.e. the strong model's performance before any weak supervision. What if the strong model is already quite good at 0-shot on the evaluation tasks? That baseline would help quantify how much is actually gained from weak finetuning.
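For context, the paper quantifies this with PGR (performance gap recovered), where the floor is the weak supervisor's performance and the ceiling is the strong model finetuned on ground truth — the comment's point is that a zero-shot strong baseline would be a useful extra reference. A minimal sketch (the function name and the example accuracies are mine):

```python
def performance_gap_recovered(weak, weak_to_strong, strong_ceiling):
    """Fraction of the gap between the weak supervisor and the strong
    model's ceiling that is recovered by weak-supervised finetuning
    (all inputs are accuracies in [0, 1])."""
    return (weak_to_strong - weak) / (strong_ceiling - weak)

# e.g. weak supervisor 60%, weak-to-strong model 75%, ceiling 90%
print(performance_gap_recovered(0.60, 0.75, 0.90))  # → 0.5, half the gap recovered
```

Note that PGR says nothing about where the strong model started: if its zero-shot accuracy were already 75%, the weak finetuning would have gained nothing despite a PGR of 0.5.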

daniellawson