Stanford CS25: V4 I Hyung Won Chung of OpenAI

Показать описание

April 11, 2024
Speaker: Hyung Won Chung, OpenAI

Shaping the Future of AI from the History of Transformer

0:00 Introduction
2:05 Identifying and understanding the dominant driving force behind AI.
15:18 Overview of Transformer architectures: encoder-decoder, encoder-only and decoder-only
23:29 Differences between encoder-decoder and decoder-only, and rationale for encoder-decoder’s additional structures from the perspective of scaling.

About the speaker:
Hyung Won Chung is a research scientist at OpenAI ChatGPT team. He has worked on various aspects of Large Language Models: pre-training, instruction fine-tuning, reinforcement learning with human feedback, reasoning, multilinguality, parallelism strategies, etc. Some of the notable work includes scaling Flan paper (Flan-T5, Flan-PaLM) and T5X, the training framework used to train the PaLM language model. Before OpenAI, he was at Google Brain and before that he received a PhD from MIT.

Рекомендации по теме

Stanford CS25: V4 I Hyung Won Chung of OpenAI

Stanford CS25: V4 I Hyung Won Chung of OpenAI

Stanford CS25: V4 I Jason Wei & Hyung Won Chung of OpenAI

Segredos da IA: Insights da OpenAI na Stanford CS25 #IA #STANFORD #XMACNA #shorts

Low Level Technicals of LLMs: Daniel Han

NN, Music Generation and hearing (revised)