David Silver - Deep Reinforcement Learning from AlphaGo to AlphaStar

preview_player
Показать описание
Recently, self-learning systems have achieved remarkable success in several challenging problems for artificial intelligence, by combining reinforcement learnng with deep neural networks. In this talk, I describe the ideas and algorithms that led to AlphaGo: the first program to defeat a human champion in the game of Go; AlphaZero: which learned, from scratch, to also defeat the world computer champions in chess and shogi; and AlphaStar: the first program to defeat a human champion in the real-time strategy game of StarCraft.

Bio: David Silver is a principal research scientist at DeepMind and a professor at University College London. David's work focuses on artificially intelligent agents based on reinforcement learning. David co-led the project that combined deep learning and reinforcement learning to play Atari games directly from pixels (Nature 2015). He also led the AlphaGo project, culminating in the first program to defeat a top professional player in the full-size game of Go (Nature 2016), and the AlphaZero project, which learned by itself to defeat the world's strongest chess, shogi and Go programs (Nature 2017, Science 2018). Most recently, he co-led the AlphaStar project, which led to the world's first grandmaster level StarCraft player (Nature 2019). His work has been recognised by the Marvin Minsky award, Mensa Foundation Prize, and Royal Academy of Engineering Silver Medal.

*Sponsors*
Man AHL: At Man AHL, we mix machine learning, computer science and engineering with terabytes of data to invest billions of dollars every day.

Рекомендации по теме
Комментарии
Автор

This is so high level. Thanks so much for sharing, David!

haraldgnaf
Автор

What would happen if two machines with the same version of alphastar are made to play against each other? Can we predict if the first one that moves will always win. Assume they are both are use same random weights with the same seed.

ChuckChekuri
Автор

I would like to add, did you even think about putting two programs in the same Arena with or without human players😎🤓 I'm just asking as a human

timjohnson