DeepMind AlphaZero - Mastering Games Without Human Knowledge

2017 NIPS Keynote by DeepMind's David Silver. Dr. David Silver leads the reinforcement learning research group at DeepMind and is lead researcher on AlphaGo. He graduated from Cambridge University in 1997 with the Addison-Wesley award.

Recorded: December 6th, 2017

Comments

The best exposition I've seen to date on what promises to be an AGI

palfers

Thank you for an excellent explanation. I'm looking forward to seeing where this leads.

SafeTrucking

Amazing talk, thanks to speaker and uploader

drancisdrake

❤ Thank you very much to the publisher for a beautiful lesson and demonstration.

petergreen

I'd love to see this in more complex, open-ended computer games. If you told AlphaZero to play Cities: Skylines and maximize the population, with secondary constraints like environmental quality and RCI balance, I wonder what it would come up with.

kayrosis

I wonder if you could use this as a tool to teach kids math, for example by analyzing where a kid's understanding is going wrong. It could pinpoint the area of confusion and help the kid bridge it and gain insight by providing simpler examples.

peters

It's a wonderful achievement.
I think that it has the potential to change the world.

alph

Just wait to see what will happen when we achieve "reinforcement learning learning": when reinforcement learning can improve the reinforcement learning algorithm itself.

kephalopod

If you look at the three graphs at 30:25, you'll notice "jumps" in all three curves. At a jump, from left to right, the curve starts to level off, and then abruptly shoots up nearly vertically again, the slope changing quite suddenly. There must be some significance to these jumps. Perhaps the algorithm has suddenly discovered a particularly effective heuristic for evaluating board positions, or the algorithm actually is developing something like human "insight" or "intuition" at these jumps.

forestpepper

And at this point Stockfish resigned the game

richiester

It's often described as RL without search. But there's always a search tree.

vegahimsa

Astonishing games from AlphaZero! Stockfish calculates 70 million positions per second, AlphaZero 80,000. Human champion Carlsen can probably do 7. Human intuition is 10,000x better than AI, but the amazing part is that AI intuition is 1,000x better than a brute-force approach. It seems that AI is about halfway there. BTW, the 3 players are not on equal footing: Stockfish would probably need a 10x or 100x increase in speed if it weren't equipped with tablebases, heuristics, openings, etc. How many years will it take for the second half of the road to AGI? For starters, how long did it take for the first half?

peterpetrov

Thought he said he had automated several talks. I thought, man that would be super impressive.

Wemdiculous

I'm waiting for the day when AI will be able to design new games from scratch instead of just learning how to play already existing ones.

robostain_

To tell you the truth, my friends, I'm more afraid of this technology than fascinated by it. Greetings! ;)

basteqss

I was high af watching this and I could only focus on this guy saying “uuuum”

asink

Just a thought. Has anyone contemplated what would happen if we could teach AlphaGo Zero to teach humans to play Go? How would that develop, and what kind of players would it produce from people who had never played Go? And what would happen the day you put a traditional human player up against an AlphaGo Zero-trained player? I find that very interesting.

PaytonTroy

I hope they release the remaining 90 games of AlphaZero vs. Stockfish 8.

bunnygummybear

Yea, but can it perform on a cold wet night in Stoke....

duskie

Delusions (8:20) Reinforcement: That's the magic part. Pay attention.
