AlphaZero: DeepMind's New Chess AI | Two Minute Papers #216

Показать описание

The paper "Mastering Chess and Shogi by Self-Play with a
General Reinforcement Learning Algorithm" is available here:

Our Patreon page with the details:

One-time payments:
Bitcoin: 13hhmJnLEzwXgmgJN7RB6bWVdT7WkrFAHh
Ethereum: 0x002BB163DfE89B7aD0712846F1a1E53ba6136b5A

Recommendations:

We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
Andrew Melnychuk, Brian Gilman, Christian Ahlin, Christoph Jadanowski, Dave Rushton-Smith, Dennis Abts, Emmanuel, Eric Haddad, Esa Turkulainen, Evan Breznyik, Frank Goertzen, Kaben Gabriel Nanlohy, Malek Cellier, Marten Rauschenberg, Michael Albrecht, Michael Jensen, Michael Orenstein, Raul Araújo da Silva, Robin Graham, Steef, Steve Messina, Sunil Kim, Torsten Reil.

Credits:

Károly Zsolnai-Fehér's links:

Рекомендации по теме

Комментарии

For some reason people seem to latch onto the "only 4 TPUs used", both in AlphaZero and in AlphaGo Zero.

Please clarify that this is only for the fully trained network, i. e. just to get the next move out of a playing AlphaZero.

During training, according to the paper 5000 first-generation TPUs and 64 second-generation TPUs were used.

Karol did clarify that this cannot yet be done on "commodity hardware", but the way things are presented both here and elsewhere, the 4 TPU figure is what sticks in people's minds.

NicolaiCzempin

Chills down the spine is the correct reaction.

fcarriedo

There's still debates going on that Stockfish was handicapped by not allowing it to use opening and endgame database. Also they say hardware they both ran at was incomparable. But certainly as an avid chess player I was really excited about this news. Unlike most of the engines the play of AlphaZero was way more human-like and easy to understand. Those ten games they published are super interesting and educational. Would love to see the other ones.

NeverInterpreter

first we make games so we have something to do in our free time, then we make programs that play these games for us

MrTurbo_

Next time AlphaZero applied to stock market becomes the richest "person" in the world in 4 hours. ^_^

DamianReloaded

They could play tens of thousands of games, publish them and feed chess enthusiasts for the next 3.000 years

francescomartella

1:05 the sudden jump in Elo Rating after a period of stagnation from 17hrs to 27hrs is interesting & scary!!! I wonder what caused the jump

bruceli

>Two Minute Papers
>6 Minute video

Not complaining at all! Not worth rebranding the channel over such a small gripe, but I find it funny that as the channel has grown, the videos seem to get longer and longer. Again, I have no problem with this and enjoy some of the added depth and explanations (and would hope many videos are 5+ minutes in the future!), but it's a bit funny considering the name of the channel.

Love the work, Karoly, especially your course you made available on Rendering/Ray tracing from the university of Vienna. I'm not fully through it yet, but it's been a pleasure to work through in my free time.

meegul

They only released 10 of the games. We want the other 90!!!!

andrewxx

Interesting thing about alpha zero it analyzes far fewer moves compared to stockfish, and still plays at such a high level.

capnrob

ChessNetwork's analysis it's a must watch!

DerrickBest

Sir, I watch a lot of YouTube on a lot of different subjects. And I mean all the time. But your channel is my absolute favorite. I get psyched every time I see you have posted something. Thank you so much for doing this. Not only are you exciting and have fantastic info you keep it short and to the point. Please, keep up the good work. My family and I are cheering you on!

cyleyoakum

Very interesting, thanks. Nice to hear Jerry and Daniel getting a shoutout, I have enjoyed their analysis of the AlphaZero games, I'll check out the other guy now. :-)

mrfans

Alpha Zero progress in chess skyrocketted and then the learning curve flattened after a few hours. It hit some sort of ceiling, yet it still betters every once in a while. Like Bruce Lee said, there are no limits, only plateaus.

julioandresgomez

God! I love your videos. I use them as means to select my next paper reading as there are tons of really interesting knowledge. What you say is totally true: 'What a time to be alive' :D

miscelanea

It should be noted that this was Stockfish 8. There are now stronger versions released.

Thornstream

so much to learn so little time ..and energy. i wish i never had to sleep

satoshinakamoto

What is weird is that Stockfish doesn't recommend some of the moves it made if you analyze it yourself.

thejaywalker

If it was to play dark chess (where you can only see the enemy pieces that can be captured), would it need even more generalized algorithm?

artman

Some points that most people are not aware of:
1. AlphaZero played against StockFish 8. At that time, StockFish 9 was already in the market.
2. AlphaZero was powered by a supercomputer (StockFish wasn't).
3. The 60 second rule is pure crap as the StockFish algorithm thinks accordingly at a specific moment of the game. If they played in the classical time format (90 min per side for the whole game), then StockFish might have won.

musicalnostalgician

AlphaZero: DeepMind's New Chess AI | Two Minute Papers #216

AlphaZero: DeepMind's New Chess AI | Two Minute Papers #216

AlphaZero: Shedding new light on the grand games of chess, shogi and Go

AlphaZero: DeepMind’s AI Works Smarter, not Harder

Magnus Carlsen Realizes His Opponent is Using STOCKFISH in Online Blitz Game

Google Deep Mind AI Alpha Zero Devours Stockfish

Magnus Carlsen on AlphaZero: Its willingness to sacrifice pieces is fascinating | Lex Fridman

Google Deepmind's AlphaZero Chess Engine Makes 'Inhuman' Knight Sacrifice

We're in the Endgame Now | Google Deepmind AI AlphaZero shows Stockfish a Thing or Two

AlphaZero Shocked Magnus By Sacrificing a Rook in the Opening | AlphaZero vs Magnus | Magnus Chess

Alpha Zero's 'Immortal Zugzwang Game' against Stockfish

How AlphaZero Completely CRUSHED Stockfish

Lee Sedol vs AlphaGo Move 37 reactions and analysis

The Strongest Computer Chess Engines Over Time

Google Deep Mind AI Alpha Zero Refutes 1.e4

Deep Mind AI Alpha Zero Sacrifices a Pawn and Cripples Stockfish for the Entire Game

New DeepMind AI Beats AlphaGo 100-0 | Two Minute Papers #201

Google Deepmind AI AlphaZero's Unpublished Brilliancy

AlphaGo Zero: Starting from scratch

Google Deep Mind Alpha Zero Sacs a Piece Without 'Thinking' Twice

AlphaZero destroyed chess openings | GothamChess and Lex Fridman

Deep Mind AI Alpha Zero's Positional Masterpiece With the Black Pieces

Deep Mind AI Alpha Zero Refuses a Draw from Stockfish

Google Deepmind's AlphaZero Chess Engine Strangles Stockfish

Google's self-learning AI AlphaZero masters chess in 4 hours