filmov
tv
Reinforcement Learning 3: Markov Decision Processes and Dynamic Programming

Показать описание
Hado van Hasselt, Research scientist, discusses the Markov decision processes and dynamic programming as part of the Advanced Deep Learning & Reinforcement Learning Lectures.