Reproducibility, Reusability, & Robustness in Deep Reinforcement Learning - Prof. Pineau

preview_player
Показать описание
Recorded May 3rd, 2018
"In recent years, significant progress has been made in solving challenging problems across various domains using deep reinforcement learning. However reproducing results for state-of-the-art deep RL methods is seldom straightforward. High variance of some methods can make learning particularly difficult when environments or rewards are strongly stochastic. Furthermore, results can be brittle to even minor perturbations in the domain or experimental procedure. In this talk, I will discuss challenges that arise in experimental techniques and reporting procedures in deep RL, and will suggest methods and guidelines to make future results more reproducible, reusable and robust. I will also report on findings from the ICLR 2018 reproducibility challenge." - Prof. Joelle Pineau
Рекомендации по теме
Комментарии
Автор

"Solving a problem" - eg, getting a goal in football, should be as abstract, general and generic as possible ("getting a goal"). So long as the system can produce many different solutions to achieve the same objective, then the goal "solving a problem" has been successfully reproduced, repeatedly, even though the path to it are different.

tthtlc
Автор

Creativity == Something different, and thus not necessarily identical to past pattern. Non-reproducibility is a sign of creativity.

tthtlc
Автор

Not to oversimplify Learning Like shocking a dog or providing a treat?

brianvandenberg
Автор

i dont know why, but i cant listen to female presenters. its like they dont say anything of substance, my brain is simply incapable of understanding what is it they say. i listent to 13 minutes of this and im done

lucid