Reinforcement Learning Made Simple - Policy

preview_player
Показать описание
This video goes over an introduction to reinforcement learning theory. Specifically, we dive into policies and go over their mathematical foundations.

Рекомендации по теме
Комментарии
Автор

phenomenal content. Great work, very easy to understand

Maxwellpaulwall
Автор

The Gaussian shape of the output makes sense if the output can be any real number, but if it's on a finite continuous range like [0, 1], wouldn't that make the probability densities at the endpoints unusually high and discontinuous (assuming you clip the output)? Would it make more sense to use something like a beta distribution for that kind of space?

zuloo