Reinforcement Learning Made Simple - Reward

This video introduces reinforcement learning theory. Specifically, it covers rewards and returns and goes over their mathematical foundations.

Comments

These lectures are well made and easy to follow.

gouravroy

9:48 Shouldn't this statement be the other way around? A high gamma would imply a similar importance on a reward in the future as an immediate one. Aside from that, great explanations :)

kelseystark
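
To make the point about gamma in the comment above concrete, here is a minimal sketch. It assumes the standard discounted return, G = r_0 + gamma * r_1 + gamma^2 * r_2 + ..., which may not match the video's exact notation. The function name `discounted_return` and the sample reward sequence are hypothetical, chosen only for illustration.

```python
def discounted_return(rewards, gamma):
    """Return sum over k of gamma**k * rewards[k] (assumed standard definition)."""
    total = 0.0
    # Work backwards so each step applies one more factor of gamma
    # to everything that comes later in the sequence.
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


# Hypothetical episode: the same reward of 1.0 at each of four steps.
rewards = [1.0, 1.0, 1.0, 1.0]

for gamma in (0.0, 0.5, 1.0):
    print(gamma, discounted_return(rewards, gamma))
# prints:
# 0.0 1.0    (only the immediate reward counts)
# 0.5 1.875  (later rewards count, but progressively less)
# 1.0 4.0    (no discounting: every reward counts equally)
```

Under this definition, a gamma close to 1 weights future rewards almost as much as immediate ones, while a gamma close to 0 makes the agent care mostly about the next reward.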

I was under the impression that the policy gets updated at the end of each episode. For the infinite horizon case, are you updating the policy during the middle of the episode, too, or are you generally stopping the simulation early to update the policy instead of letting it run indefinitely?

zuloo

Nice video. I would've liked to hear more explanation of how discounting differs from using no discount. Thanks.

fzamora

How does the machine know whether a reward is positive or not? Since it can't feel anything, why does it treat a reward as positive, and why doesn't it just repeat the wrong decision?

hossamsamir