Reinforcement Learning 8: Policy gradient methods

preview_player
Показать описание

Policy-based methods
- definition
- characteristics
- deterministic vs stochastic policies
Policy gradients
- gradient-based estimator
- Monte Carlo REINFORCE
Actor-critic methods
- definition
- algorithm
- extensions

#policygradients #actorcritic #reinforcementlearning #REINFORCE #montecarlo
Рекомендации по теме
Комментарии
Автор

Possible for you to share the latex template for the presentation?

yuktikaura