RL Course by David Silver Lecture 7 Policy Gradient Methods

preview_player
Показать описание
Looks at different policy gradients, including Finite Difference, Monte-Carlo and Actor Critic.
Рекомендации по теме