From Policy Gradient to Actor-Critic: Introduction (RLVS 2021 version)

preview_player
Показать описание
In this video I'm presenting the four routes to explain Deep RL and my choice of the Policy Gradient route.

The corresponding slides are available here:
Рекомендации по теме
Комментарии
Автор

The big picture is really nice! I finally have a better understanding of the families of RL methods. Thank you!

HL-kchj
Автор

2:50 Sequential decision making ...Policy search

SphereofTime