From Policy Gradient to Actor-Critic: Introduction (RLVS 2021 version)

In this video, I present the four routes to explaining Deep RL and my choice of the Policy Gradient route.

The corresponding slides are available here:

Olivier Sigaud

Comments

The big picture is really nice! I finally have a better understanding of the families of RL methods. Thank you!

HL-kchj

2:50 Sequential decision making ...Policy search

SphereofTime