[Paper Summary] Objective Mismatch in Model-based Reinforcement Learning

preview_player

Показать описание

Two optimization problems leave model-based RL in a tricky point: you cannot optimize both the model and the controller simultaneously. This video points a direction for a new class of model-based RL algorithms.

Nathan Lambert

Рекомендации по теме