[Paper Summary] Objective Mismatch in Model-based Reinforcement Learning

Two separate optimization problems leave model-based RL in a tricky spot: you cannot optimize both the model and the controller simultaneously. This video points toward a new class of model-based RL algorithms.