[Paper Summary] Objective Mismatch in Model-based Reinforcement Learning

Model-based RL involves two separate optimization problems, which leaves it in a tricky spot: the model and the controller cannot both be optimized at the same time. This video points toward a new class of model-based RL algorithms.