Convergence and Sample Complexity of Gradient Methods for the Model-Free Linear Quadratic Regulator

Mihailo Jovanovic (USC)
Reinforcement Learning from Batch Data and Simulation
