Convergence and Sample Complexity of Gradient Methods for the Model-Free Linear Quadratic Regulator

preview_player
Показать описание
Mihailo Jovanovic (USC)
Reinforcement Learning from Batch Data and Simulation
Рекомендации по теме