filmov
tv
Convergence and Sample Complexity of Gradient Methods for the Model-Free Linear Quadratic Regulator
Показать описание
Mihailo Jovanovic (USC)
Reinforcement Learning from Batch Data and Simulation
Reinforcement Learning from Batch Data and Simulation