RL Chapter 2, Part 2 (Multi-armed bandits: recursive value estimate formulas, setting initial values)

This part discusses the recursive (incremental) formulas for value estimates in the multi-armed bandit setting. It also mentions the use of optimistic initial values to force exploration.
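As a rough illustration of the two ideas named in the description, here is a minimal Python sketch. It is not taken from the video: the function names `update` and `run_greedy_bandit`, the Bernoulli reward model, the arm probabilities, and the constant step size of 0.1 are all assumptions chosen for the example. The recursive update is written as new estimate = old estimate + step size * (reward - old estimate); a step size of 1/n reproduces the plain sample average without storing past rewards.

```python
import random


def update(estimate, reward, step_size):
    """Recursive value update: move the estimate toward the new reward."""
    return estimate + step_size * (reward - estimate)


def run_greedy_bandit(win_probs, initial_value, steps, step_size=0.1, seed=0):
    """Purely greedy agent on hypothetical Bernoulli arms (assumed setup).

    Every estimate starts at `initial_value`. With rewards in {0, 1}, an
    initial value well above 1 is "optimistic": each pull lowers the pulled
    arm's estimate, so the untried arms look better and get selected.
    """
    rng = random.Random(seed)
    k = len(win_probs)
    q = [initial_value] * k   # current value estimates
    counts = [0] * k          # number of pulls per arm
    for _ in range(steps):
        # Greedy choice; ties are broken in favor of the lowest index.
        arm = max(range(k), key=lambda a: q[a])
        reward = 1.0 if rng.random() < win_probs[arm] else 0.0
        counts[arm] += 1
        q[arm] = update(q[arm], reward, step_size)
    return q, counts


# Sample average computed recursively with step size 1/n.
estimate = 0.0
for n, r in enumerate([1.0, 2.0, 3.0, 4.0], start=1):
    estimate = update(estimate, r, 1.0 / n)

# Greedy agent with optimistic initial estimates.
q_opt, counts_opt = run_greedy_bandit([0.2, 0.5, 0.8], initial_value=5.0, steps=200)
```

In this sketch the constant step size is a deliberate choice: with a 1/n step size, the first pull of an arm overwrites the initial value entirely, so the optimism would last for only one pull per arm. With a constant step size the initial value decays gradually, which keeps the greedy agent cycling through the arms for longer.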
Comments

Can you recommend some resources where I can practice the concepts behind these algorithms in the form of questions?

harshpathak