Ant-Semi-Circle Trained Meta-RL Agent

preview_player
Показать описание
The trained agent efficiently explores the semi-circle, demonstrating learning of approximately Bayes-optimal exploration strategy from offline data.
Рекомендации по теме