MULTIPOLAR Example

preview_player
Показать описание

Ant Example: A policy of a target agent (right) is trained by leveraging the policies of other source agents with different leg designs (left).

We propose MULTIPOLAR, a transfer RL method that leverages a set of source policies collected under unknown diverse environmental dynamics to efficiently learn a target policy in another dynamics.
Рекомендации по теме