filmov
tv
MULTIPOLAR Example

Показать описание
Ant Example: A policy of a target agent (right) is trained by leveraging the policies of other source agents with different leg designs (left).
We propose MULTIPOLAR, a transfer RL method that leverages a set of source policies collected under unknown diverse environmental dynamics to efficiently learn a target policy in another dynamics.