POET: Endlessly Generating Increasingly Complex and Diverse Learning Environments and Solutions

Показать описание

From the makers of Go-Explore, POET is a mixture of ideas from novelty search, evolutionary methods, open-ended learning and curriculum learning.

Abstract:
While the history of machine learning so far largely encompasses a series of problems posed by researchers and algorithms that learn their solutions, an important question is whether the problems themselves can be generated by the algorithm at the same time as they are being solved. Such a process would in effect build its own diverse and expanding curricula, and the solutions to problems at various stages would become stepping stones towards solving even more challenging problems later in the process. The Paired Open-Ended Trailblazer (POET) algorithm introduced in this paper does just that: it pairs the generation of environmental challenges and the optimization of agents to solve those challenges. It simultaneously explores many different paths through the space of possible problems and solutions and, critically, allows these stepping-stone solutions to transfer between problems if better, catalyzing innovation. The term open-ended signifies the intriguing potential for algorithms like POET to continue to create novel and increasingly complex capabilities without bound. Our results show that POET produces a diverse range of sophisticated behaviors that solve a wide range of environmental challenges, many of which cannot be solved by direct optimization alone, or even through a direct-path curriculum-building control algorithm introduced to highlight the critical role of open-endedness in solving ambitious challenges. The ability to transfer solutions from one environment to another proves essential to unlocking the full potential of the system as a whole, demonstrating the unpredictable nature of fortuitous stepping stones. We hope that POET will inspire a new push towards open-ended discovery across many domains, where algorithms like POET can blaze a trail through their interesting possible manifestations and solutions.

Authors: Rui Wang, Joel Lehman, Jeff Clune, Kenneth O. Stanley

Links:

Рекомендации по теме

Комментарии

I would be interested in seeing some kind of Differentiable Adversarial Trainer.

There would be two agents. The Normal Reinforcement Agent and an adversarial trainer agent. The trainer agent would generate levels such that the normal Agent gets a very specific score. It gets penalized if the normal agent outperforms or underperforms the level. This way it will keep things challenging but not impossible.

herp_derpingson

Wow so many papers in rapid succession.

herp_derpingson

Thanks for introducing the paper! learned a lot!

tenghuilai

Do you have a patreon or something we can use to support you?

petroschristodoulou

Amazing video as always Yannic! Looking forward to catching up tomorrow on this 😃

machinelearningdojo

POET: Endlessly Generating Increasingly Complex and Diverse Learning Environments and Solutions

POET: Endlessly Generating Increasingly Complex and Diverse Learning Environments and Solutions

POET: Endlessly Generating Increasingly Complex & Diverse Learning Environments and their Soluti...

MIT 6.S192 - Lecture 14: 'Towards Creating Endlessly Creative Open-Ended ...' by Jeff Clun...

Practical Procedural Generation for Everyone

3 Ways to Express Your Thoughts So That Everyone Will Understand You | Alan Alda | Big Think

Can you find the 5th arrow? #shorts

1 Simple Trick For Writing Memorable Lyrics

The Dream Of Life - Alan Watts

How to translate the feeling into sound | Claudio | TEDxPerth

Two Easily Remembered Questions That Silence Negative Thoughts | Anthony Metivier | TEDxDocklands

Mathematician Explains Infinity in 5 Levels of Difficulty | WIRED

The Will to Win - A Powerful Life Poem

The Terrible Paradox of Self-Awareness | Fernando Pessoa

A cappella arranging: 'Endless Sands' (original choral composition) | Choir With Knut

Metaphysics - The Book That Helps You Manifest Anything (Full Audiobook)

Why do I feel so empty, bored, unfulfilled, like something is missing...

The Trap of Mind Control: How They Shape and Manipulate Your Perceptions!

Mary Oliver — Listening to the World

Song of the Immortal: A Chat with Erik P. Antoni on Self Transformation and Spiritual Alchemy

Fully Booked EP59: The Endlessly Patient Co-Writer: Using AI for Faster, Better Writing

President Trump: 'I am the chosen one.'

Вячеслав Дубынин: тренировка памяти, витамины и БАДы, влияние соцсетей на работу мозга...

A Message of Hope to People That Are Suffering

A Complete Guide to New Complexity and its Core Composers