Implementing an Open Source Agent57

Показать описание

Having a YouTube channel has brought me a number of fantastic opportunities over the years. One of my favorites was working on a project to implement an open source agent 57. In this video I'll talk a little about how A57 works and how I worked with a team of Ukranian developers to implement an open source version.

If you want to contribute to the project, check it out here:

You can find a written version of this article with some more details here:

Want to support my work and learn how to turn deep reinforcement learning papers into code?

Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.

Or, pickup my Udemy courses here:

Deep Q Learning:

Actor Critic Methods:

Curiosity Driven Deep Reinforcement Learning

Natural Language Processing from First Principles:

Just getting started in deep reinforcement learning? Check out my intro level course through Manning Publications.

Reinforcement Learning Fundamentals

Here are some books / courses I recommend (affiliate links):

Come hang out on Discord here:

Machine Learning with Phil

Рекомендации по теме

Комментарии

Great job, as always Phil! Thanks for co-authoring/co-leading this effort. Absolutely couldn't have done it without you. Thanks, Chris

ChrisPoulin

Muchisimas gracias por este contenido de gran valor ❤

hidrogenodeuterio

Thanks great job !! Please Can you make a video to explain QMIX algorithm ?

manelmimi

Phil, text on the slides are hard to read, because window with You is too big :-).

Biedropegaz

Phil, only gpus I can really get are a couple of AMD RX580 8GB and one of them is for vm passthrough only, so really only direct access to one at a time (there are ways of course) and only have an 4core xeon 8threads 3.5GHZ which has an igpu that is not used at all, 32GB ECC 1600 MHZ (overclockable if needed), nvme shares 16 lanes with gpu, so only 8x lanes gen3 to gpu, I can connect a workstation class gpu to the 4x I have left for video only, so can dedicate that one 580 to compute, but prefer to keep that one slot for network. Will I be able to run any of the lessons you have demonstrate on youtube? I think I have to turn off igpu turbo from motherboard dip switch first, but Maybe I can get some compute loads to run against the igpu also? (I could use both cards from the host but would love just still leave that one passed through. So my system is 20 lanes max on cpu but has additional 8 lanes (electrically four lanes I think) connected to the chipset and shares with usb excetera and that is what the vm uses. Everything will run bare metal, ubuntu,

alignedfibers

Can you share some thoughts on Chat GPT. I know it is a buzz word these days and everyone is talking about it, but will be great to hear your thoughts.

FarrukhTickets

What gpu and motherboard should I buy if I want to train lstm or rnn? My budget is about 2k - 3k usd. I use keras and tensorflow. Where can I find information or what are key informations should I focus on?

karolstylok

Is it possible to configure the training to just run plain R2D2?

marcopleines

Hello, I'm having difficulties running docker-compose, it's my first time on Linux, I decided to change to a real operating system but I don't know if I'm doing things right or not, when I do docker-compose it's telling me that does not find .env, that file is not in the directory.

eduardorosentreter

Implementing an Open Source Agent57

Implementing an Open Source Agent57

AGI-22 | Chris Poulin - Open Source Deep Reinforcement Learning

Agent Learns to do Reinforcement Learning

Curiosity driven Exploration for Mapless Navigation with Asynchronous Deep Reinforcement Learning

NVIDIA Wants You To Have A Free GPU

DeepMind's AI Plays 57 Different Games With Super-Human Performance | Game Futurology #11

Open AI Gym , Applications in Reinforcement Learning

I Asked ChatGPT To Write an Actor Critic Agent ...

Don't miss these top 10 resources to download machine learning papers for free

VIM is a Modern Python IDE

RLVS 2021 - Day 3 - Deep Q-Networks and its variants (Part 3)

SDS 551: Deep Reinforcement Learning — with Wah Loon Keng

Learning AI through play and experimentation

Shortcut Learning in Deep Neural Networks

Future of Development: Tomorrow's Tech Today | Lee Englestone

Nick Cheney - Learning to Continually Learn

AI in a 2D Self Learning / Driving Car Simulation - Lee Englestone. Dot Net North - April 2022

Munchausen Reinforcement Learning

Never Give Up: Learning Exploration strategies in RL | Paper Explained

Superhuman Trackmania AI Demo | map5

DASIP21 Best Paper - Desnos K. - Lightweight AI through Generic and Evolvable Tangled Program Graphs

Yurii Tolochko: Why you don't see many real-world applications of... | PyData Berlin 2019

CartPole-v1 with QLearing

What's New In Machine Learning?