The Full Reinforcement Learning Iceberg

Dive into 10 levels of the RL stack with Joseph Suarez, a newly minted MIT PhD and the creator of Neural MMO + PufferLib. There's something here for beginners and world-class experts alike. Star the project on GitHub to feed the puffer!

Most of my development is livestreamed right here. It's all open source, and we welcome contributions!

Neural MMO

Comments

Thanks for putting in the work to build a solid foundation for the future researchers. I hope you become a standard and get rewarded for your contributions

DavidMisc-stuff

wow this was actually fantastic! very well explained the landscape. even has procgen! incredible. Going to check this lib out - thank you

AxelAhmer

Great video! Love learning abt RL. Subscribed :)

sinfinite

incredible video as always, i've been putting off starting my RL journey and i think that thanks to this video i'm starting lol

snats_xyz

Nice video! I also checked out your article on twitter, was a bit hard to find so you should also link it in the description.

mgostIH

Thanks for the video dude! Keep it up!

krankvegann

This is a very digestible summary of what you've been working on. I generally operate in the generative NLP space, whose intersection with reinforcement learning tends to be a quick REINFORCE or PPO run to adapt to human preferences, but this makes me wonder if it might not be a bad idea to take a few more steps into RL.

With regard to open-endedness (creativity, if you will), it seems quite a natural assumption to me that the way forward is some integration with language models, combining explicit and implicit planning with specific rules for communication between agents (I do regard the segregation of information between agents as quite an important consideration). I dare not suggest exactly what form that will take, though, for fear of being just close enough to be frustrated at not guessing it, and yet just far enough to be laughed at for it.

novantha

You’re so cool bro it’s actually incredible and you are working out my dream in real time

Sykooma

It is hard to gain trust as a dev when you are wearing elegant tuxedos instead of a coffee-stained white t-shirt! Jokes aside, great video!

avoidthevoid

I'm glad I'm not the only one building custom simulators

AIShipped

But my Dr. says all my issues are because of carbs, I am confused

Wicaeed

Amazing video, people underestimate these points. I am here for level 10 though ;)

umairnasir

This is awesome! How approachable do you think PufferLib is for beginners in RL? For context, I've trained a few RL agents using Gym in the past.

kushaagra

Great video! Some finance env please ;)

philiplivdan

cool video, "ppo solves dota, it can probably solve your problem too" is pithy, I like it

AmbisinisterSSBM

regarding open-endedness... Minecraft, Web-Agents, ...?

-mwolf
