Reinforcement Learning with Stable Baselines 3 - Introduction (P.1)

Welcome to a tutorial series covering how to do reinforcement learning with the Stable Baselines 3 (SB3) package. The objective of the SB3 library is to be to reinforcement learning what sklearn is to general machine learning.

Comments

By the way, when you were comparing the models you were still calling env.action_space.sample(), which is why they were almost the same and didn't look like they were learning.

AakashKumar-gtip

I have been using Stable Baselines 2 for the last year or so for my work and it's super convenient; the docs are great, with great examples for custom envs, etc. It's a great library.

ashu-

This is very useful. I'm working on an RL video series myself (the theory side, so no overlap here) and I was just looking for prebuilt RL algos. Stable Baselines 3 is by far the most complete/well-tested suite I've come across. This really makes a big difference - thanks!

Also, it's nice to see that super technical coverage like this can yield 1M+ followers. Awesome.

Mutual_Information

Sentdex, you're a legend, brother. The thought of implementing these using deep learning libraries alone: instant grey hair! Thank you

vernonvkayhypetuttz

Your videos always inspire me to continue working on my own projects!!!

hendrixkid

Even without watching it yet, thanks for your good work and content, sentdex

thetiesenvy

If you're following along in a Conda environment and the Lunar Lander environment gives you an error (namely "module 'gym.envs.box2d' has no attribute 'LunarLander'"), then I found that you also need to install two other packages, swig and box2d-py:

conda install -c conda-forge swig box2d-py

OhItsAnthony

LET'S GO, THIS IS EXACTLY WHAT I WANTED, THANK YOU SO MUCH

pfrivik

Honestly loving this series, I hope you make an in-depth tutorial series on this. Thanks

amogh

I had so much fun learning with you.... can't wait to follow you again after completing my web project

alishbakhan

wow, great video! really can't wait for the rest to come out and learn more.
Thanks for all the info you provide us!

enriquesnetwork

Thanks for introducing Stable Baselines 3,
and yeah, sometimes we forget to use the model!

VaibhavSingh-lfps

I think you were still getting random results because you still had the .sample method call in the rendered tests for A2C and PPO. They learned, but you did not use the trained model for testing.

OTheIDaveO

Happy New Year Sentdex, I was learning machine learning during the lockdown and had no background in the field. You teach so well

tytobieola

Thank you, these video tutorials will be a big help for my thesis. I'm going to support you.
I have many doubts; I hope this series can resolve them.

arthurflores

awesome video, learned a lot, keep up the good work

ahmarhussain

Thank you for this tutorial. I am just getting into AI. It is over my head immediately, but your overview of the parts such as observation and agent were helpful for the bigger picture.

KennTollens

Great series as always... needs the next step: developing asynchronous (multiprocessing) models, e.g. PPO into Asynchronous PPO (APPO), on custom environments... Thx

markd

Can you please talk about how to use RL to model and optimize satellite networks and HAPs (high altitude platforms)?

How do we control the direction and angle of a projector embedded in a HAP or UAV so that it directs its light beams towards a specific area of interest on Earth?

AIdreamer_AIdreamer

This is really interesting and new to me! You mentioned going over creating custom environments in future videos which sounds like exactly what I am eager to know next so I’m really looking forward to that video! Is there anything I should educate myself on in the meantime?

DaZMan