How do RL agents really learn? | Reinforcement Learning Part-2

Показать описание

In this video, we present the fundamental algorithms that make Reinforcement Learning as powerful as it is today. The ideas, which originated several years ago, still find their way into today's state-of-the-art algorithms. That's why we are dedicating a video to these algorithms. Happy Learning!

============================
Do you want to learn from me?
============================

📱 Grow with us:

👍If you find this video helpful, consider giving it a thumbs up and subscribing for more educational videos on data science!

💭Share your thoughts, experiences, or questions in the comments below. I love hearing from you!

⌚Time Stamps⌚

0:00 - Intro
0:43 - MDPs
2:00 - Model
4:15 - Explore-Exploit
5:08 - Dynamic Programming
10:40 - Monte Carlo Methods
14:05 - TD Methods
15:49 - Driving Home Example
19:40 - SARSA
22:30 - Q-Learning
23:14 - Comparing SARSA and Q-Learning
24:31 - Outro

CampusX

Рекомендации по теме

Комментарии

Undoubtedly the best overview of RL I found on YT. I went thru deepmind course and what not but the clarity that I got from these videos is just really amazing. Thanks for this.

umaraslam

Really thank Rajthilak sir for this informative video. Please don't think bad about this message sir. Nothing personal.

But i am requesting to Nitish Sir (CampusX creator), sir you itself please create videos for the channel.

As u r the USP (unique selling proposition) in this channel.

- As the way u speak
- The way u breakdown the concept
- the best way of writing in iPad instead of slides.
These are really awesome style of teaching.

Fact:
All the people good in technology can't teach. It's a special quality.
A best surgeon need not be a Lecturer and vice versa.

So instead of anyone else. Even founder of these concepts are not needed.

We need ONLY NITISH Sir.
Please sir believing you, i took the risk of following only your playlist.

Please please kindly you yourself take videos and upload.

Last point.
Please don't worry on subscribers. It will 💯 come sir. I will share as much to clg grps and other platforms. (Also tell my friend influencers)
Your channel one day will suddenly boom with subscribers.
Please don't quit bcoz of it.

I even saw ur podcasts. Which was completely realistic. No masala added. Only plain story. Great one.

rbk.technology

Nitish sir
Please complete machine learning interview questions as soon as possible.

vaibhav

Sir please complete gradient boosting and xgboost playlist with python implementation

ambaradhikari

2 videos are hidden from your reinforcement learning playlist sir. Is there any reason sir?

fozler

You say a lot of information in a very short span of time, please show those information in a text format in the video as well. Just like Josh Starmer's videos. It helps in letting us get used to with that info and also pay attention to what you are saying next.
Just like in 11:20, you said "Instead of expected return, they are calculated as sample return", this text could have been shown in the video as well. You skimmed through this part and then proceeded to show the equations.

arghadeepdey

majority dimak ka upar se gaya evn after watching it fr second time, NITISH sir would have explained in a better way

harshavardhan

can someone please explain, how the state values were calculated at 9:10 ?

arghadeepdey

The video is too monotonous and I can't tell which is what, And differentiates bw one topic and another

praphulshaw

God but mostly subscribers are not satisfied because not a used pen .

YadavSachin

How do RL agents really learn? | Reinforcement Learning Part-2

How do RL agents really learn? | Reinforcement Learning Part-2

Reinforcement Learning Basics

How to Code RL Agents Like DeepMind

Training RL From YouTube Videos

The key difference between RL and Agent-based modeling

'Crowd dynamics, RL and Unity: A Journey' by Tomas Diaz

Reinforcement Learning with sparse rewards

Deep Reinforcement Learning: Neural Networks for Learning Control Laws

How to Be a Spy! - EPIC HOW TO

WarpDrive: Orders of Magnitude Faster Multi-Agent Deep RL on a GPU

Reinforcement Learning: Machine Learning Meets Control Theory

DLRLSS 2019 - Model-Based RL - Martha White

NEW Multi-Agent Dynamics w/ Self-Play RL

Decision Transformer: Reinforcement Learning via Sequence Modeling (Research Paper Explained)

Deep Reinforcement Learning Tutorial for Python in 20 Minutes

MIT 6.S091: Introduction to Deep Reinforcement Learning (Deep RL)

RLVS 2021 - Day 1 - Reward processing biases in humans and RL agents

RL AI Agent

'Training an Autonomous Pentester with Deep RL' by Shane Caldwell

AI vs Machine Learning

Reincarnating RL @ DLCT

Train Your RL Agents With Attention! | Game Futurology #10

CS885 Lecture 18a: Safe multi-agent RL for autonomous driving (Presenter: Ashish Gaurav)

Reinforcement Learning for Gaming | Full Python Course in 9 Hours