Markov Decision Processes (MDPs) - Structuring a Reinforcement Learning Problem

Показать описание

💡Enroll to gain access to the full course:

Welcome back to this series on reinforcement learning! In this video, we'll discuss Markov decision processes, or MDPs. Markov decision processes give us a way to formalize sequential decision making. This formalization is the basis for structuring problems that are solved with reinforcement learning.

We will detail the components that make up an MDP, including: the environment, the agent, the states of the environment, the actions the agent can take in the environment, and the rewards that may be given to the agent for its actions.

Sources:
Reinforcement Learning: An Introduction, Second Edition by Richard S. Sutton and Andrew G. Bartow

Playing Atari with Deep Reinforcement Learning by Deep Mind Technologies

🕒🦎 VIDEO SECTIONS 🦎🕒

00:30 Help deeplizard add video timestamps - See example in the description
06:04 Collective Intelligence and the DEEPLIZARD HIVEMIND

💥🦎 DEEPLIZARD COMMUNITY RESOURCES 🦎💥

👋 Hey, we're Chris and Mandy, the creators of deeplizard!

👉 Check out the website for more learning material:

💻 ENROLL TO GET DOWNLOAD ACCESS TO CODE FILES

🧠 Support collective intelligence, join the deeplizard hivemind:

🧠 Use code DEEPLIZARD at checkout to receive 15% off your first Neurohacker order
👉 Use your receipt from Neurohacker to get a discount on deeplizard courses

👀 CHECK OUT OUR VLOG:

❤️🦎 Special thanks to the following polymaths of the deeplizard hivemind:
Tammy
Mano Prime
Ling Li

🚀 Boost collective intelligence by sharing this video on social media!

👀 Follow deeplizard:

🎓 Deep Learning with deeplizard:

🎓 Other Courses:

🛒 Check out products deeplizard recommends on Amazon:

🎵 deeplizard uses music by Kevin MacLeod

❤️ Please use the knowledge gained from deeplizard content for good, not evil.

Рекомендации по теме

Комментарии

Check out the corresponding blog and other resources for this video at:

deeplizard

Can we take a second and just appreciate the work put in producing such high-quality videos in bites that are easy to understand?

beltusnkwawir

Thanks deeplizard for doing the hard work on illustrations to explain it to the feeble-minded. Its like training a donkey, how to solve calculus.

aparvkishnov

this is by far the best tutorial I've seen about this topic. I'm about to watch the whole series :D

SandwichMitGurke

I’m so glad you produced this series of videos. I was intimidated by all the math and algorithm variations covered in the first four lectures of my graduate course. After watching these videos and then revisiting my grad lectures, I now actually understand what my professor was trying to teach. Thank you!

mike

I saw different channels but no one explained this topic better than you . thanks alot

amirhosseinesteghamat

I was wandering here and there looks like I have landed a perfect place to learn Deep Learning.... Thanks .. I will continue.

alokk

You are awesome.
This series would help me for my project.
Thank you so much.
Best regards...

muomgu

This series is awesome. Make learning a lot easier. Thank you so much.

ziaurrehman

- **Introduction to Markov Decision Processes (MDPs)**:
- 0:00 - 0:17

- **Components of MDPs**:
- 0:23 - 1:43

- **Mathematical Representation of MDPs**:
- 1:47 - 3:59

- **Probability Distributions and Transition Probabilities**:
- 4:02 - 4:56

- **Conclusion and Next Steps**:
- 5:01 - 5:47

theliterunner

Keep up the good work, thank you for the time your are putting on making this series :)

sahand

amazing explanation of what is RL. I will watch the whole series from now

danielzoulla

Seriously... Amazing tutorial! I really like how you offer text version as well. Thanks you :)

haneulkim

Very intuitive and easy explanation. Thank you! 🤗😀

ilovemusic

Great tutorial, understood the concept clearly for the first time, after going through many. Thank you very much.

thusharadunumalage

Great video with intuitive explanations 👌

Galinator

Thank you so much it is very clear the explanation of MDPs.

christopherherrera

Second video completed, the video was clear as day

asdfasdfuhf

This video can be denoted by n as n approaches perfection.

nossonweissman

very very very very help full..thnks for making these videos..pls keep it going

harshadevapriyankarabandar

Markov Decision Processes (MDPs) - Structuring a Reinforcement Learning Problem

Markov Decision Processes - Computerphile

Markov Decision Process (MDP) - 5 Minutes with Cyrill

Markov Decision Processes (MDPs) - Structuring a Reinforcement Learning Problem

Markov Decision Processes - Georgia Tech - Machine Learning

First MDP Problem

COMPSCI 188 - 2018-09-18 - Markov Decision Processes (MDPs) Part 1/2

Markov decision process in machine learning | Reinforcement learning | Lec-31 | Machine Learning

Reinforcement Learning 2: Markov Decision Processes

Markov Decision Processes

6.4. Markov Decision Processes MDPs

How to solve problems with Reinforcement Learning | Markov Decision Process

Lecture 8 MDPs

Reinforcement Learning #3 | Markov Decision Process (MDP) 🔥🔥

MDPs: Markov Decision Processes | Decision Making Under Uncertainty using POMDPs.jl

8. Markov Decision Processes MDPs

#60 Reinforcement Learning- Introduction, Markovs Decision Problem with Example |ML|

RL Course by David Silver - Lecture 2: Markov Decision Process

Section 3 Worksheet Solutions: MDPs

The Markov Decision Process Explained

Markov Decision Processes (MDPs): The Foundation of Decision Making Under Uncertainty

Unit- V Lecture 59 - Markov Decision Process

RL CH3 - Markov Decision Processes (MDPs) and Dynamic Programming

Markov Decision Processes in Artificial Intelligence

Markov Decision Processes Two - Georgia Tech - Machine Learning