RL 5: Markov Decision Process - MDP | Reinforcement Learning

Markov Decision Process (MDP): a Markov decision process is a way to formalize sequential decision making. Thus we can formalize a reinforcement learning problem as a finite Markov decision process. There are 5 components of a Markov decision process: the agent, the environment, the states, the actions and the rewards. The agent takes an action in the environment based on the current state of the environment. After every action the environment moves to another state. The agent receives a reward for its action in the previous state. The goal of the agent is to maximize the total reward it receives in an episode or over a specific number of steps.
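
The loop described above can be sketched in a few lines of Python. This is only an illustrative toy, not code from the video: the corridor environment, the names `step`, `run_episode`, `always_right` and `random_policy`, and the reward of 1.0 at the goal are all assumptions made up for this example.

```python
import random

# Hypothetical toy environment (an assumption for illustration):
# a corridor of 4 cells, numbered 0..3, where cell 3 is the goal.
N_STATES = 4
ACTIONS = ("left", "right")


def step(state, action):
    """Environment: given a state and an action, return (next_state, reward, done)."""
    if action == "right":
        next_state = min(state + 1, N_STATES - 1)
    else:
        next_state = max(state - 1, 0)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0  # reward only when the goal is reached
    return next_state, reward, done


def run_episode(policy, max_steps=20):
    """Run one episode from state 0 and return the total reward collected."""
    state, total_reward = 0, 0.0
    for _ in range(max_steps):
        action = policy(state)                     # agent picks an action from the current state
        state, reward, done = step(state, action)  # environment moves to another state
        total_reward += reward                     # reward for the action just taken
        if done:
            break
    return total_reward


def always_right(state):
    """A fixed policy that always moves toward the goal."""
    return "right"


def random_policy(state):
    """A policy that ignores the state and acts at random."""
    return random.choice(ACTIONS)


print(run_episode(always_right))  # 1.0
```

Here the agent is the policy function, the environment is `step`, and the quantity the agent would try to maximize is the value returned by `run_episode`. Swapping in `random_policy` shows how a worse policy can fail to reach the goal within the step limit.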

Reinforcement learning tutorial series:

Comments

This is just wow... I have been struggling to learn RL... this guy makes it so easy... THANK YOU, brother

vimalpanmasala-yt

Great video, dude. I learnt a lot. Hope you continue to make more videos. Your explanation is crystal clear. Most AI videos are full of abstract maths and forget to help the audience visualize the algorithms.

adityachaturvedi

the little greeting at the beginning, thank you sir 😭

maria

very well put..am a new follower of your videos..

paedrufernando

SO four years back there were 5 components .. Nowadays we are studying 6 components

dhakaluma

How do I choose an algorithm for a problem like mine? I'm working on UAV drones, where a user wants to send information to a destination with the help of an assisting UAV relay, and we want to achieve the maximum secrecy rate. My supervisor told me that we will use an RL approach for this, but I'm not sure what to write in the proposal: an MDP, or some kind of algorithm? Right now I don't know of any algorithm I could choose for my topic. Waiting for a quick suggestion.

Sajedahmad

Hi sir, very nice video. I need some help understanding this in more detail. How do I do that?

SamanviKhushi

Nice video, sir. Please guide me: for an electric vehicle designed in MATLAB/Simulink, I want to use a model-free reinforcement learning algorithm in MATLAB. Please guide, sir.

kundankumar-dtuu

What you have studied fucked up all my concept

alokkumar

I can't access the RL playlist; it says "Private Videos". Kindly allow me to see the content.

kushkumar