Introduction (Move 37)

preview_player
Показать описание
Welcome to my new reinforcement learning course titled Move 37! in this 10 week course we'll go over the basics of reinforcement learning up to modern day techniques that involve neural networks called 'deep' reinforcement learning. In this first video, i'll introduce the idea of a Markov Decision Process. This is the basic mathematical framework for framing the reinforcement learning problem. We'll also briefly mention the ideas of a 'policy' and the agent-environment loop. Get hype!

Code for this video:

Please Subscribe! And like. And comment. That's what keeps me going.

The School of AI:

Want more education? Connect with me here:

Github Syllabus:

More learning resources:

Join us in the Wizards Slack channel:

And please support me on Patreon:
Signup for my newsletter for exciting updates in the field of AI:
Рекомендации по теме
Комментарии
Автор

Thank you Mr. Raval, for teaching A.I. with passion.

OO-iepe
Автор

You just described supervised, unsupervised and reinforcement learning in the most easy to understand language👶

painkillerDELHI
Автор

Your Lecture Delivering Skills have improved dramatically Siraj Sir . <3 . An inspiration for us. :)

hassaananwar
Автор

This is my life.I want to nailed it.Siraj keep it going.Passionate Kid from Kenya

gideongibeon
Автор

This is some god-like production quality

awalvie
Автор

I'm excited for this. Please make sure you go through everything in fine details, explaining the math and everything. Much love

_chappie_
Автор

I didn't believe Reinforcement learning course
Excited!!!👍

ankitmisra
Автор

Yes, finally we have Move 37 live. Looking forward to learn more.

ChetanVashistth
Автор

thank you for this course. RL has been the only branch of ml that I have been having a challenge teaching myself.. Natural born teacher you are Siraj.

devonk
Автор

Nice choice of name for the course. Move 37 was a revolutionary in that it represented new knowledge fo Go and AlphaGo found it by itself.

sibyjoseplathottam
Автор

I can't wait this course. !!! Thank you!!!

gunhwanson
Автор

Dude you're amazing, your videos are getting better by the day, and your pace is very reasonable!

AmeenAltajer
Автор

Great Video, Really Excited For MOVE37

anandjha
Автор

Awesome explanation of Markov's chain!! Congrats!

agustin
Автор

Your videos keep getting better! You seem to be ignoring the haters and taking all the feedback, and I'm so proud :')

MarcosPT
Автор

Thank u for this awesome video and series

AbhishekKumar-mqtt
Автор

thanks for using a supply chain scenario!

randombrandon
Автор

6:38 In a Markov Chain outgoing probabilities for every node should sum up to 1(or in transition matrix, each row sums up to 1), so there is something wrong with C, right?

kungfupanda
Автор

I love you Siraj you are damn genius and your energy. Love you man

HinaFirdaus
Автор

@Siraj, there are no resource links provided for Backpropogation algorithm in the description.

pradeeptippa
visit shbcf.ru