MIT: Machine Learning 6.036, Lecture 9: State machines and Markov decision processes (Fall 2020)

preview_player
Показать описание
* Lecture 9 for the MIT course 6.036: Introduction to Machine Learning (Fall 2020 Semester)
* Lecture date: 2020 / 10 / 27
* Lecturer: Tamara Broderick
* Lecture TAs: Crystal Wang and Satvat Jagwani

Errata: At the time of recording, slide 8 had an incorrect V^3_{\pi_B}(poor) value. Thanks to MIT student Bhav Jain for catching the error! The value (108) should now be correct in the video and also in the slides pdf linked above.

0:00:00 Overview & Plan
0:02:31 Intro to extended farming example
0:04:16 State machine
0:13:07 Markov decision process
0:34:54 What's the value of a policy?
0:57:54 What's the best policy?
1:12:39 What if I don't stop farming?
Рекомендации по теме