L4: Value Iteration and Policy Iteration (P3-Truncated policy iteration)—Math Foundations of RL

preview_player

Показать описание

Welcome to the open course “Mathematical Foundations of Reinforcement Learning”. This course provides a mathematical but friendly introduction to reinforcement learning.

Up to now, the textbook as received 3K+ stars on GitHub! The Chinese version of the videos has received 1,000,000+ views on the Internet!

It has ~50 short lecture videos and lasts for ~11 hours long in total. The videos will be uploaded one by one within the next few months. Please stay tuned!

WINDY Lab

Рекомендации по теме