L4: Value Iteration and Policy Iteration (P3-Truncated policy iteration)—Math Foundations of RL
Welcome to the open course “Mathematical Foundations of Reinforcement Learning”. This course provides a mathematical but friendly introduction to reinforcement learning.
To date, the textbook has received 3K+ stars on GitHub! The Chinese version of the videos has received 1,000,000+ views on the Internet!
The course has ~50 short lecture videos and runs ~11 hours in total. The videos will be uploaded one by one over the next few months. Please stay tuned!