MIT 6.S091: Introduction to Deep Reinforcement Learning (Deep RL)

preview_player
Показать описание
First lecture of MIT course 6.S091: Deep Reinforcement Learning, introducing the fascinating field of Deep RL. For more lecture videos on deep learning, reinforcement learning (RL), artificial intelligence (AI & AGI), and podcast conversations, visit our website or follow TensorFlow code tutorials on our GitHub repo.

INFO:

OUTLINE:
0:00 - Introduction
2:14 - Types of learning
6:35 - Reinforcement learning in humans
8:22 - What can be learned from data?
12:15 - Reinforcement learning framework
14:06 - Challenge for RL in real-world applications
15:40 - Component of an RL agent
17:42 - Example: robot in a room
23:05 - AI safety and unintended consequences
26:21 - Examples of RL systems
29:52 - Takeaways for real-world impact
31:25 - 3 types of RL: model-based, value-based, policy-based
35:28 - Q-learning
38:40 - Deep Q-Networks (DQN)
48:00 - Policy Gradient (PG)
50:36 - Advantage Actor-Critic (A2C & A3C)
52:52 - Deep Deterministic Policy Gradient (DDPG)
54:12 - Policy Optimization (TRPO and PPO)
56:03 - AlphaZero
1:00:50 - Deep RL in real-world applications
1:03:09 - Closing the RL simulation gap
1:04:44 - Next step in Deep RL

CONNECT:
- If you enjoyed this video, please subscribe to this channel.
Рекомендации по теме
Комментарии
Автор

Deep RL is my favorite subfield of AI, because it asks some fundamental questions about what it takes to build safe and intelligent robots that operate in the real world. So many open problems and interesting challenges to solve!

lexfridman
Автор

I wish you still did videos like this, we appreciate you sharing such knowledge.

NakedSageAstrology
Автор

Deep RL is the field that excites me the most. Thank you Lex.

KeepingUp_withAI
Автор

Thank you for bringing these lectures to us.

Techieadi
Автор

I really like that tongue in cheek chuckle when Lex talked about that multiverse and whoever created

kawingchan
Автор

Seriously the best Deep RL lecture out there to date.

samuelschmidgall
Автор

"Every type of machine learning is supervised learning", cannot agree more!!!

nova
Автор

Since 2017, Lex have improved his lessons spectacularly ! Now (2019), I watch a more fluid video with a feeling that this guy know exactly what his talking without hesitating . Once again, thanks Lex, for sharing this videos. Congratulations and thanks from Brazil.

wendersonj
Автор

1:04:40 Best part, that grin after he just casually dropped that line in an MIT lecture.. All of infinite universes being Simulations

akarshrastogi
Автор

Certainly, one of the best videos on deep learning I have come across.

ronaldolum
Автор

Loved the lecture. Definitely recommend his podcast. Quality.

judedavis
Автор

Every Lecture has a historical context, evolution, mathematics and inspiration, Technical overview, Network Architecture overview. Well Summarized!!

sivaa
Автор

Haha he says "that is super exciting", without being excited! He is a robot!
Thanks for the open lectures

danielvelazquez
Автор

I honestly don't care about AlphaGo or Dota 2 or the robots, I just cannot get over how incredible the thought structure is behind this. What is mean by thought structure is the strategy behind how to quantify the right things, asking the right questions, and model the policy upon which growth can be created. IT IS SICK

DennisZIyanChen
Автор

As always Le a generous Share, which will be a useful resource for loads of folks. Thanks.

amandajrmoore
Автор

I appreciate the philosophical insights sprinkled throughout the lecture!

mrr
Автор

I've seen a lot of these videos & read some of the books in ML; Lex has a clarity thats rare

MistaSmilesz
Автор

Important detail when trying to transfer from a simulation to the real world: make the simulation have many random variations in its behavior/mechanics during runtime. (such as drag, gravity, friction, size of the agent, random perturbations, etc) This will make the agent have to generalize more, and not over optimize on the details in the sim. This makes it easier to transfer the agents capabilities to a real world environment.

vast
Автор

Professor Lex, can we get the entirety of 6.S091 on MIT OCW ? This is an incredibly interesting topic that I've been working on (Evolutionary Computing) and am currently enrolled in a project with thorough knowledge of Deep RL as a requisite. This research field has very few online resources besides Stanford's CS 234 and Berkeley's CS 285.

Your explanations are immensely helpful and intuitive. Humanity will present it's gratitude if this whole course is made available ! AGI and AI safety issues need more attention before it's the greatest immediate existential risk, your courses can help raise general AI awareness and advance our civilization to higher dimensions. Loved the fact that you grinned while just casually mentioning the Simulation Hypothesis..

akarshrastogi
Автор

Lex Fridman, I just love your videos. I am your great fan sir. Carry on.

ArghyaChatterjeeJony