MIT 6.S091: Introduction to Deep Reinforcement Learning (Deep RL)

Показать описание

First lecture of MIT course 6.S091: Deep Reinforcement Learning, introducing the fascinating field of Deep RL. For more lecture videos on deep learning, reinforcement learning (RL), artificial intelligence (AI & AGI), and podcast conversations, visit our website or follow TensorFlow code tutorials on our GitHub repo.

INFO:

OUTLINE:
0:00 - Introduction
2:14 - Types of learning
6:35 - Reinforcement learning in humans
8:22 - What can be learned from data?
12:15 - Reinforcement learning framework
14:06 - Challenge for RL in real-world applications
15:40 - Component of an RL agent
17:42 - Example: robot in a room
23:05 - AI safety and unintended consequences
26:21 - Examples of RL systems
29:52 - Takeaways for real-world impact
31:25 - 3 types of RL: model-based, value-based, policy-based
35:28 - Q-learning
38:40 - Deep Q-Networks (DQN)
48:00 - Policy Gradient (PG)
50:36 - Advantage Actor-Critic (A2C & A3C)
52:52 - Deep Deterministic Policy Gradient (DDPG)
54:12 - Policy Optimization (TRPO and PPO)
56:03 - AlphaZero
1:00:50 - Deep RL in real-world applications
1:03:09 - Closing the RL simulation gap
1:04:44 - Next step in Deep RL

CONNECT:
- If you enjoyed this video, please subscribe to this channel.

Рекомендации по теме

Комментарии

Deep RL is my favorite subfield of AI, because it asks some fundamental questions about what it takes to build safe and intelligent robots that operate in the real world. So many open problems and interesting challenges to solve!

lexfridman

I wish you still did videos like this, we appreciate you sharing such knowledge.

NakedSageAstrology

Deep RL is the field that excites me the most. Thank you Lex.

KeepingUp_withAI

Thank you for bringing these lectures to us.

Techieadi

I really like that tongue in cheek chuckle when Lex talked about that multiverse and whoever created

kawingchan

Seriously the best Deep RL lecture out there to date.

samuelschmidgall

"Every type of machine learning is supervised learning", cannot agree more!!!

nova

Since 2017, Lex have improved his lessons spectacularly ! Now (2019), I watch a more fluid video with a feeling that this guy know exactly what his talking without hesitating . Once again, thanks Lex, for sharing this videos. Congratulations and thanks from Brazil.

wendersonj

1:04:40 Best part, that grin after he just casually dropped that line in an MIT lecture.. All of infinite universes being Simulations

akarshrastogi

Certainly, one of the best videos on deep learning I have come across.

ronaldolum

Loved the lecture. Definitely recommend his podcast. Quality.

judedavis

Every Lecture has a historical context, evolution, mathematics and inspiration, Technical overview, Network Architecture overview. Well Summarized!!

sivaa

Haha he says "that is super exciting", without being excited! He is a robot!
Thanks for the open lectures

danielvelazquez

I honestly don't care about AlphaGo or Dota 2 or the robots, I just cannot get over how incredible the thought structure is behind this. What is mean by thought structure is the strategy behind how to quantify the right things, asking the right questions, and model the policy upon which growth can be created. IT IS SICK

DennisZIyanChen

As always Le a generous Share, which will be a useful resource for loads of folks. Thanks.

amandajrmoore

I appreciate the philosophical insights sprinkled throughout the lecture!

mrr

I've seen a lot of these videos & read some of the books in ML; Lex has a clarity thats rare

MistaSmilesz

Important detail when trying to transfer from a simulation to the real world: make the simulation have many random variations in its behavior/mechanics during runtime. (such as drag, gravity, friction, size of the agent, random perturbations, etc) This will make the agent have to generalize more, and not over optimize on the details in the sim. This makes it easier to transfer the agents capabilities to a real world environment.

vast

Professor Lex, can we get the entirety of 6.S091 on MIT OCW ? This is an incredibly interesting topic that I've been working on (Evolutionary Computing) and am currently enrolled in a project with thorough knowledge of Deep RL as a requisite. This research field has very few online resources besides Stanford's CS 234 and Berkeley's CS 285.

Your explanations are immensely helpful and intuitive. Humanity will present it's gratitude if this whole course is made available ! AGI and AI safety issues need more attention before it's the greatest immediate existential risk, your courses can help raise general AI awareness and advance our civilization to higher dimensions. Loved the fact that you grinned while just casually mentioning the Simulation Hypothesis..

akarshrastogi

Lex Fridman, I just love your videos. I am your great fan sir. Carry on.

ArghyaChatterjeeJony

MIT 6.S091: Introduction to Deep Reinforcement Learning (Deep RL)

MIT 6.S091: Introduction to Deep Reinforcement Learning (Deep RL)

Introduction to Deep Learning (MIT 6.S191)

MIT Deep Learning 6.S191 Teaser

MIT 6.S191 (2019): Deep Generative Modeling

MIT 6.S191 (2021): Reinforcement Learning

MIT 6.S191 (2018): Deep Learning - A Personal Perspective

Lex Fridman: Is Reinforcement Learning a Good Candidate for AGI?

MIT 6.S191 (2023): Reinforcement Learning

MIT 6.S191 (2018): Deep Reinforcement Learning

MIT 6.S191 (2021): Deep Generative Modeling

MIT 6.S094: Computer Vision

MIT 6.S191 (2022): Reinforcement Learning

Introduction to Deep Reinforcement Learning | Deep RL Course

The power of reinforcement learning and robotics

MIT 6.S191 (2018): Convolutional Neural Networks

MIT 6.S191 (2019): Recurrent Neural Networks

The Magic of Reinforcement Learning with Human Feedback RLHF

Lex Fridman: Roadblocks in Deep Learning

Deep Learning State of the Art (2019) - MIT

Reinforcement Learning, by the Book

MIT 6.S094: Deep Learning for Human-Centered Semi-Autonomous Vehicles

Deepmind Reinforcement learning w unsupervised auxiliary tasks #airesearch #artificialintelligence

DeepRL1.1 - Introduction to Deep Reinforcement Learning

16. Reinforcement Learning, Part 1