Reinforcement Learning with sparse rewards

preview_player
Показать описание
In this video I dive into three advanced papers that addres the problem of the sparse reward setting in Deep Reinforcement Learning and pose interesting research directions for mastering unsupervised learning in autonomous agents.

Papers discussed:

Reinforcement Learning with Unsupervised Auxiliary Tasks - DeepMind:

Curiosity Driven Exploration - UC Berkeley:

Hindsight Experience Replay - OpenAI:

If you want to support this channel, here is my patreon link:

Рекомендации по теме
Комментарии
Автор

Your teaching style is incredible! Can you please do a video on Capsule Networks?

AnkitBindal
Автор

I've just been reading these papers for the openai retro competition. Your video went into a lot of depth, which is really hard to do with complex ideas, bravo!

michaelc
Автор

no clickbait, good video quality, good sound, relative nice topics for some people, but 16k subs Excuse me, wtf

glorytoarstotzka
Автор

Seriously great, I'd love to see an updated video with the newest research!

DeaMikan
Автор

Holy shit. I have been working on this problem for months and to see that professionals are getting almost the exact same answers as me is pretty cool. There are a whole bunch if ideas in here I have not tried yet as well. Super useful

timonix
Автор

Ah dude just discovered your videos!! Just what I needed. Can't believe, have 6 year degree in engineering, work in AI and I can still learn from YouTube.mad when you think out it. It's a new paradigm of education

Frankthegravelrider
Автор

AMAZING! The prediction-reward algorithm in the first mentioned paper is very similar to how humans learn, at least based on a computational neurobiology course I took in college.

thomasbao
Автор

It's always interesting to see how ideas around curiosity have taken off in reinforcement learning (I think about the "Never give up" paper and atari57

adrienforbu
Автор

This a very effective strategy for personal productivity as a programmer with ADHD.
I augment my unreliable reward-signaling system with Test-Driven Development.

armorsmith
Автор

I was really touched by the ending of the video. We need research on models and the social-economic consequences of the AI models...and I don't mean that terminator, Butlerian jihad crap. I mean human side: job losses, bias, morality, misuse...etc

minos
Автор

This is a marvel. I read a book with similar content, and it was a marvel to behold. "The Art of Saying No: Mastering Boundaries for a Fulfilling Life" by Samuel Dawn

Matthew
Автор

Love your passion for what you are talking about!

henningyt
Автор

Fantastic video! Making me gain time and in an enjoyable way.
Many thanks

pasdavoine
Автор

Maan it is so damn interesting and good video. I come from a completely different area - game development. And I wanted to understand some basics of A.I because I really want to dive deep into this to eventually teach for example rocket to fly, flappy bird to jump, snake to play efficiently.
Reading papers is really difficult without knowledge of some basics, and the way you explained all these things is so good. I still don't understand the terminology and all these formulas, but at least I got one step closer :)
Thank you for this brilliant video :)

skyheart-abc
Автор

if it wasn't for this channel I'd have never have known it wasn't pronounced 'ark-ziv'

DjChronokun
Автор

Recommend to read "Curiosity-driven Exploration by Self-supervised Prediction"
it's really awesome paper.

lukaslorenc
Автор

Really happy to find your channel, really sad to find out few videos in it.

Leibniz_
Автор

So "HER" basically starts off as "if I do this action, I can get to this goal", and then gradually learns how to flip the statement to "if I want to get to this goal, I need to do this action". Pretty nice.

vadrif-draco
Автор

I would like to see a video on meta reinforcement learning. Its an exciting field now!

satyaprakashdash
Автор

Always impressing and I never get bored watching your videos. Good job and keep it up 👍

mohammadhatoum