Reinforcement Learning Course: Intro to Advanced Actor Critic Methods

preview_player
Показать описание
Actor critic methods are used in machine learning. They are most useful for applications in robotics as they allow us to output continuous, rather than discrete actions. This enables control of electric motors to actuate movement in robotic systems, at the expense of increased computational complexity.

💻 Code for the algorithms covered:

⭐️ Course Contents ⭐️
⌨️ (0:00:00) Intro
⌨️ (0:04:03) Actor Critic (TF2)
⌨️ (0:44:50) DDPG (TF2)
⌨️ (1:52:36) TD3 (TF2)
⌨️ (3:08:29) PPO (PyTorch)
⌨️ (4:03:16) SAC (TF2)
⌨️ (5:09:28) A3C (PyTorch)

⭐️ Software requirements ⭐️
Python 3.x
box2d-py 2.3.8
gym 0.15.4
matplotlib 3.1.1
numpy 1.18.1
pybullet 2.8.5
torch 1.4.0
tensorflow-gpu 2.3.1

🎉 Thanks to our Champion and Sponsor supporters:
👾 Wong Voon jinq
👾 hexploitation
👾 Katia Moran
👾 BlckPhantom
👾 Nick Raker
👾 Otis Morgan
👾 DeezMaster
👾 Treehouse

--

Рекомендации по теме
Комментарии
Автор

Hey I know that guy! Any questions, please leave them down below!

MachineLearningwithPhil
Автор

Guys we should literally donate to this channel once Hired, is more useful than most universities

rachadelmoutaouaffiq
Автор

Man this channel is a goldmine 😂

Nothing new though as this is not the first course I saw here. This course is going to be very helpful for me. Thank you for the work you guys are putting into teaching people like me.

P.s. double thanks for the nextjs course as well. It was very helpful.

BlurryBit
Автор

This is why the computer science and software engg field is so successful and growing so quickly. We keep everything open source and freely available to anyone willing to learn. That’s so rare these days. There are so many other fields that lock up their knowledge in university courses and paywalls.

kiran
Автор

Personally like the style of few slides, no BS, no nothing Sir, straight to the coding. Strong work.

AungBaw
Автор

As a first viewer and a young coder I love code camp well my age is 12 and in picture he is my dad so don’t be confused

vpundir
Автор

This was exactly what I wanted to learn. Thank you

ketchupparty
Автор

This channel is awesome. Its content and support is beyond any words.. Thank you so much for all the quality content Team.

kbhaskar
Автор

Wow, I just turned in my project with an actor critic algorithm THIS WEEK.
-__-
*cries

cescabhi
Автор

Oohhh sweet! Machine Learning with Phil is awesome!

trentonspears
Автор

I thought prob_ratio must equal to one if we replay the same action as the actor is updated after replay . am I right?

tarifcemay
Автор

Thank you for the awesome video. Can you please characterize all the DRL models? If possible.

Falconoo
Автор

Thank you so much for putting an effort to do the whole implementation which is relatively bit easier to grasp than the paper. I am very new to RL and I have a rather weird question(cause no one actually addressed but ignore if I am being stupid), so when for the first time you call the learn function after doing 20 steps, wouldn't the new_probs be equal to the old_probs, because essentially the neural network didn't learn anything so would both these values be random until like several iteration? And if actually they would be random, how is the agent learning?

smitasingh
Автор

Hi everyone. Could you recommend any paper or longer discussion about the limitations of actor critic models for continuous space

attilasarkany
Автор

Are these methods suggested for NLP tasks such as Text classification?

rwyodfc
Автор

لماذا لا توجد ترجمه مصاحبة لهذا الفيديو

mohamednasrel-dinazouzmoha
Автор

Please Turn On the Auto English subtitle. I am not English native

ayarzuki
Автор

I have an actor critic algorithm. I want you to implement the python code. I will pay for that.

tharunkumar
Автор

I thought it was @wojespn for a second

saiiyengar
Автор

this is just a compilation of pre existing videos, this should have been clarified

kkyars