I've Been Doing This Wrong The Whole Time ... The Right Way to Save Models In PyTorch

It turns out I wasn't saving PyTorch models correctly. You really need to save the optimizer state along with the current weights of your deep neural network. This is critical for getting deep Q learning and actor-critic agents trained on complex environments that may require multiple sessions spread over time.
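For concreteness, here is a minimal sketch of the idea: bundle the network's state_dict and the optimizer's state_dict into a single checkpoint file. The function and file names below are placeholders, not necessarily the ones used in the starter code.

```python
import torch as T

# Minimal checkpoint sketch: save the optimizer's internal state (e.g.
# Adam's running moment estimates) alongside the network weights so a
# training run can be resumed later without resetting the optimizer.

def save_checkpoint(model, optimizer, filename='checkpoint.pt'):
    T.save({
        'model_state_dict': model.state_dict(),
        'optimizer_state_dict': optimizer.state_dict(),
    }, filename)

def load_checkpoint(model, optimizer, filename='checkpoint.pt'):
    checkpoint = T.load(filename)
    model.load_state_dict(checkpoint['model_state_dict'])
    optimizer.load_state_dict(checkpoint['optimizer_state_dict'])
```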

Starter code for this video is here:

Learn how to turn deep reinforcement learning papers into code:

Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gets you 42 hours of instructional content plus future updates, added weekly.

Or, pick up my Udemy courses here:

Deep Q Learning:

Actor Critic Methods:

Curiosity Driven Deep Reinforcement Learning:

Natural Language Processing from First Principles:

Just getting started in deep reinforcement learning? Check out my intro level course through Manning Publications.

Reinforcement Learning Fundamentals

Here are some books / courses I recommend (affiliate links):

Come hang out on Discord here:

Comments

Many thanks for updating us. I really appreciate you providing us with such wonderful courses.

forough

Hey Phil, how do you save/reload the seed for numpy, env.seed, env.action_space.seed, T.cuda.manual_seed_all and other related seed settings? Say I need to run a long training job but have to switch over to different machines (very likely to happen on HPC). Could you please provide some advice on this? Thanks in advance.

adamjiang
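One way to approach this (a sketch of a common pattern, not something from the video): rather than trying to recover the seed itself, checkpoint the RNG states and restore them on the new machine. The function names here are my own, and the gym environment's internal RNG generally can't be captured this way, so re-seeding env and env.action_space after the move is the usual compromise.

```python
import random
import numpy as np
import torch as T

# Sketch: capture/restore RNG states so a run can continue on another
# machine. CUDA RNG state is per-device, so exact reproducibility also
# requires the same GPU setup on both machines.

def get_rng_states():
    states = {
        'python_random': random.getstate(),
        'numpy': np.random.get_state(),
        'torch_cpu': T.get_rng_state(),
    }
    if T.cuda.is_available():
        states['torch_cuda'] = T.cuda.get_rng_state_all()
    return states

def set_rng_states(states):
    random.setstate(states['python_random'])
    np.random.set_state(states['numpy'])
    T.set_rng_state(states['torch_cpu'])
    if T.cuda.is_available() and 'torch_cuda' in states:
        T.cuda.set_rng_state_all(states['torch_cuda'])
```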

Thanks for the update. I wonder how you would go about loading the model, once trained, for "testing"? I've tried, for example, loading the model state into q_eval, setting q_eval to ".eval()" mode, using "with torch.no_grad()", then getting the predictions with "model(observation)", but the model/agent doesn't perform as it did during training (it does really badly in comparison).

jose-alberto-salazar-jimenez
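A hedged sketch of a greedy evaluation loop, assuming a DQN-style agent with a q_eval network and an epsilon attribute, and the classic gym step API; these names are assumptions, not necessarily the course code. The most common causes of a large train/test gap are evaluating with epsilon still near 1 (mostly random actions) or preprocessing observations differently than during training.

```python
import torch as T

# Sketch of one greedy evaluation episode for a trained DQN-style agent.
# Assumes env.step returns (obs, reward, done, info) and that the agent
# exposes q_eval and epsilon; adjust the names to match your own code.

def evaluate_episode(agent, env, checkpoint_file='q_eval.pt'):
    agent.q_eval.load_state_dict(T.load(checkpoint_file))
    agent.q_eval.eval()
    agent.epsilon = 0.0  # act greedily w.r.t. the learned Q-values

    observation = env.reset()
    done, score = False, 0.0
    while not done:
        with T.no_grad():
            state = T.tensor([observation], dtype=T.float)
            action = T.argmax(agent.q_eval(state)).item()
        observation, reward, done, info = env.step(action)
        score += reward
    return score
```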

Dear Phil,
May I ask why q_next[done] = 0? Isn't q_next a single value?

forough
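For anyone else wondering: in the batched version of the update, q_next is not a single value but one value per transition in the minibatch, and done is a boolean mask of the same length, so q_next[done] = 0 zeroes out the future return only for terminal transitions. A small illustrative example (numbers made up):

```python
import torch as T

# Batched DQN target sketch: one entry per transition in the minibatch.
gamma = 0.99
rewards = T.tensor([1.0, 0.0, 1.0, -1.0])
done = T.tensor([False, True, False, True])   # episode-terminal flags
q_next = T.tensor([2.5, 3.0, 1.2, 0.7])       # max_a' Q(s', a') per transition

q_next[done] = 0.0                            # no future value after a terminal state
q_target = rewards + gamma * q_next
print(q_target)  # tensor([ 3.4750,  0.0000,  2.1880, -1.0000])
```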

Hi, thanks for the video. Can you explain what the benefits of saving the optimizer state are?

meowatthemoon
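For context, a small sketch of what gets lost if you save only the weights: optimizers like Adam keep per-parameter running statistics (first/second moment estimates and a step count), and re-creating the optimizer from scratch on reload resets those, which effectively restarts the adaptive learning rates mid-training.

```python
import torch as T
import torch.nn as nn

# Sketch: inspect the per-parameter state Adam accumulates during training.
model = nn.Linear(4, 2)
optimizer = T.optim.Adam(model.parameters(), lr=1e-3)

loss = model(T.randn(8, 4)).sum()
loss.backward()
optimizer.step()  # populates the optimizer's internal state

param_state = optimizer.state_dict()['state']
print(list(param_state[0].keys()))  # ['step', 'exp_avg', 'exp_avg_sq'] for default Adam
```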

Hey Phil, good to see you.
What are you working on, RL or anything else?

markadyash

I did horribly on all of this organizational shit for my bachelor thesis; next time I wanna be organized from the start. Gonna watch what a bunch of different experienced people are doing.

MrCmon

What are your PC specs? Training is really fast.

killereks

Thanks for the video. Please check your email.

fastaalemyapanadam