L12.4 Adam: Combining Adaptive Learning Rates and Momentum



-------

This video is part of my Introduction to Deep Learning course.

-------
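As a quick orientation for the topic in the title, here is a minimal sketch (not from the video) of the Adam update for a single scalar parameter: the first-moment estimate m plays the role of momentum, and the second-moment estimate v supplies the per-parameter adaptive step size. The function name and the defaults (lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8) follow the original Adam paper, not necessarily the lecture's notation.

import math

def adam_step(theta, g, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    # One illustrative Adam update for a single scalar parameter.
    m = beta1 * m + (1 - beta1) * g          # momentum: running mean of gradients
    v = beta2 * v + (1 - beta2) * g ** 2     # adaptive part: running mean of squared gradients
    m_hat = m / (1 - beta1 ** t)             # bias correction for the zero-initialized moments
    v_hat = v / (1 - beta2 ** t)
    theta = theta - lr * m_hat / (math.sqrt(v_hat) + eps)  # per-parameter scaled step
    return theta, m, v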

Comments

With the general idea of adaptive learning rates, how do we determine whether we're going in the right or the wrong direction? That is, how do we know whether to add "B" to the local gain or to multiply it by (1-B)?

sergeykurk
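For context, a minimal sketch of the sign-based local-gain heuristic the comment above describes (my paraphrase, not the lecture's exact code): the direction check is simply whether the current gradient has the same sign as the previous one. The names gain, grad, prev_grad, and beta are illustrative.

def update_gain(gain, grad, prev_grad, beta=0.05):
    # Same sign as the previous gradient: progress looks consistent,
    # so grow the local gain additively.
    if grad * prev_grad > 0:
        return gain + beta
    # Sign flip: we likely overshot, so shrink the gain multiplicatively.
    return gain * (1 - beta)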

Hi, great course. The link for L12.5 is wrong on your website (blog).

danielgurgel

Thank you very much.
I have a question. In my task the number of features can change, so I have to build an NN model where the first layer is updated at runtime but the other layers are not (something like transfer learning, but without freezing the other layers). In that case, should I initialize the optimizer again? I wrote the relevant part of the code below.
Thank you.

second_layer_input_size = int(model.fc1.out_features)  # input size of the second layer
model.fc1 = nn.Linear(new_features_len, second_layer_input_size)  # replace the first layer at runtime to accept more features
model.to(device)  # move to device again; without it, it doesn't work. Does this mean the already trained weights are lost?
optimizer = optim.Adam(params=model.parameters(), lr=learning_rate)  # should I do this?
features_len = new_features_len
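One thing worth noting about the snippet above: re-creating the optimizer also resets Adam's running moment estimates for every layer, not only the replaced one. A possible alternative, sketched here purely as an assumption (not advice from the video), is to keep the existing optimizer and register only the freshly created fc1 parameters as an extra parameter group. The old fc1 parameters stay registered but no longer receive gradients, so Adam simply skips them.

model.fc1 = nn.Linear(new_features_len, second_layer_input_size)
model.to(device)
# Keep the old optimizer (and its Adam state for the unchanged layers) and
# only tell it about the new first-layer parameters:
optimizer.add_param_group({"params": model.fc1.parameters()})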


Nice tutorial, thanks for sharing! In your experience, is it redundant to use a learning rate scheduler with Adam?

hungerbear
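On the scheduler question above: Adam adapts per-parameter step sizes, but its base learning rate is still a fixed hyperparameter, so schedulers are commonly combined with it. Below is a minimal PyTorch sketch, with a toy model, dummy loss, and schedule chosen only for illustration (not taken from the video).

import torch
from torch import nn, optim

model = nn.Linear(10, 1)                                   # toy model
optimizer = optim.Adam(model.parameters(), lr=1e-3)
scheduler = optim.lr_scheduler.StepLR(optimizer, step_size=10, gamma=0.5)  # halve base lr every 10 epochs

for epoch in range(30):
    optimizer.zero_grad()
    loss = model(torch.randn(4, 10)).pow(2).mean()         # dummy loss for illustration
    loss.backward()
    optimizer.step()
    scheduler.step()                                       # decay the base learning rate Adam scales from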

I'm learning ML. I found that most videos spend a lot of time explaining complex equations. Actually, those equations are easy to understand if you plug in some numbers. 🙂 Not sure what happened in this area.

RayGuo-bonr
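Taking the suggestion above literally, here is a hypothetical worked example that plugs concrete numbers into the Adam update sketched near the top of the page, for the very first step (t = 1), using the common defaults and a made-up gradient of 0.5.

import math

lr, beta1, beta2, eps = 0.001, 0.9, 0.999, 1e-8
g, m, v, t = 0.5, 0.0, 0.0, 1                 # made-up gradient; both moments start at zero

m = beta1 * m + (1 - beta1) * g               # 0.05
v = beta2 * v + (1 - beta2) * g ** 2          # 0.00025
m_hat = m / (1 - beta1 ** t)                  # 0.5  (bias correction undoes the zero init)
v_hat = v / (1 - beta2 ** t)                  # 0.25
step = lr * m_hat / (math.sqrt(v_hat) + eps)  # ~0.001, roughly lr * sign(g) on step 1
print(step)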