Gradient Descent algorithm Simplified

preview_player
Показать описание
- Gradient descent is an optimization algorithm used to obtain the optimized network weight and bias values

- It works by iteratively trying to minimize the cost function

- Gradient Descent algorithm works by calculating the gradient of the cost function and move in the negative direction until the local/global minimum is achieved. If the positive of the gradient is taken, local/global maximum is achieved.

- The size of the steps taken are called the learning rate. If learning rate increases, the area covered in the search space will increase so we might reach global minimum faster . However, we can overshoot the target.

- For small learning rates, training will take much longer to reach optimized weight values.

GRADIENT DESCENT WORKS AS FOLLOWS:

1. Calculate the gradient (derivative) of the Loss function 𝝏𝒍𝒐𝒔𝒔/𝝏𝒘

2. Pick random values for weights (m, b) and substitute

3. Calculate the step size (how much are we going to update the parameters?)
𝑺𝒕𝒆𝒑 𝒔𝒊𝒛𝒆 =𝒍𝒆𝒂𝒓𝒏𝒊𝒏𝒈 𝒓𝒂𝒕𝒆∗𝒈𝒓𝒂𝒅𝒊𝒆𝒏𝒕= 𝜶∗𝝏𝒍𝒐𝒔𝒔/𝝏𝒘

4. Update the parameters and repeat
𝒏𝒆𝒘 𝒘𝒆𝒊𝒈𝒉𝒕 = 𝒐𝒍𝒅 𝒘𝒆𝒊𝒈𝒉𝒕 –𝒔𝒕𝒆𝒑 𝒔𝒊𝒛𝒆
𝒘_𝒏𝒆𝒘=𝒘_(𝒐𝒍𝒅 )−𝜶∗𝝏𝒍𝒐𝒔𝒔/𝝏𝒘

I hope you will enjoy this video and find it useful and informative.

Thanks and happy learning!
Ryan

#gradientdescent #deeplearning #machine learning #AI
Рекомендации по теме
Комментарии
Автор

Read so many books but Ryan is the only one who explains best, totally understand, thank u

Sakamoto-waAu
Автор

You saved my life and I am saying that in the literal sense.

rockus
Автор

I really love the way you explain these things. Very easy to understand. Thanks!

ExplanationNext
Автор

I'm sad that you eanred only 13 likes with mine of cource. You are explaining very well everything and you are making the complex simple. Continue your good work!!!! Congratulatons not for the knowlege (This can have anyone) but much more for how you explain everything!!!!

dimitrioskyriakos
Автор

Sir I taken your courses in udemy those are amazing sir it helped me to learn clearly. If possible make 1 course on opencv with deep learning like object detection etc..
It helpful alot

subhashachutha
Автор

Sir from escaping local minima there is only option by learning rate or anything is there??

subhashachutha