Gradient Descent algorithm Simplified
- Gradient descent is an optimization algorithm used to find the optimized network weight and bias values
- It works by iteratively minimizing the cost function
- It does this by calculating the gradient of the cost function and moving in the negative gradient direction until a local/global minimum is reached. (Taking the positive gradient direction instead leads to a local/global maximum.)
- The size of the steps taken is controlled by the learning rate. A larger learning rate covers more of the search space per step, so we may reach the global minimum faster; however, we can overshoot the target.
- With a small learning rate, training takes much longer to reach the optimized weight values.
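The learning-rate trade-off described above can be seen on a toy problem. This is a minimal sketch (not from the video), using the one-dimensional loss f(w) = w^2, whose gradient is 2w and whose minimum is at w = 0:

```python
# Sketch: effect of the learning rate on gradient descent for f(w) = w**2.
# The gradient of f is 2*w, so each update is w_new = w_old - lr * 2 * w_old.
def descend(lr, steps=20, w=1.0):
    for _ in range(steps):
        w = w - lr * 2 * w  # move in the negative gradient direction
    return w

print(descend(0.01))  # small lr: 20 steps later, still far from the minimum
print(descend(0.4))   # moderate lr: converges to ~0 quickly
print(descend(1.1))   # too large: each step overshoots, |w| grows (diverges)
```

With lr = 1.1 the step is so large that the update jumps past the minimum to a point farther away than where it started, which is exactly the overshooting failure mode mentioned above.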
GRADIENT DESCENT WORKS AS FOLLOWS:
1. Calculate the gradient (derivative) of the loss function, ∂loss/∂w
2. Pick random starting values for the weights (m, b) and substitute them into the gradient
3. Calculate the step size (how much the parameters will be updated):
step size = learning rate * gradient = α * ∂loss/∂w
4. Update the parameters and repeat:
new weight = old weight − step size
w_new = w_old − α * ∂loss/∂w
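The four steps above can be sketched in code. This is an illustrative example (not from the video): fitting the line y = m*x + b to a few points with a mean-squared-error loss; the data, learning rate, and iteration count are assumptions chosen for the demo.

```python
# Sketch of the gradient descent steps for fitting y = m*x + b (MSE loss).
import random

xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]      # generated by y = 2x + 1

# Step 2: pick random starting values for the weights (m, b)
m, b = random.random(), random.random()
lr = 0.05                      # learning rate (alpha)

for _ in range(2000):
    # Step 1: gradient of the MSE loss with respect to m and b
    n = len(xs)
    dm = sum(2 * (m * x + b - y) * x for x, y in zip(xs, ys)) / n
    db = sum(2 * (m * x + b - y) for x, y in zip(xs, ys)) / n
    # Steps 3-4: step size = lr * gradient; update the weights and repeat
    m -= lr * dm
    b -= lr * db

print(round(m, 2), round(b, 2))  # approaches m = 2, b = 1
```

Note that the gradient is recomputed from the current (m, b) on every iteration, so the "substitute" in step 2 happens implicitly each time through the loop.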
I hope you enjoy this video and find it useful and informative.
Thanks and happy learning!
Ryan
#gradientdescent #deeplearning #machinelearning #AI