Neural Networks Demystified [Part 5: Numerical Gradient Checking]

When building complex systems like neural networks, checking portions of your work can save hours of headache. Here we'll check our gradient computations.
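For reference, the check amounts to perturbing each weight in turn and comparing a central-difference estimate of the gradient against the one from backpropagation. Here is a minimal Python sketch, assuming a network object exposing getParams, setParams, and costFunction roughly as in this series' supporting code:

    import numpy as np

    def compute_numerical_gradient(N, X, y, e=1e-4):
        # Estimate dJ/dW one parameter at a time via central differences.
        params = N.getParams()
        numgrad = np.zeros(params.shape)
        perturb = np.zeros(params.shape)
        for p in range(len(params)):
            perturb[p] = e
            N.setParams(params + perturb)   # J(W + e) for this parameter
            loss2 = N.costFunction(X, y)
            N.setParams(params - perturb)   # J(W - e) for this parameter
            loss1 = N.costFunction(X, y)
            numgrad[p] = (loss2 - loss1) / (2 * e)
            perturb[p] = 0                  # reset before the next parameter
        N.setParams(params)                 # restore the original weights
        return numgrad

The backpropagation gradients pass the check when norm(grad - numgrad) / norm(grad + numgrad) is tiny, typically on the order of 1e-8.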

Supporting code:

In this series, we will build and train a complete Artificial Neural Network in Python. New videos every other Friday.

Part 1: Data + Architecture
Part 2: Forward Propagation
Part 3: Gradient Descent
Part 4: Backpropagation
Part 5: Numerical Gradient Checking
Part 6: Training
Part 7: Overfitting, Testing, and Regularization

@stephencwelch
Comments

Just an amazing resource. There is NO reason we can't communicate complex concepts simply without cutting corners. Thank you for this!

isaaclumpkins

I swear, one day I shall master this art.

bobsmithy

NO!!! I REFUSE TO BE INTIMIDATED BY THIS!! I WILL GET IT!!... eventually

kariuki

I'm a mathematician and people who are either terrible teachers or who were taught by terrible teachers are the bane of my existence. Being able to clearly and methodically break down a problem or method into steps small enough to be clear and logically connected but not so numerous as to require 100 years to communicate is a very rare skill/gift. I've just watched this Neural Networks series of yours and I've got to say it's very good in its quality of presentation and, more importantly, quality of material. +1 subscriber!

SecondQuantisation

Just too good... now I can confidently say that yes, I know neural networks, and it's not mysterious math anymore. Thanks a lot for these videos!


Wow, now neural networks are no longer a mysterious black box to me. Thank you!

SethuIyer

Oh man. I was watching Andrew Ng's course on Coursera and could not get everything. Your explanation is way better...

hiprabhat

These videos are awesome! I'm taking master classes on machine learning and these videos help bring things into perspective whenever I get lost during parts of those classes. Thanks for filling in the blanks ;)

chrisa

Thank you for the great and thorough explanations.
Though, in my opinion, the background music is disturbing.

djelleleddine

Well, so far my neural network works. I just copied all the code from the videos and tried every method in the neural network. I'm especially excited because I actually understand the whole thing.

sephirothjc

I did not realize this was an unfinished series.
*subscribes and awaits next video with bated breath*

kkiller

Thank you so much for your videos. I loved watching them. It would be really helpful if you could please upload the code that you have explained in these videos.

siddrameshwarkadagad

Love the video series. Both incredibly interesting and informative. Looking forward to future uploads!

skipy

Such phenomenal videos. Thank you for this, keep up the great work!

SaviorThree

Can't wait to get to pre-calculus next year... finally will be able to begin to understand this. :P

drelthian

If your output landscape is smooth enough that this would be expected to work, then you can also use an evolutionary algorithm to explore it instead. What I mean is mutation + selection: keep the best result, and decrease the mutation magnitude each time you improve.

Once you've found a local minimum (you decide what constitutes one based on your needs, e.g. 20 or maybe 100 mutations with no improvement), you can store that as a best solution and increase the magnitude of mutation again, then either start mutating from your current best or just select a brand new location. Keep track of your best solution, obviously. You decide how many iterations (local minima found) are worth exploring, or you can have it quit after some number of new attempts fail to improve on the old one.

You might also find that many distant starting points settle on the same minimum, increasing your confidence in the smoothness of your landscape and the quality of your solution. This still removes the "brute force" but allows the computer to take on the remaining grunt work. (A sketch of this loop follows below.)

LiftPizzas
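For concreteness, here is a minimal Python sketch of the mutate-and-select loop described in the comment above; the function name and all constants (step sizes, patience, number of restarts) are illustrative assumptions, not anything from the series:

    import numpy as np

    def evolve(cost, x0, sigma=1.0, patience=20, restarts=5, seed=0):
        # Mutation + selection: keep improvements, shrink the mutation
        # size on each success, declare a local minimum after 'patience'
        # failures in a row, then widen the mutation and go again.
        rng = np.random.default_rng(seed)
        best_x = np.asarray(x0, dtype=float)
        best_cost = cost(best_x)
        for _ in range(restarts):
            x, c, step, stale = best_x.copy(), best_cost, sigma, 0
            while stale < patience:
                candidate = x + rng.normal(0.0, step, size=x.shape)
                cc = cost(candidate)
                if cc < c:
                    x, c, step, stale = candidate, cc, step * 0.9, 0
                else:
                    stale += 1
            if c < best_cost:   # store the best local minimum so far
                best_x, best_cost = x, c
            sigma *= 2.0        # increase mutation magnitude for the next run
        return best_x, best_cost

    # Example on a smooth bowl, f(x) = sum(x**2): should land near zero.
    x_best, c_best = evolve(lambda v: float(np.sum(v ** 2)), np.ones(3))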

Thanks a lot for this set of videos. Could you make another set about convolutional neural networks or other machine learning algorithms?

florianm

At 3:08, I'm confused by the syntax in computeGradients with the colon in front of self.costFunctionPrime; it's giving me an error and I can't figure out why, partly because I'm also confused by the syntax. Could someone explain it to me? Thanks in advance!

KID-jrib
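For anyone hitting the same error: the method most likely just unpacks the two weight gradients returned by costFunctionPrime and flattens them into one vector, so there is no leading colon anywhere and the error is probably a stray character. A sketch of the likely shape of the method, based on the helper methods from earlier parts (treat the details as an assumption):

    import numpy as np

    class Neural_Network(object):
        # forward, costFunction, costFunctionPrime defined as in Parts 2-4.

        def computeGradients(self, X, y):
            # Tuple unpacking on the left-hand side; this comma-separated
            # assignment may be what looked like a colon on screen.
            dJdW1, dJdW2 = self.costFunctionPrime(X, y)
            return np.concatenate((dJdW1.ravel(), dJdW2.ravel()))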

Thank you so much, you really helped me learn this from zero. All the tutorials on the internet just explain the equations used to build the network, but they never apply them to a real case in Python like you do. I just have one doubt: is deep learning the same thing you do in your videos, or does deep learning mean that there are 2 or more hidden layers? Sorry for my bad English, I'm actually from Chile and my English is not so good, haha. Thanks again bro!

albertoherrerahp

Numeric gradient = (f(x + epsilon) - f(x - epsilon)) / (2 * epsilon)

Given that epsilon = 1e-4, we can calculate the numeric gradient of f(x) = x**2 at x = 1.5 as follows:

f(x + epsilon) = (1.5 + 1e-4) ** 2 = 2.25030001
f(x - epsilon) = (1.5 - 1e-4) ** 2 = 2.24970001

Numeric gradient = (2.25030001 - 2.24970001) / (2 * 1e-4) = 6e-4 / 2e-4 = 3

Therefore, the numeric gradient of f(x) = x**2 at x = 1.5 is 3, matching the analytic derivative f'(x) = 2x = 3.

I can't get my code to reproduce this numeric gradient value; could you advise?

nbynvne
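The arithmetic above is easy to verify directly; here is a tiny Python check using only the values from the comment:

    # Central-difference estimate of the derivative of f(x) = x**2 at x = 1.5.
    def f(x):
        return x ** 2

    x, epsilon = 1.5, 1e-4
    numgrad = (f(x + epsilon) - f(x - epsilon)) / (2 * epsilon)
    print(numgrad)  # ~3.0, agreeing with the analytic derivative 2 * x = 3.0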