01L – Gradient descent and the backpropagation algorithm

Speaker: Yann LeCun

Chapters
00:00:00 – Supervised learning
00:03:43 – Parametrised models
00:07:23 – Block diagram
00:08:55 – Loss function, average loss
00:12:23 – Gradient descent
00:30:47 – Traditional neural nets
00:35:07 – Backprop through a non-linear function
00:40:41 – Backprop through a weighted sum
00:50:55 – PyTorch implementation
00:57:18 – Backprop through a functional module
01:05:08 – Backprop through a functional module
01:12:15 – Backprop in practice
01:33:15 – Learning representations
01:42:14 – Shallow networks are universal approximators!
01:47:25 – Multilayer architectures == compositional structure of data
Comments

Thanks for posting these! With this, you reach a very wide audience and help anyone who does not have access to such teachers and universities! 👏

AICoffeeBreak

You're doing a massive favor to the community that wants access to high-quality content without paying a huge amount of money. Thank you so much!

makotokinoshita

From seeing Yann's name in a research paper during a literature survey in my internship program to attending his lectures is really a thrill. Quite enriching and mathematically profound stuff here. Thanks for sharing it for free!

sutirthabiswas

Wow! Yann is such a great teacher. I thought I knew this material fairly well, but Yann is enriching my understanding with every slide. It seems to me that his teaching method is extremely efficient. I suppose that's because he has such a deep understanding of the material.

dr.mikeybee

Thanks very much for the content. What a time to be alive, to hear from the master himself.

johnhammer

I can totally see how a quantum computer could be used to perform gradient descent in all directions simultaneously, helping to find the true global minimum across all valleys in one go! 😲 It's mind-blowing to think about the potential for quantum computing to revolutionize optimization problems like this!

OpenAITutor

You are a great man. Thanks to you, someone even in a third-world country can learn DL from one of its inventors himself. THIS IS CRAZY!

thanikhurshid

At 1:05:40 Yann is explaining the two Jacobians, but I was having trouble getting the intuition. Then I realized that the first Jacobian was getting the gradient to modify the weights w[k+1] for function z[k+1], and the second Jacobian was backpropagating the gradient to function z[k], which can then be used to calculate the gradient at k for yet another Jacobian to adjust the weights w[k]. So one Jacobian is for the parameters and the other is for the state, since both the parameter variable and the state variable are column vectors. Yann explains it really well. I'm amazed that I seem to be understanding this complicated mix of symbols and logic. Thank you.

dr.mikeybee
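A minimal PyTorch sketch of the point made in the comment above, assuming a single linear module z[k+1] = W·z[k] (the names W, z, and dL_dz_next are illustrative, not taken from the lecture slides): one Jacobian-vector product sends the gradient to the parameters, the other sends it back to the previous state.

```python
import torch

# One "functional module": z_next = W @ z, with z the state and W the parameters.
W = torch.randn(3, 4, requires_grad=True)   # parameters w[k+1]
z = torch.randn(4, requires_grad=True)      # state z[k] coming from the layer below

z_next = W @ z            # state z[k+1]
loss = z_next.sum()       # stand-in for the rest of the network and the loss
loss.backward()

# dL/dz_next is a vector of ones here; backward() applies two Jacobian-vector products:
dL_dz_next = torch.ones(3)
print(torch.allclose(W.grad, torch.outer(dL_dz_next, z.detach())))  # gradient w.r.t. the parameters
print(torch.allclose(z.grad, W.detach().T @ dL_dz_next))            # gradient sent back to the state z[k]
```

Both products happen inside the single backward() call, which is why each module only needs to know its own two Jacobians.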

I just can't believe this content is free. Amazing! Long live open source! Thanks, Alfredo :)

neuroinformaticafbf

It is my honor to learn from you, Sir...

mahdiamrollahi

Mehn!! These are gold, especially for people who don't have access to these kinds of teachers, these methods of teaching, and the material, etc. (that's a lot of people, actually).

fuzzylogicq

I have watched this lecture twice in the last year. Mister LeCun is great! :)

copuzvv

Thank you so much for sharing this 🥰 This was the best video for learning gradient descent and backpropagation.

monanasery

I really love that discussion about solving non-convex problems... finally we get out of the books! At least we unleash our minds.

alexandrevalente

Thank you so much, Alfredo, for organizing the material in such a nice and compact way for us! Yann's insights and your examples, explanations, and visualizations are an awesome tool for anybody willing to learn (or to remember stuff) about deep learning. Greetings from Greece; I owe you a coffee for your tireless effort.

PS. Sorry for my bad English. I am not a native speaker.

mpalaourg

This intimate atmosphere allows for a better understanding of the subject matter. Great questions 【ツ】 and of course great answers. Thank you

mataharyszary

Alfredo Canziani ... drinks are on me if you ever visit India ... this is extremely high quality content!

gurdeeepsinghs

I don't know if this helps anyone, but it might. Weighted sums like s[0] are always to the first power; there are no squared or cubed weighted sums. So, by the power rule, the derivative of nx to the first power is just n, and the derivative of w·s[0] with respect to s[0] is always the weight w. That's why the application of the chain rule is so simple. Here's some more help: if y = 2x, y' = 2; if q = 3y, q' = 3; so (q(y(x)))' = 2 * 3. Picture the graph of q(y(x)). What is the slope? It's 6. And however many layers you add to a neural net, the partial slopes will be products of the weights.

dr.mikeybee
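A tiny autograd check of the slope argument above (the numbers are only for illustration): composing y = 2x with q = 3y gives a slope of 2 · 3 = 6, and the gradient of a weighted sum with respect to its input is just the weight vector.

```python
import torch

# Composing two linear maps: q(y(x)) = 3 * (2 * x), so dq/dx = 2 * 3 = 6.
x = torch.tensor(5.0, requires_grad=True)
y = 2 * x
q = 3 * y
q.backward()
print(x.grad)                    # tensor(6.)

# The gradient of a weighted sum w · s with respect to s is just the weight vector w.
w = torch.tensor([0.5, -1.0, 2.0])
s = torch.zeros(3, requires_grad=True)
(w * s).sum().backward()
print(torch.equal(s.grad, w))    # True
```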

The discussions of stochastic gradient descent (12:23) and of Adam (1:16:15) are great; they clear up a general misconception.

WeAsBee

Great content! It's just great to have this quality of information available.

jobiquirobi