Lesson 13: Deep Learning Foundations to Stable Diffusion

We also discuss the importance of the chain rule in calculating the gradient of the mean squared error (MSE) applied to a model, and demonstrate how to use PyTorch to calculate derivatives and simplify the process by creating classes for ReLU and linear functions. We then explore the issues with floating-point math and introduce the log-sum-exp trick to overcome them. Finally, we create a training loop for a simple neural network.
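
The lesson refactors the forward and backward passes into small classes. Here is a minimal sketch in that spirit (not the exact code from the video; the shapes and names below are my own, and gradients are stashed on a `.g` attribute rather than using autograd):

```python
import torch

class Relu():
    def __call__(self, inp):
        self.inp = inp
        self.out = inp.clamp_min(0.)
        return self.out
    def backward(self):
        # gradient of ReLU: 1 where the input was positive, else 0
        self.inp.g = (self.inp > 0).float() * self.out.g

class Lin():
    def __init__(self, w, b):
        self.w, self.b = w, b
    def __call__(self, inp):
        self.inp = inp
        self.out = inp @ self.w + self.b
        return self.out
    def backward(self):
        # chain rule: push the upstream gradient through the matmul
        self.inp.g = self.out.g @ self.w.t()
        self.w.g = self.inp.t() @ self.out.g
        self.b.g = self.out.g.sum(0)

class Mse():
    def __call__(self, inp, targ):
        self.inp, self.targ = inp, targ
        self.out = (inp.squeeze(-1) - targ).pow(2).mean()
        return self.out
    def backward(self):
        # d/dx of mean((x - t)^2) is 2*(x - t)/n
        self.inp.g = 2. * (self.inp.squeeze(-1) - self.targ).unsqueeze(-1) / self.targ.shape[0]

# tiny forward/backward pass on random data
x = torch.randn(16, 10); y = torch.randn(16)
w1, b1 = torch.randn(10, 8), torch.zeros(8)
w2, b2 = torch.randn(8, 1), torch.zeros(1)
layers = [Lin(w1, b1), Relu(), Lin(w2, b2)]
loss_fn = Mse()

out = x
for l in layers: out = l(out)
loss = loss_fn(out, y)
loss_fn.backward()
for l in reversed(layers): l.backward()
print(w1.g.shape, b2.g.shape)  # gradients are stored on the parameter tensors
```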

0:00 - Introduction
2:54 - Linear models & rectified lines (ReLU) diagram
10:15 - Multi-Layer Perceptron (MLP) from scratch
18:15 - Loss function from scratch - Mean Squared Error (MSE)
23:14 - Gradients and backpropagation diagram
31:30 - Matrix calculus resources
33:27 - Gradients and backpropagation code
38:15 - Chain rule visualized + how it applies
49:08 - Using Python’s built-in debugger
1:00:47 - Refactoring the code

Comments

Great, very enlightening. I liked the small details too, thank you!

michaelmuller

That e^a trick shows that, even though algebra is such a pain, it comes in handy so often to make things run smoothly. Reminds me of the trick to avoid overflow in binary search: mid = low + ((high - low) / 2).

My favorite thing about these lectures is the small hints for math and Python along the way. Thanks for being so detail-oriented!

mattst.hilaire
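
For reference, the "e^a trick" mentioned above is the max-subtraction that keeps log-sum-exp from overflowing: since log Σ e^(x_i) = a + log Σ e^(x_i − a), choosing a = max(x) keeps every exponent at or below zero. A minimal sketch (variable names are my own):

```python
import torch

def logsumexp(x):
    # naive exp(x).sum().log() overflows once any x_i is large (exp(89.) is already inf in float32)
    a = x.max(-1, keepdim=True).values
    # factor out e^a: log(sum(exp(x))) = a + log(sum(exp(x - a)))
    return a.squeeze(-1) + (x - a).exp().sum(-1).log()

x = torch.tensor([[1000., 1001., 1002.]])
print(x.exp().sum(-1).log())      # naive: inf
print(logsumexp(x))               # stable: ~1002.41
print(torch.logsumexp(x, -1))     # matches PyTorch's built-in
```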

When we compare the result of the softmax with the one-hot vector (at 1:21:00), we take only the value of the softmax where the one-hot vector is one. Isn't this a missed opportunity to incorporate the other "wrong" predictions into the loss function? E.g. if the model is highly confident in its prediction for some other wrong class (e.g. numbers that look similar), then penalising that more heavily could further speed up training?

markozege
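
On the question above: because softmax normalizes over all classes, extra confidence on a wrong class necessarily shrinks the probability picked out by the one-hot index, so the negative log likelihood already grows when the model is confidently wrong. A small numeric check (a sketch, not code from the lesson):

```python
import torch
import torch.nn.functional as F

targ = torch.tensor([0])                   # correct class is index 0
mild  = torch.tensor([[2.0, 1.0, 1.0]])    # little confidence in the wrong classes
harsh = torch.tensor([[2.0, 5.0, 1.0]])    # very confident in wrong class 1

print(F.cross_entropy(mild, targ))   # ~0.55
print(F.cross_entropy(harsh, targ))  # ~3.07 -- the confident wrong prediction is already penalised more
```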