Lecture 4: Optimization

Lecture 4 discusses optimization algorithms that are used to minimize loss functions discussed in the previous lecture. We introduce the core algorithm of gradient descent, and contrast numeric and analytic approaches to computing gradients. We discuss extensions to the basic gradient descent algorithm including stochastic gradient descent (SGD) and momentum. We also discuss more advanced first-order optimization algorithms such as AdaGrad, RMSProp, and Adam, and briefly discuss second-order optimization.
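
As a rough reference, here is a minimal NumPy sketch of the first-order update rules the lecture covers (SGD, SGD+Momentum, AdaGrad, RMSProp, Adam); the function names, signatures, and default hyperparameters below are illustrative assumptions, not taken from the course code.

import numpy as np

def sgd(w, dw, lr=1e-2):
    # Vanilla gradient descent: step against the gradient.
    return w - lr * dw

def sgd_momentum(w, dw, v, lr=1e-2, rho=0.9):
    # Keep a velocity that smooths the descent direction over steps.
    v = rho * v + dw
    return w - lr * v, v

def adagrad(w, dw, grad_sq, lr=1e-2, eps=1e-8):
    # Per-parameter step sizes from the running sum of squared gradients.
    grad_sq = grad_sq + dw * dw
    return w - lr * dw / (np.sqrt(grad_sq) + eps), grad_sq

def rmsprop(w, dw, grad_sq, lr=1e-2, decay=0.99, eps=1e-8):
    # Like AdaGrad, but a leaky average keeps the step from shrinking to zero.
    grad_sq = decay * grad_sq + (1 - decay) * dw * dw
    return w - lr * dw / (np.sqrt(grad_sq) + eps), grad_sq

def adam(w, dw, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    # Momentum on the gradient (m) plus an RMSProp-style second moment (v),
    # with bias correction for the first few steps (t starts at 1).
    m = beta1 * m + (1 - beta1) * dw
    v = beta2 * v + (1 - beta2) * dw * dw
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    return w - lr * m_hat / (np.sqrt(v_hat) + eps), m, v

# Tiny usage example: minimize f(w) = ||w||^2 with plain SGD.
w = np.array([3.0, -2.0])
for _ in range(100):
    dw = 2 * w  # analytic gradient of ||w||^2
    w = sgd(w, dw, lr=0.1)
print(w)  # approaches [0, 0]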

_________________________________________________________________________________________________

Computer Vision has become ubiquitous in our society, with applications in search, image understanding, apps, mapping, medicine, drones, and self-driving cars. Core to many of these applications are visual recognition tasks such as image classification and object detection. Recent developments in neural network approaches have greatly advanced the performance of these state-of-the-art visual recognition systems. This course is a deep dive into details of neural-network based deep learning methods for computer vision. During this course, students will learn to implement, train and debug their own neural networks and gain a detailed understanding of cutting-edge research in computer vision. We will cover learning algorithms, neural network architectures, and practical engineering tricks for training and fine-tuning networks for visual recognition tasks.

Comments

As a CS grad student myself, I've sat through many lectures. This professor is really, really good.

jaeen

Great lecture again, even though I did not understand anything.

muhammetcavus

Love these lectures.
Thanks man, you are an amazing professor.

mohammadvahidi

Animations:
33:41 SGD
38:06 SGD + Momentum
45:05 Nesterov
50:23 RMSProp
55:27 Adam

baskaisimkalmamisti

This is the best lecture on optimization, with very clear explanations of SGD, SGD+Momentum, AdaGrad, RMSProp, and Adam.
If you have any doubts, I would suggest watching this alongside the Andrew Ng Deep Learning Specialization lectures to get a clearer picture of optimization.

VikasKM

Absolutely brilliant!!
Since you went through the details (instead of skipping over them), I finally understood how derivatives are taken in ML – Kudos!! 😊

alexanderfrei

Thanks a lot, professor. I would be grateful if you added a video course on proximal gradient methods.

riadelectro

Great lecture, I came back to watch it again.

changjuanjing

Great lecture, Justin. Wonder what you think about MADGRAD?

BoTian

25:06 Isn't the loss in SGD computed for a single example? What difference would SGD have over minibatch gradient descent then?

anishmanandhar

Please fix the audio in the next recordings...

syed