Lecture 7: Convolutional Networks

preview_player
Показать описание
Lecture 7 moves from fully-connected to convolutional networks by introducing new computational primitives that respect the spatial structure of 2D image data. We discuss convolution layers, which slide a learnable filter over the input data. We discuss pooling layers, which spatially downsample their input data. We then look at normalization layers including batch, layer, and instance normalization, which normalize their input data along different axes and improve training speed.

_________________________________________________________________________________________________

Computer Vision has become ubiquitous in our society, with applications in search, image understanding, apps, mapping, medicine, drones, and self-driving cars. Core to many of these applications are visual recognition tasks such as image classification and object detection. Recent developments in neural network approaches have greatly advanced the performance of these state-of-the-art visual recognition systems. This course is a deep dive into details of neural-network based deep learning methods for computer vision. During this course, students will learn to implement, train and debug their own neural networks and gain a detailed understanding of cutting-edge research in computer vision. We will cover learning algorithms, neural network architectures, and practical engineering tricks for training and fine-tuning networks for visual recognition tasks.

Рекомендации по теме
Комментарии
Автор

Great lecture for free. Thank you Michigan University and professor Justin.

jhjjjj
Автор

Thank you I found answers to the questions that I have been looking for long time

temurochilov
Автор

Amazing!
Pd: Although I am sorry for the guy with the coughing

tatianabellagio
Автор

I have this great package for my university course.❤

alokoraon
Автор

33:10 stride
53:00 batch normalization

hasan
Автор

Thank you for exellent video! But I have a question here, at 1:05:42, after layer normalization, every sample in x has shape 1xD, while μ has shape Nx1. How do you perform the subtraction x-μ?

eurekad
Автор

Thanks for an excellent video Justin!! I had a quick question on how does the conv. filters change the 3d input into a 2d output

puranjitsingh
Автор

at 35:09, the expression for output in case of stride convolution is (W - K + 2P)/S +1...for W=7, K=3, P = (K-1)/2 = 1 & S=2 we get output as (7 - 3 + 2*1)/2 + 1 = 3 +1 = 4 ...however, the slide shows the output as 3x3 instead of 4x4 at the right hand corner... is it correct..?

rajivb
Автор

why they don't use batch norm + layer norm together?

intoeleven
Автор

1:01:30 what did he mean by “fusing BN with FC layer or Conv layer”?

bibiworm
Автор

In Batch Normalization during Test time at 59:52, what are the averaging equations used to average Mean & Std deviation, sigma ..during the lecture some mention is made of exponential mean of Mean vectors & Sigma vectors...please suggest.

rajivb
Автор

sounds like someone was building duplo the entire lecture

magic