Lesson 11 (2019) - Data Block API, and generic optimizer

We start lesson 11 with a brief look at a smart and simple initialization technique called Layer-wise Sequential Unit Variance (LSUV). We implement it from scratch, and then use the methods introduced in the previous lesson to investigate the impact of this technique on our model training. It looks pretty good!
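
As a rough illustration, here is a minimal PyTorch sketch of the idea (a simplification under assumed names — `lsuv_init` and the single-layer demo are ours, not the lesson's exact code): run a real batch through the model, shift the layer's bias until its activations have roughly zero mean, then rescale its weights until they have roughly unit standard deviation.

```python
import torch
import torch.nn as nn

def lsuv_init(model, layer, xb, tol=1e-3):
    # Forward hook that records the layer's activation statistics.
    def hook(m, inp, out):
        hook.mean, hook.std = out.mean().item(), out.std().item()
    handle = layer.register_forward_hook(hook)
    with torch.no_grad():
        model(xb)
        while abs(hook.mean) > tol:      # shift the bias until mean ~ 0
            layer.bias.data -= hook.mean
            model(xb)
        while abs(hook.std - 1) > tol:   # rescale the weights until std ~ 1
            layer.weight.data /= hook.std
            model(xb)
    handle.remove()

model = nn.Sequential(nn.Linear(10, 50), nn.ReLU(), nn.Linear(50, 1))
xb = torch.randn(64, 10)
lsuv_init(model, model[0], xb)  # initialize the first layer from real data
```

Note that the order of the two loops matters a little: rescaling the weights afterwards can nudge the mean slightly away from zero again, which comes up in the comments below.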

Then we look at one of the jewels of fastai: the Data Block API. We already saw how to use this API in part 1 of the course; now we learn how to create it from scratch, and in the process we'll also learn a lot about how to use and customize it better. We'll look closely at each step:

- Transformations: we create a simple but powerful list container and function-composition mechanism to transform data on the fly
- Split and label: we create flexible functions for each
- DataBunch: we'll see that `DataBunch` is a very simple container for our `DataLoader`s (a rough sketch of these pieces follows below)
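
To give a flavor of how little machinery is involved, here is a rough sketch of those three pieces (simplified from the course notebooks; names such as `split_by_func` are illustrative rather than the lesson's exact API):

```python
from torch.utils.data import DataLoader

def compose(x, funcs, **kwargs):
    # Apply a list of transforms in order; items are transformed lazily,
    # one at a time, as they are fetched.
    for f in funcs:
        x = f(x, **kwargs)
    return x

def split_by_func(items, is_valid):
    # Flexible split: any predicate can decide which items are validation.
    train = [o for o in items if not is_valid(o)]
    valid = [o for o in items if is_valid(o)]
    return train, valid

class DataBunch:
    # Just a thin container around the training and validation DataLoaders.
    def __init__(self, train_dl, valid_dl):
        self.train_dl, self.valid_dl = train_dl, valid_dl

    @property
    def train_ds(self): return self.train_dl.dataset

    @property
    def valid_ds(self): return self.valid_dl.dataset

# e.g. data = DataBunch(DataLoader(train_ds, batch_size=64, shuffle=True),
#                       DataLoader(valid_ds, batch_size=128))
```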

Next up, we build a new `StatefulOptimizer` class, and show that nearly all optimizers used in modern deep learning training are just special cases of this one class. We use it to implement weight decay, momentum, Adam, and LAMB, and take a detailed look at how momentum changes training.
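
To make that concrete, here is a hedged sketch of such a class, using plain functions for the "stats" (which maintain per-parameter state such as momentum buffers) and the "steppers" (which use that state to update the weights); the course notebooks use a slightly richer design, and the names here are illustrative:

```python
import torch

class StatefulOptimizer:
    def __init__(self, params, steppers, stats=None, **hypers):
        self.params = list(params)
        self.steppers, self.stats = steppers, stats or []
        self.state, self.hypers = {}, hypers

    def step(self):
        for p in self.params:
            if p.grad is None: continue
            state = self.state.setdefault(p, {})
            for stat in self.stats:              # update running statistics
                state = stat(state, p, **self.hypers)
            for stepper in self.steppers:        # then update the parameter
                stepper(p, state=state, **self.hypers)
            self.state[p] = state

    def zero_grad(self):
        for p in self.params:
            if p.grad is not None:
                p.grad.detach_()
                p.grad.zero_()

def average_grad(state, p, mom, **kw):
    # Stat: exponentially weighted moving average of the gradients.
    if 'grad_avg' not in state:
        state['grad_avg'] = torch.zeros_like(p.grad)
    state['grad_avg'].mul_(mom).add_(p.grad)
    return state

def weight_decay_step(p, lr, wd, **kw):
    # Stepper: decoupled weight decay, shrinking the weights directly.
    p.data.mul_(1 - lr * wd)

def momentum_step(p, lr, state, **kw):
    # Stepper: SGD-with-momentum update using the stored average.
    p.data.add_(state['grad_avg'], alpha=-lr)

model = torch.nn.Linear(4, 1)
opt = StatefulOptimizer(model.parameters(),
                        steppers=[weight_decay_step, momentum_step],
                        stats=[average_grad], lr=0.1, mom=0.9, wd=1e-2)
```

Adam and LAMB then just need different stats (debiased first and second moments) and correspondingly different steppers.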

Finally, we look at data augmentation, and benchmark various data augmentation techniques. We develop a new GPU-based data augmentation approach which we find speeds things up quite dramatically, and which then lets us add more sophisticated warp-based transformations.
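
To illustrate the batch-level idea (a sketch under assumed names, not the lesson's exact implementation): rather than transforming one image at a time on the CPU, we can draw one random transform per image and apply them all at once to the batch tensor, using `affine_grid` and `grid_sample`, the same mechanism that extends to warps.

```python
import math
import torch
import torch.nn.functional as F

def gpu_rotate_batch(xb, max_deg=10.0):
    # One random rotation angle per image, applied to the whole batch at once.
    bs = xb.size(0)
    theta = (torch.rand(bs, device=xb.device) * 2 - 1) * math.radians(max_deg)
    cos, sin, zero = theta.cos(), theta.sin(), torch.zeros_like(theta)
    # A 2x3 affine matrix per image; scales, shifts and warps work the same way.
    mat = torch.stack([cos, -sin, zero, sin, cos, zero], dim=1).view(bs, 2, 3)
    grid = F.affine_grid(mat, xb.size(), align_corners=False)
    return F.grid_sample(xb, grid, align_corners=False)

xb = torch.rand(64, 3, 28, 28)   # move the batch to .cuda() for the speedup
augmented = gpu_rotate_batch(xb)
```
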
Comments

1:40:09 Why is the debias term not mom**(step+1)?

aswahd
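
(For context on the question above: in the Adam paper the bias correction divides each running average by 1 - beta^t, with the step count t starting at 1, so whether the exponent reads `step` or `step + 1` depends only on whether steps are counted from 1 or from 0. A minimal sketch of the paper's convention:)

```python
import torch

beta = 0.9
grad = torch.randn(10)
avg = torch.zeros_like(grad)
for step in range(1, 4):                  # Adam counts steps from 1
    avg = beta * avg + (1 - beta) * grad  # exponentially weighted average
    debiased = avg / (1 - beta ** step)   # standard bias correction
```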

58:40
If I have a problem where the input is a single number, and predicting the target requires knowing things like "is the input greater or less than 'a'? greater or less than 'b'?", then my input dimension is 1 but I need 2 output filters.
If so, doesn't that contradict the argument for having fewer output channels than inputs? Am I missing something?

jonatani

Hi Jeremy, at 8:56 in lsuv_model(), why didn't you scale the variance first and shift the mean afterwards (i.e. swap the two while loops), so as to get exactly zero mean and std = 1?

loctruong