Active Learning. The Secret of Training Models Without Labels.

preview_player
Показать описание
A large part of the success of supervised machine learning systems is the existence of large quantities of labeled data. Unfortunately, in many cases, creating these labels is difficult, expensive, and time-consuming.

An obvious solution is to use machine learning to aid in the creation of the labels, but this presents a chicken and egg problem: how do we build a model to create labels before labeling our data to train that model?

Active Learning is one solution. A semi-supervised learning technique to build better-performing machine learning models using fewer training labels.

Paper mentioned in the video:

📚 My 3 favorite Machine Learning books:

Disclaimer: Some of the links included in this description are affiliate links where I'll earn a small commission if you purchase something. There's no cost to you.
Рекомендации по теме
Комментарии
Автор

Really helpful video, thanks. One small thing though, the sound effects on the title screens were a bit loud imo :)

thecouchman
Автор

Nice video!
You can also use a similar approach to compare models and stay with the one that performs best. Here is how:
A few years ago I was collecting data in the chemistry lab in order to fit some models. Each experiment took 1 day to complete, so I started with a simple factorial design, fitted all models to the initial data set, and then predicted the point of maximum divergence between all models. That point was used as the next experiment and models we refitted thereafter. This procedure was repeated several times.
Computing uncertainty in your predictions is similar, but only with one model.

miguelduqueb
Автор

Duplicated records in the data has a significant meaning. It means that this repeatedly appearing record in the past is probably going to repeatedly appear in the future, it a VIP records, and knowing how to handle it well means you succeeded in high percentage of your supposed to do. So having duplicate data should some how eventually make the model very accurate in predicting it's related lable, more accurate than unique records.

hasanx
Автор

Nice video! Could you also explain about semi-supervised learning? There are not many videos that clearly explain about the progress so far in semi-supervised learning, even though the topic become more popular nowadays

fikriansyahadzaka
Автор

Love it, world class content! Also agree. A thought: Why not start with few shot or zero shot learning before active learning?

knutjagersberg
Автор

Fantástico vídeo. La verdad es que ahora voy a trabajar el código para entenderlo. Gracias por el trabajo que haces para ayudarnos.

tecbrain
Автор

This is lit 🔥. Love this practical approach to Machine learning. Keep doing the amazing work 👏👏

sahanakaweraniyagoda
Автор

I love your videos, nice and extremely informative! Just a quick comment: is it possible not to have those " bommmm!" soun?(: It make impossible to listen your videos in a car or with headphone. Thank you!

erdi
Автор

Another nice video!
Learned a new concept - *Active Learning*

Param
Автор

Loved the Idea of smart labelling. very cool

maheshBasavaraju
Автор

Great explanation, thanks! Do you have some example of labeling services providing this approach?. greetings !

lorenzoleongutierrez
Автор

Hi, Santiago! Love your content!
Could you please make a video on how to start machine learning as a beginner with some programming experience. I've been doing web dev but want to transit into ML. I will appreciate your response 😊

roshanaryal
Автор

This method to me seems a little bit like boosting. I might be wrong though, but boosting is what came to my mind after watching the video.

jayantghadge
Автор

Lovely video Santiago! Quick question: How do we label the low confidence data that the model initially had a hard time predicting since we also didn't know what the label was in the first place. How do we know the label/class to use for that low confidence predicted data when we re-train ?

fobaogunkeye
Автор

Excellent Video. This channel is going to be huge soon

JoaquinRevello
Автор

Great Santiago, real data has never been so easy! LoL

kutkut
Автор

I've some queries. There's no proper practical application of it is it? Since the paper talks about methods proposed along with practical issues.
Since your videos are straight to the point and you try to keep it simple, just wanna know if you've found practical implementation of it in Python etc. Do give a link to it in the description. Thank you

arskas
Автор

Thanks! This was exactly what I needed at the moment! (:

jubakala
Автор

Super insightfull, I`m using this ideas right now!

brunoras
Автор

1:03 - We need to Build a Model to Label the data we need, to Build a Model 🤯

Param