Imbalanced Data Classification

preview_player
Показать описание
Imbalanced data is a disproportionate number of data points with discrete labels and can be a big challenge to develop an accurate classifier. A classifier attempts to find the data boundary where one class ends and the other begins. [[Main/ClassificationOverview|Classification]] is used to create these boundaries when the desired output (label) is discrete such as 0/1, Yes/No, 1,2,3,4,5, or Normal/Abnormal. Regression is used when the desired output (label) is continuous.

Imbalanced data can give biased predictions for classification when there is a minority class. This tutorial demonstrates how to deal with imbalanced data.

Рекомендации по теме
Комментарии
Автор

Great tutorial. Thank you so much for making this

anirbansarkar
Автор

Great class.
Keep up the good work.
Thank You,
Natasha Samuel

natashasamuel