filmov
tv
Handling Imbalanced Data in machine learning classification (Python) - 1
Показать описание
Welcome to our Handling Imbalanced Data in machine learning classification series. You'll work on a highly imbalanced example dataset in Python.
In this Part 1 video, we'll learn:
- what is imbalanced data
- what are the proper evaluation metrics for it
- set up our example of a highly imbalanced dataset ready for modeling.
Please check out the Part 2 video to learn 6 popular techniques to deal with the imbalanced data problem in Python.
✔️Collecting a bigger sample
✔️Oversampling (e.g., random, SMOTE)
✔️Undersampling (e.g., random, K-Means, Tomek links)
✔️Combining over and undersampling
✔️Weighing classes differently
✔️Changing algorithms
Technologies that will be used:
☑️ JupyterLab (Notebook)
☑️ pandas
☑️ sklearn
☑️ imbalanced-learn (imblearn)
Links mentioned in the video
In this Part 1 video, we'll learn:
- what is imbalanced data
- what are the proper evaluation metrics for it
- set up our example of a highly imbalanced dataset ready for modeling.
Please check out the Part 2 video to learn 6 popular techniques to deal with the imbalanced data problem in Python.
✔️Collecting a bigger sample
✔️Oversampling (e.g., random, SMOTE)
✔️Undersampling (e.g., random, K-Means, Tomek links)
✔️Combining over and undersampling
✔️Weighing classes differently
✔️Changing algorithms
Technologies that will be used:
☑️ JupyterLab (Notebook)
☑️ pandas
☑️ sklearn
☑️ imbalanced-learn (imblearn)
Links mentioned in the video