Hands-on Handling missing value using Mean Median mode with Python | Data Cleaning Tutorial 8

preview_player
Показать описание
During the Machine Learning Data Cleaning process, you will often need to figure out whether you have missing values in the data set, and if so, how to deal with it. In this video, I have demonstrated to handling the missing value using statistical way mean, median and mode. In this video I only cover the hands-on explanation using python :-
1. We impute the missing data for a quantitative attribute by the mean or median and for qualitative attribute by mode.
2. Generalized Imputation: In this case, we calculate the mean or median for all non missing values of that variable then replace missing value with mean or median.

3. Similar case Imputation: In this case, we calculate mean individually of non missing values then replace the missing value based on other variable.

Рекомендации по теме
Комментарии
Автор

this is a very helpful video, I must admit. Nice work. I'd love to ask though, what do we do with the NaN gotten after using the groupby function? I mean, how can we replace it with a reasonable value afterwards?

chiomaobiajulu
Автор

How to use group by imputation if we dont have domain knowledge on that particular dataset?

sreeramsaravanan
Автор

why are u doing before test train split

harshavardhan
Автор

HI : in section 7 why did u use 0 after mode ? mode()[0]

terryterry
Автор

in similar case imputation you took 10 + 15 / 2 =12.5 .. where this 2 is coming from . this is because u have only 2 values 10 and 15 ?

terryterry
Автор

Inline no 7 why did you write 0 before mode. could you please tell us about it?

shashankathawale