4.3. Handling Missing Values in Machine Learning | Imputation | Dropping

preview_player
Показать описание
. In this video, I have explained about Handling Missing values using Python in Machine Learning. I have explained about Imputation and Dropping in Handling missing values.

Hi guys! I am Siddhardhan. I work in the field of Data Science and Machine Learning. It all started with my curiosity to learn about Artificial Intelligence and the ability of AI to solve several Real Life Problems. I worked on several Machine Learning & Deep Learning projects involving Computer Vision.
I am on this journey to empower as many students & working professionals as possible with the knowledge of Machine Learning and Artificial Intelligence.

Hello everyone! I am setting up a donation campaign for my YouTube Channel. If you like my videos and wish to support me financially, you can donate through the following means:

(No donation is small. Every penny counts)
Thanks in advance!

I am making a "Hands-on Machine Learning Course with Python" in YouTube. I'll be posting 3 videos per week: Monday Evening; Wednesday Evening; Friday Evening.

Рекомендации по теме
Комментарии
Автор

Thanks. Very useful for a quick refresher of the techniques to deal with missing values.

digigoliath
Автор

Amazing explaination sir, all my doubts are clear

I_Anupam_Pandey
Автор

I've watched all ur ML videos ... it was very helpful to me ... damn clr explanation ...keep going sir

jeezz
Автор

You Sir are a very rare gem in these YouTube streets!🤍💫👏🏾

kelethabetseroberts
Автор

Can you plz make a video on handling categorical missing values with lots of unique values . Also how to put different datasets together with common columns that are in the other datasets that needs to be joined together.

karishmasewraj
Автор

Nice explanation but what to do for large dataset plz make a vedio on that

ashulohar
Автор

Siddhardhan, This is a fantastic video. I just want to find out how to go about a dataset with fluctuating densities? How do you go about it? Do you use mean, median, or mode? Thank you so much

Kevin-gzth
Автор

Hi, First of all the video was really helpful. But I noticed that in the head() function, the salary was NaN when the status was not placed. So I tried this: The output was such that the salary was NaN only when the status was not placed.. I mean the unique value was just "Not Placed". So I guess it will be correct to fillna with 0 instead of median, mode or mean. I understand that you have made this video for just understanding purpose but just telling what I tried. Thanks for the video, very helpful!

criclal
Автор

Bro plz make videos on outlier detection nd removal techniques

jeezz
Автор

Dataset was not correctly read.
Salary column has direct dependency with status column.
Student with status not placed salary should have been replaced with 0 not by any mean, median, mode.
As far my understanding only the outlier satisfying all the dependency should be replaced with any of other methods.
Please correct me if I'm wrong on this?

studystuffs
Автор

If someone is not placed... How does replacing their empty salary with median of the salary make sense? Shouldn't be replaced with Zero??

hishnew
Автор

I see following warning "FutureWarning: A value is trying to be set on a copy of a DataFrame or Series through chained assignment using an inplace method.
The behavior will change in pandas 3.0. This inplace method will never work because the intermediate object on which we are setting values always behaves as a copy" --> Can you please show me modified code snippet that works without inplace method?

meenakaria
Автор

siddhardh ji can you please make a series of videos for explaining each machine learning algorithms

venkatprabhu
Автор

what to do when I have empty rows in date columns and almost 90 percent of the dates are empty,

mungamurisitaramnikhil
Автор

I have data around 20000 and missing values only 23 so I can drop those row? It's good or need to be fill nan value?

sachinvithubone
Автор

Bro, I am not understanding that what arguments we have to choose when we call a method of a library and how we come to know that which argument we have to choose and what is its purpose, i have read the method documentation but I can't understand plz tell me, i am continuously watching your course and now i am on 4th module Preprocessing topic

AliSher-kvpd
Автор

Hi sir,
Sir if are given the regression problem and we have the missing values in the data . Then can we use the regression models like KNN regresor, Random Forest Regressor to find the missing values ?
and then solve our actual problem
Is this the right approach?

amitbudhiraja
Автор

Don't we use pandas profiling for EDA

ankushgupta
Автор

Sir but you added a salary to the person who is not placed it lead to wrong data.

PAVANSIDEAS
Автор

What about handling missing values in catogoercal type of columns

kaushikdwivedi