Data Pre-processing in R: Handling Missing Data

preview_player
Показать описание
In this video, I will show you how you can handle missing data in your own data science project. This video represents the first in a multi-part series on data pre-processing in R.

⭕ Timeline
0:33 First part in Data pre-processing series
1:11 DHFR dataset
2:41 Outline of this episode
4:15 Let's start
4:21 1. Load in the dataset
4:59 2. Check for missing data
5:48 3. Let's make the data dirty!
8:38 4. Check for missing data
10:18 Let's look at rows containing NA
11:29 Let's find the NA in the data
12:45 5. Handling the missing data
12:54 5.1 Simply delete data samples containing NA
13:30 5.2 Perform imputation
16:59 Preview of next episode of this series (on Data pre-processing)

The idea for this video was suggested in a comment by Marco Festugato

⭕ Playlist:
Check out our other videos in the following playlists.

⭕ Subscribe:
If you're new here, it would mean the world to me if you would consider subscribing to this channel.

⭕ Recommended Tools:
Kite is a FREE AI-powered coding assistant that will help you code faster and smarter. The Kite plugin integrates with all the top editors and IDEs to give you smart completions and documentation while you’re typing. I've been using Kite and I love it!

⭕ Recommended Books:

⭕ Stock photos, graphics and videos used on this channel:

⭕ Follow us:

⭕ Disclaimer:
Recommended books and tools are affiliate links that gives me a portion of sales at no cost to you, which will contribute to the improvement of this channel's contents.

#dataprofessor #machinelearning #datapreprocessing #preprocessing #missingdata #datamissing #cleandata #datacleaning #cleaningdata #preprocessingdata #datascienceproject #learnr #rprogramming #learnrprogramming #datascience #datamining #bigdata #datascienceworkshop #dataminingworkshop #dataminingtutorial #datasciencetutorial #ai #artificialintelligence #r
Рекомендации по теме
Комментарии
Автор

🤔QUESTION OF THE DAY: How do you handle missing data? Comments down below! 😃
💗Help support this YouTube channel by hitting the Subscribe button, Like button and Comment down below! 👇

DataProfessor
Автор

Your detailed and relaxing method of teaching helps me a lot, Thanks a bunch

HamedMorady-buey
Автор

Your vids are completely amazing. Please keep posting it, it helps me a lot. Greetings from Brazil!

douglaspiresmartins
Автор

Thank you very much for your channel Professor Chanin. And this series will be really useful. Greetings from Poland !

szymonk.
Автор

Thanks for your video Chanin, I really appreciated it!! Eager to watch more of this^^

marcofestu
Автор

Loved your way of explanation of things step by step it really helped :)
Thanks

lokesh
Автор

Very Nice... It was very good with basic knowledge covered so simply that all concepts cleared in one go.. Thanks a lot

ruhinehri
Автор

It was really helpful sir. Thanks a lot ♥️

dakshaudawatta
Автор

Thanks for the video. Is that possible to introduce how to impute left-censored data (with multiple limits of detection)?

cma
Автор

Sir, can you make a another video on missing data of numerical type where some advanced techniques applicable when mean and median is not working.
You are write thing, keep it up and make more video on R.

deeppatel
Автор

Hello Professor. I have a question which is related to missing data. I have a dataset cars_missing and there has two missing data. When I run sum(is.na(cars_missing)), it's only show 1. Also, When I run view(cars_missing), I can see in column "cubicinches" there has "NA", but in column "brand" there only show blank without NA. I think that's why when I run sum(is.na) there only show 1. Can you explain why?

jiaweihu
Автор

Thank u for this. How to get previous working day date? Week starting on sunday and end on thurs day from 1 sep 2020 todate

ahmedJaber
Автор

Once again..Amazing video!!. However, at the end of the video you suggested to upload other videos on how to deal with missing values for other types of data which are not numerical such as factor, categorical, ordinal..Are you still going to upload these videos? Thanks

desmondojei
Автор

Thanks for the great video! This problem just occured to me ;)
How do you deal with missing data for classification problems though?

jeffersonjones
Автор

How can I replace missing data with characters data?

gamalucianogen
Автор

Can you do a video regarding some of the important algorithms to master, like regression, classification etc? Trying to find information online can sometimes be overwhelming because they throw a million things at you at once. Thanks in advance.

bazi
Автор

really nice video . but sir i am getting error while loading data
Error in function (type, msg, asError = TRUE) :
error:1407742E:SSL alert protocol version

utkarshgupta
Автор

I didn't get the colsums part. Where did it come from? Please reply fast, I have a project assignment.

dikshyasurvi
Автор

¿Which is better for data cleaning python or R?

kevinalejandro
Автор

When is it better to replace NAs with mean? When with median?

marcofestu