Data Cleaning Project in Python

preview_player
Показать описание
Get certified as a data analyst with DataCamp

Gather your own data for projects

💎Join my membership💎

Do this project next - (PowerBI file included)

The dataset can be downloaded from

💎Support my channel 💎

Buy me a coffee

This is a data cleaning project of the FIFA21 dataset that was provided by the Data Challenge Space on twitter.
You will learn how to use pandas and numpy libraries.

LET'S CONNECT

Some of the links on this channel are affiliate links, meaning that, at no additional cost to you, I will earn a commission if you click through and make a purchase. This helps support my channel and allows me to continue creating valuable content for you. I only recommend products or services that I personally use and believe will add value to you. Thank you for being so supportive!
Рекомендации по теме
Комментарии
Автор

This is so far the best data-cleaning tutorial I've seen on the internet. Keep up the good work!

abolarinwayusuf
Автор

Thanks a million for this.
I love your contents❤

footballbanter
Автор

Thanks Irene I was able to go through the tutorial and actually practice on it and I can say it is absolutely helpful to me as a beginner in the data industry .Thumbs up and keep up inspiring!!

deborahchepkoech
Автор

Thank you for this. Can't wait to get to this level of proficiency in Python.

ronykeya
Автор

Really thankful to you. You uploaded different content from others… Your methods are very helpful… ❤

Prathmesh_jadhav
Автор

Right on point for me as an aspiring data analyst. Thank you.

NjabuloDlamini-gl
Автор

Awesome tutorial, thank you for sharing. As a beginner with no confidence in looping/iteration, I have succeeded cleaning the contract column just the naive way, borrowing some pieces from your snippet:
# udf that extract the contract period
def contract_period(period):
if period =='Free' or 'On Loan' in period:
return np.nan
else:
return x.split(' ~ ')

# get contract start and end years as pd.Series
row: contract_period(row)).str[0]
end_year = fifa.Contract.apply(lambda

# Add the new columns in the dataframe
fifa.insert(loc=fifa.columns.get_loc('contract_status')+1, column='Start_Contract', value= start_year)
fifa.insert(loc=fifa.columns.get_loc('contract_status')+2, column='End_Contract', value=end_year)

osoriomatucurane
Автор

hello it is very interesting the project, I would be happy to see more tutorials of this type, I enjoyed the explanation.

nauncastillo
Автор

Awesome Explanation, hope u doing well and please do it more project with python

russel
Автор

Bravo, good job, very insightful & inspiring

emilynjeri
Автор

nice effort. thankyou for it. will there be more videos on this topic

admonitoring-pios
Автор

Thank you for this it was really helpful. Upload more like this and I will soon get a job

zorqhbm
Автор

hie.. This is really helpful.. I would be grateful, if you upload more of such data cleaning videos

rohanbansiyar
Автор

Great content, but if there anyone who encountered an issue when running the convert height function alone rather than the entire notebook?

kevinmugo
Автор

Your video was so helpful. Thank you so much

codewithpraisejames
Автор

Thank you for your effort of love in creating this tutorial/

ZionRemojo
Автор

Girl, are you ok? this is some dope work. I have checked your twitter account and it's been a while since you tweeted

samuel_muly
Автор

Thank you so much for this video. I’m unable to download the dataset tho.

QueenOpex
Автор

I want to learn more please upload more videos 🙏

sonusiwan
Автор

Thanks for contributing its really helped me to polish myself. love from bangladesh

rumonseoexparts