Pandas Profiling for Data Science (Quick and Easy Exploratory Data Analysis)

preview_player
Показать описание
In this video, I will be showing you how to use the pandas-profiling library in Python to easily and quickly perform Exploratory Data Analysis. In just a few lines of code you can get a glimpse of your data.

⭕ Playlist:
Check out our other videos in the following playlists.

⭕ Subscribe:
If you're new here, it would mean the world to me if you would consider subscribing to this channel.

⭕ Recommended Tools:
Kite is a FREE AI-powered coding assistant that will help you code faster and smarter. The Kite plugin integrates with all the top editors and IDEs to give you smart completions and documentation while you’re typing. I've been using Kite and I love it!

⭕ Recommended Books:

⭕ Stock photos, graphics and videos used on this channel:

⭕ Follow us:

⭕ Disclaimer:
Recommended books and tools are affiliate links that gives me a portion of sales at no cost to you, which will contribute to the improvement of this channel's contents.

#dataprofessor #pandas #pandasprofiling #EDA #exploratorydataanalysis #datavisualization #datavisualisation #dataviz #datavis #dataframe #dataframes #jupyternotebook #jupyter #googlecolab #colaboratory #notebook #machinelearning #datascienceproject #randomforest #decisiontree #svm #neuralnet #neuralnetwork #supportvectormachine #python #learnpython #pythonprogramming #datascience #datamining #bigdata #datascienceworkshop #dataminingworkshop #dataminingtutorial #datasciencetutorial #ai #artificialintelligence #tutorial #dataanalytics #dataanalysis #machinelearningmodel
Рекомендации по теме
Комментарии
Автор

This is cool! I've never used that package before

KenJee_ds
Автор

As you may notice, Google Colab does not yet support pandas-profiling. The good news is that Kaggle Kernel (Notebook) supports pandas-profiling out of the box (no need to pip install, just import the library and your good to go).
I've just created a Kaggle Kernel, check it out here:
On the Kaggle Kernel, please give it a thumbs up so others can notice it too! Thanks for helping out! 😃

DataProfessor
Автор

Amazing library and excellent video! Kudos! Data Professor is one of the best sources of information regarding data science, nowadays.

AlessandroBottoni
Автор

Hi Chanin, thank you so much for this quick guide. I recently just started my first independent data science project, but I am really struggling with the EDA. Beyond the simple summary statistics, histograms, density plots, and correlation plots, what other analysis should I also do for a new data set? Is there an algorithmic approach that one should always follow to get a good grasp of the data set and its limitations?

grantchan
Автор

I watched for 3 mnts and then MAGIC. It's a great library. loved it.

irfanahmad
Автор

Thanks a lot Data Professor, I really enjoyed learning this approach to exploratory data analysis. You are awesome👍🏾👍🏾👍🏾👍🏾

chrisosomo
Автор

will you make an updated video now that pandas profiling will deprecate?

lindasolis
Автор

nice but I wish that they colored the scatter plot because i cant see which the x axis and the y axis .

husseinfadin
Автор

I think you should increase font because i am using simultaneously, with sreen splitting. and I am facing problem.

heythere
Автор

Hi, Can we also profile the data null records, also profiling data based on business rules like if the state is = NJ Country must be USA

databauatconsultant
Автор

I used this library just to see how it works. It gives great insights. But I've 1 doubt, how is data cleaning done? how is missing value imputation done? I am new to data science. Please help.

atod
Автор

Hi, first of all, I'd say thanks to you
when I started pip install I get this ERROR
ERROR: coursera-dl 0.11.5 has requirement attrs==18.1.0, but you'll have attrs 20.3.0 which is incompatible.
could you help me?

amrmoursi
Автор

Ooooh my god.. what a great video It is. You are great. I loved it.
I expect a omics machine learning project!

angsumandas
Автор

Hello! This tool is amazing! I am currently using it to process some data in my company. Just a question, my data has over 2M rows, could the tool be limiting the amount of data? I am getting some results that I’m not sure they’re right

dianamgdata
Автор

Can you please tell me how to remove report generated by y-data persent at last

abhishektripathi
Автор

Hi Data Professor, I also have a problem installing the pandas-profiling package. Here is the error message: "ERROR: Could not install packages due to an EnvironmentError: [Errno 2] No such file or directory: ". Please advise. Thanks!

linli
Автор

increase notebook size for better visibility

anuragshrivastava
Автор

Keep up the great work! you are being followed! Thanks...

onlymusic
Автор

BrokenProcessPool: A task has failed to un-serialize. Please ensure that the arguments of the function are all picklable.

I get those error, do you know why? thanks in advance

ekosetiawan_indo
Автор

Very good video and package. The HTML argument is used for what exact purpose?

FredericBiondi