Correcting Skewed Data with SciPy and NumPy

Skewed data can adversely affect your analysis and machine learning models. In this video, I demonstrate five methods for cleaning skewed data using the NumPy and SciPy modules. The methods include taking the square root, cube root, fourth root, log, and Yeo-Johnson transform. I also showcase the effectiveness of each method by summarizing the skewness of the data after each transformation with a bar plot.
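
A minimal sketch of the five transformations named in the description, assuming a synthetic log-normal sample because the video's dataset is not reproduced here. The variable names are illustrative, and a printed summary stands in for the bar plot.

```python
import numpy as np
from scipy import stats

# Synthetic right-skewed sample (an assumption, not the data used in the video).
rng = np.random.default_rng(0)
x = rng.lognormal(mean=0.0, sigma=1.0, size=5000)

# With no lambda supplied, yeojohnson returns the transformed data
# together with the lambda it fitted by maximum likelihood.
yeo, fitted_lambda = stats.yeojohnson(x)

transforms = {
    "original": x,
    "square root": np.sqrt(x),
    "cube root": np.cbrt(x),
    "fourth root": x ** 0.25,
    "log": np.log(x),          # requires strictly positive values
    "Yeo-Johnson": yeo,
}

# Sample skewness after each transformation; values near 0 mean roughly symmetric.
skewness = {name: float(stats.skew(values)) for name, values in transforms.items()}
for name, s in skewness.items():
    print(f"{name:12s} skewness = {s:+.3f}")
```

Passing `skewness.keys()` and `skewness.values()` to a plotting call such as matplotlib's `plt.bar` would give a bar-plot summary like the one described.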
Comments

First time I have seen your videos. This is genuinely a very good video. Very well explained and clear. I am subscribing.

The music wasn’t off-putting either!

marcom

the on-screen text is a great addition, Dr. P!

officialscience

Thank you for the video, subscribed! YouTube needs more quality content like this.

metinunlu_

Amazing video
I like its structure: motivation, overview with examples, practical advice.
Thanks!

mmucyex

So what about if we were to standardize using z-scoring? It seems like that would get largely the same impact, wouldn't it?

undertaker
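
On the z-scoring question above: standardizing only shifts and rescales the values, so it leaves the shape of the distribution, and therefore its skewness, unchanged. A small check, assuming a synthetic exponential sample rather than any data from the video:

```python
import numpy as np
from scipy import stats

# Synthetic right-skewed sample (an assumption for illustration).
rng = np.random.default_rng(0)
x = rng.exponential(scale=2.0, size=5000)

# Z-scoring: subtract the mean, divide by the standard deviation.
z = stats.zscore(x)

skew_raw = float(stats.skew(x))
skew_z = float(stats.skew(z))
print(f"skewness before z-scoring: {skew_raw:.4f}")
print(f"skewness after z-scoring:  {skew_z:.4f}")
```

The two printed values agree up to floating-point error, whereas a nonlinear transform such as a root or log does change the skewness.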

This is interesting. If one corrects the original skewed data with these kinds of transformations, in the context of linear regression or multiple linear regression, will that not change the interpretation of the original data? Curious to know.

pabloagogo

Skew doesn’t necessarily matter if you’re using XGBoost, correct? For classification or regression, that is.

AyahuascaDataScientist

Amazing video! I was creating a function for measuring the same thing. You forgot to mention log1p, which is the log of (x+1); it's really useful for right-skewed data with values less than 1.

nicolaslpf
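
A hedged sketch of the log1p suggestion above, using `np.log1p` on a synthetic sample whose values are mostly below 1 (the sample is an assumption). For such values a plain log produces large negative numbers and can flip the skew to the left, while log1p compresses the right tail more gently.

```python
import numpy as np
from scipy import stats

# Synthetic right-skewed sample with most values below 1 (an assumption).
rng = np.random.default_rng(0)
x = rng.exponential(scale=0.2, size=5000)

skew_raw = float(stats.skew(x))
skew_log = float(stats.skew(np.log(x)))      # plain log of small values
skew_log1p = float(stats.skew(np.log1p(x)))  # log(1 + x)

print(f"original skewness: {skew_raw:+.3f}")
print(f"log skewness:      {skew_log:+.3f}")
print(f"log1p skewness:    {skew_log1p:+.3f}")
```

On this sample the plain log overshoots into negative skewness, and log1p lowers the skewness without changing its sign.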

Thank you! That was helpful!
So we can basically take a root of any order? Is there a drawback to exploiting it, like continuing to increase n when raising a feature to the power of 1/n?

dannybee
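
On increasing n: because n(x^(1/n) - 1) tends to log(x) as n grows, higher-order roots behave more and more like the log transform. The sketch below, assuming a synthetic exponential sample, tracks the skewness of x^(1/n) for a few illustrative values of n and compares it with the skewness of log(x).

```python
import numpy as np
from scipy import stats

# Synthetic right-skewed sample (an assumption for illustration).
rng = np.random.default_rng(0)
x = rng.exponential(scale=1.0, size=5000)

# Skewness of the n-th root for increasing n (n = 1 is the untransformed data).
orders = [1, 2, 3, 4, 8, 64, 1000]
skews = {n: float(stats.skew(x ** (1.0 / n))) for n in orders}
skew_log = float(stats.skew(np.log(x)))

for n, s in skews.items():
    print(f"n = {n:4d}  skewness = {s:+.3f}")
print(f"log       skewness = {skew_log:+.3f}")
```

On this sample the skewness falls steadily, crosses zero, and approaches the value for log(x), so pushing n too high can over-correct a right skew into a left skew. Very large n also squeezes every value close to 1.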

SUBSCRIBED! What should one do first? Or, what's the correct order: treating outliers, imputing missing values, correcting skew? Thanks Dr. P!

thoniasenna

What if my data contains a lot of useful '0' values?

pewkaboo
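
On data with many zeros: a plain log maps 0 to negative infinity, while roots, log1p, and the Yeo-Johnson transform are all defined at 0. A minimal sketch, assuming synthetic count data with a large share of zeros:

```python
import numpy as np
from scipy import stats

# Synthetic counts with many zeros (an assumption for illustration).
rng = np.random.default_rng(0)
x = rng.poisson(lam=1.5, size=5000).astype(float)
n_zeros = int((x == 0).sum())

# Plain log turns every zero into -inf (the divide warning is silenced here).
with np.errstate(divide="ignore"):
    logged = np.log(x)
n_inf = int(np.isinf(logged).sum())
print(f"zeros: {n_zeros}, -inf values after log: {n_inf}")

# Transformations that stay finite at zero.
candidates = {
    "square root": np.sqrt(x),
    "log1p": np.log1p(x),
    "Yeo-Johnson": stats.yeojohnson(x)[0],
}
for name, values in candidates.items():
    print(f"{name:12s} finite: {bool(np.all(np.isfinite(values)))}  "
          f"skewness = {float(stats.skew(values)):+.3f}")
```

The zeros stay in the data as a single repeated value after each of these transformations, so a spike at the low end remains no matter which one is chosen.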

Bro, you explain a concept, but do you need the music?! It’s distracting.

mouhsineelqesry