Removing Outliers From a Dataset

preview_player
Показать описание

Рекомендации по теме
Комментарии
Автор

Great video! I did not know before how to get z-scores with this easy way, I always computed it using the formula. More appropriate way to deal with outliers, in case when you don't want to keep them and just remove them, then get the z-score for your variable of interest just as the lady did. Check the z-score and the variable together by both ascending and descending order of the z-score if you want, just as the lady did. Then had to the data section, select cases, select if condition is satisfied, write the formula in the formula portion - ABS(z_price) <= 3 and hit continue and then ok. Boom! your outliers are now gone.

theg.ksinstinct
Автор

God bless you. Every piece of documentation and every guide I've read tells me "just delete the crazy values lol."

RectalDesign
Автор

Boxplots are also very useful - from a visual standpoint - when checking for outliers :)

JustKevStockholm
Автор

How can i determine the critical value which helps delimitate the outliers. For instance your value was 3.29, but why >?

rsxZ
Автор

trying to do this for university and we have the same surname, the universe works in weird ways

benotoole
Автор

Do you have any references I can use to cite this method in my thesis?

menzir
Автор

Thanks for the advice, but how do I know which ones are the actual outliers? My data is not as simple as this one in this video.

myfictionalthoughts
Автор

Holy crap! Thank you so much for this! This just helped solve a problem I've been working on for a couple days

mattadoritmo
Автор

thanks Siobhan, really useful video, but what if I have outliers on both sides of the histogram? How can I set up the filter in Variable View Missing Column in this case?

martinbrestovansky
Автор

Hey, thanks for the amazing video. Would it somehow possible to quote the techniques you used? really would like to use this in my thesis. Kind regards :)

sharist
Автор

Why 3.29? Isn't 1.96 was the key number?

Electrify
Автор

Very useful video! Thanks for uploading!

chunnjh
Автор

thank you very nice explanation, been frustrated all day, thanks

whiteshadow
Автор

Is there an automated way of removing outliers from positive and negative end? I have a data set of 120 000 cases and hundreds of outliers...

Brickkzz
Автор

if i delete it~~the previous test such as normality test and compute mean need to redo?or will change automatically?

zivleong
Автор

Great video. I don't understand the 'id' section though. Is this just the IV? What if I have two IV's? Thanks!

PurpleRawr
Автор

Thank you for your video, do you perhaps know who I can cite for this method?

vkunst
Автор

how i define PC mom column? It use mean or something ?

MrKKrid
Автор

But if I exclude a case for a certain variable, shouldn't I exclude this case for all other variables too?

ScroogeMD
Автор

hello i want to ask, if i want to do the run test, i have to use the new data or old data that still have the outlier?

*sorry for my bad english

novitaasastr