Finding an outlier in a dataset using Python


In this video, we will see how to find an outlier in a dataset using Python.

ref: #medium articles

#Outlierdetection

You can buy my book on Finance with Machine Learning and Deep Learning from the URL below.


Comments

Clustering techniques are also widely used in industry to detect outliers, especially the isolation forest algorithm.

yourkarma

The tutorial offers a lucid explanation of the complex problem of outliers. It is well presented, with examples that made it easier to follow. However, threshold = 3 isn't working for me. I modified it to threshold = 3+std to make it work properly. Moreover, declaring outliers = [ ] outside the function causes problems if you want to use this function on another dataset in the same notebook. So declaring the outlier list inside the function would be a better approach, I think.

shujashakir
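
The two fixes suggested in the comment above (creating the list inside the function and exposing the threshold as a parameter) can be sketched as follows. This is a minimal illustration, not the code from the video: the function name `detect_outliers_zscore`, the sample `dataset`, and the choice of the population standard deviation are all assumptions.

```python
import statistics


def detect_outliers_zscore(data, threshold=3.0):
    """Return the values whose absolute z-score exceeds the threshold."""
    outliers = []  # created on every call, so results never leak between datasets
    mean = statistics.fmean(data)
    std = statistics.pstdev(data)  # population standard deviation (an assumption)
    if std == 0:
        return outliers  # all values are identical, so nothing stands out
    for value in data:
        z_score = (value - mean) / std
        if abs(z_score) > threshold:
            outliers.append(value)  # store the data point, not the z-score
    return outliers


# Hypothetical sample: twenty values near 11.5 and one extreme value.
dataset = [10, 12, 11, 13, 12, 11, 10, 12, 13, 11,
           12, 10, 11, 13, 12, 11, 10, 12, 13, 11, 102]
print(detect_outliers_zscore(dataset))  # [102]
```

One possible reason a threshold of 3 never fires on a small sample: with the population standard deviation, no z-score can exceed sqrt(n - 1), so a dataset of 10 or fewer points can never produce a value above 3. Lowering the threshold or using the IQR rule may be more suitable in that case.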

You have explained things well. Just one correction - it's inter-quartile range and not inter-quantile range.

mridulagarwal

Amazing Krish, now I understand the concept of outliers, thanks

doubando

Here is the correction: lower bound = q1 - 1.5*IQR and upper bound = q3 + 1.5*IQR.

shadrul
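
The corrected bounds in the comment above can be checked with a short sketch. It assumes Python 3.8+ for `statistics.quantiles`; the function names `iqr_bounds` and `detect_outliers_iqr`, the sample `values`, and the "inclusive" quantile method are illustrative choices, not taken from the video.

```python
import statistics


def iqr_bounds(data, k=1.5):
    """Return (lower, upper) fences: Q1 - k*IQR and Q3 + k*IQR."""
    # "inclusive" interpolates linearly between the sorted data points.
    q1, _median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def detect_outliers_iqr(data, k=1.5):
    """Return the values that fall outside the IQR fences."""
    lower, upper = iqr_bounds(data, k)
    return [x for x in data if x < lower or x > upper]


# Hypothetical sample with one extreme value.
values = [1, 2, 3, 4, 5, 6, 7, 8, 9, 100]
print(iqr_bounds(values))           # (-3.5, 14.5)
print(detect_outliers_iqr(values))  # [100]
```

Here Q1 = 3.25 and Q3 = 7.75, so IQR = 4.5 and the fences are 3.25 - 6.75 = -3.5 and 7.75 + 6.75 = 14.5. Other quantile conventions give slightly different quartiles on small samples.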

Superb explanation, in a very simple way.

AmitSharma-pozb

Very clear and crisp explanation, loved it

srijeetful

13:57 Correction
Lower bound = Q1 - IQR*1.5
Upper bound = Q3 + IQR*1.5

vamsinadh

Thank you so much sir, I understood everything

adityapradhan

I have a couple of questions.
1. Is it always better to remove the outliers, or could that be a big mistake as well? You gave the example of a fraudulent transaction. An outlier is indeed a hint that the transaction was fraudulent. If I remove all such transactions in the first place, how am I going to achieve my results?

2. You did not explain how to perform outlier checks on a multivariate dataset, for example the IRIS dataset. I have seen a couple of videos here and there, but no proper way is coming out of them. What is the proper way to identify outliers in multivariate datasets?

Thanks

smalirizvi
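
On the second question above, one commonly used option (not necessarily the only or best one) is the Mahalanobis distance, which measures how far a point lies from the centre of the data while accounting for correlation between features. The sketch below is limited to two numeric features so it can stay in plain Python; the function name `mahalanobis_distances` and the sample data are hypothetical.

```python
import math
import statistics


def mahalanobis_distances(xs, ys):
    """Mahalanobis distance of each (x, y) pair from the mean of the data."""
    n = len(xs)
    mean_x, mean_y = statistics.fmean(xs), statistics.fmean(ys)
    # Entries of the sample covariance matrix [[sxx, sxy], [sxy, syy]].
    sxx = sum((x - mean_x) ** 2 for x in xs) / (n - 1)
    syy = sum((y - mean_y) ** 2 for y in ys) / (n - 1)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)
    det = sxx * syy - sxy ** 2
    if det == 0:
        raise ValueError("covariance matrix is singular")
    distances = []
    for x, y in zip(xs, ys):
        dx, dy = x - mean_x, y - mean_y
        # d^2 = [dx dy] * inverse(cov) * [dx dy]^T, written out for the 2x2 case.
        d2 = (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det
        distances.append(math.sqrt(max(d2, 0.0)))  # guard against rounding below 0
    return distances


# Hypothetical data: y roughly follows x, except for the last point (4, 7.0).
xs = [1, 2, 3, 4, 5, 6, 7, 8, 4]
ys = [1.2, 1.9, 3.1, 4.0, 5.2, 5.8, 7.1, 8.0, 7.0]
distances = mahalanobis_distances(xs, ys)
most_unusual = distances.index(max(distances))
print(most_unusual, (xs[most_unusual], ys[most_unusual]))
```

The point (4, 7.0) is unremarkable in each feature taken alone, since both values lie inside the observed ranges, yet it breaks the relationship between the two features. A per-column z-score or IQR check would miss it, which is why multivariate data often needs a joint measure like this.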

Nice work mate. I also tried something similar but with Upper and Lower Bound on the Return

thedatascientist_me

Hi Krish, Thank you so much for the tutorial, Very clear and crisp explanation, loved it :)

satheeshswaminathan

Just a correction: when calculating the z-score, you are subtracting an array from i. You should enumerate over the datasets and then use the mean and std at the current index for i.

kaka

great video sir. great content, and explained in the cleanest way possible. thanks

yuktikhantwal

Well explained, would be great if you can add some plot for visualization.

ryando

Nice content, and you explained it very well. Thank you so much.

gyapti-fctfinder

Hi Krish, well explained. Can you please post a video on how to equate the outliers using any dataset? Thanks in advance.

Ashokkumar-scvt

I have been following your videos and have learnt many things, Krish Naik. Could you please tell me whether you have written any data science and machine learning books? I would like to buy your books and follow your videos to clinch a data science job as soon as possible.

ksoftqatutorials

Hi Krish
Thanks for the excellent explanation. But if we get some outliers in a feature, should we remove the records containing them (in which case we lose some data)? If not, how can we handle outliers? Please cover this portion also :)

niveshtayal
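
One alternative to dropping records, as asked above, is to cap extreme values at the IQR fences so that every row is kept. The sketch below is an illustration under assumptions: the function name `cap_outliers_iqr` and the sample `readings` are hypothetical, and whether capping is appropriate depends on the data and the task.

```python
import statistics


def cap_outliers_iqr(data, k=1.5):
    """Clip each value to the range [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    # Values inside the fences are unchanged; values outside move to the nearest fence.
    return [min(max(x, lower), upper) for x in data]


# Hypothetical sample with one extreme value.
readings = [1, 2, 3, 4, 5, 6, 7, 8, 9, 100]
capped = cap_outliers_iqr(readings)
print(capped)  # [1, 2, 3, 4, 5, 6, 7, 8, 9, 14.5]
```

No records are lost, but the capped value no longer reflects the original measurement. When the outlier itself is the signal, as in fraud detection, it is usually better to keep and flag it than to cap or remove it.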

Thanks for sharing this video.
One correction: in the loop it should be outliers.append(i), not outliers.append(y).

otroleonarbe
