Welcome to Scribd!

Finding Outliers 2 Wayes Z-Score and Interquortile Range

Uploaded by

0% found this document useful (0 votes)

9 views1 page

The document discusses using Z-scores to identify outliers in data that follows a normal distribution. It explains that a Z-score indicates how many standard deviations an observation is from the mean, and values more than 3 standard deviations out are considered outliers. However, outliers can skew the calculation of Z-scores by influencing the mean and standard deviation. The document then introduces an alternative method using interquartile range to calculate inner and outer fences to identify outliers. Values outside the outer fences would be outliers.

Original Description:

Outliers how to find it - Statistics

Original Title

Finding Outliers 2 wayes Z-Score and interquortile range

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

9 views1 page

Finding Outliers 2 Wayes Z-Score and Interquortile Range

Uploaded by

Ana Chikovani

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 1

Search inside document

Using Z-scores to Detect Outliers

Z-scores can quantify the unusualness of an observation when your data follow the normal distribution. Z-scores
are the number of standard deviations above and below the mean that each value falls. For example, a Z-score
of 2 indicates that an observation is two standard deviations above the average while a Z-score of -2 signifies it
is two standard deviations below the mean. A Z-score of zero represents a value that equals the mean.

The further away an observation’s Z-score is from zero, the more unusual it is. A standard cut-off value for
finding outliers are Z-scores of +/-3 or further from zero. The probability distribution below displays the
distribution of Z-scores in a standard normal distribution. Z-scores beyond +/- 3 are so extreme you can barely
see the shading under the curve.

In a population that follows the normal distribution, Z-score values more extreme than +/- 3 have a probability
of 0.0027 (2 * 0.00135), which is about 1 in 370 observations. However, if your data don’t follow the normal
distribution, this approach might not be accurate.

Also, note that the outlier’s presence throws off the Z-scores because it inflates the mean and standard deviation
as we saw earlier. Notice how all the Z-scores are negative except the outlier’s value. If we calculated Z-scores
without the outlier, they’d be different! Be aware that if your dataset contains outliers, Z-values are biased such
that they appear to be less extreme (i.e., closer to zero).

To calculate the outlier fences, do the following:

1. Take your IQR and multiply it by 1.5 and 3. We’ll use these values
to obtain the inner and outer fences. For our example, the IQR equals 0.222.
Consequently, 0.222 * 1.5 = 0.333 and 0.222 * 3 = 0.666. We’ll use 0.333
and 0.666 in the following steps.
2. Calculate the inner and outer lower fences. Take the Q1 value and subtract the two values from step 1. The two
results are the lower inner and outer outlier fences. For our example, Q1 is 1.714. So, the lower inner fence =
1.714 – 0.333 = 1.381 and the lower outer fence = 1.714 – 0.666 = 1.048.
3. Calculate the inner and outer upper fences. Take the Q3 value and add the two values from step 1. The two
results are the upper inner and upper outlier fences. For our example, Q3 is 1.936. So, the upper inner fence =
1.936 + 0.333 = 2.269 and the upper outer fence = 1.936 + 0.666 = 2.602.

Using the Outlier Fences with Our Example Dataset

For our example dataset, the values for these fences are 1.048, 1.381, 2.269, and 2.602. Almost all of our data
should fall between the inner fences, which are 1.381 and 2.269. At this point, we look at our data values and
determine whether any qualify as being major or minor outliers. 14 out of the 15 data points fall inside the inner
fences—they are not outliers. The 15th data point falls outside the upper outer fence—it’s a major or extreme
outlier.

The IQR method is helpful because it uses percentiles, which do not depend on a specific distribution.
Additionally, percentiles are relatively robust to the presence of outliers compared to the other quantitative
methods. Values that fall inside the two inner fences are not outliers. Let’s see how this method works using
our example dataset.

Factors Influencing Career Choice of ABM Students With Family Business in SPSPS
Document32 pages
Factors Influencing Career Choice of ABM Students With Family Business in SPSPS
De Asis Andrei
0% (1)
Theoretical Questions - Answer MM
Document12 pages
Theoretical Questions - Answer MM
Mohamed Raheem
No ratings yet
Nemo Explains The Approaches
Document1 page
Nemo Explains The Approaches
ahdyal
No ratings yet
Outliers Z-Score
Document1 page
Outliers Z-Score
Ana Chikovani
No ratings yet
5 Ways To Find Outliers in Your Data - Statistics by Jim
Document35 pages
5 Ways To Find Outliers in Your Data - Statistics by Jim
Arindam Chakraborty
No ratings yet
Outlier Detection
Document41 pages
Outlier Detection
Tanishi Gupta
No ratings yet
Explanatory Data Analysis
Document28 pages
Explanatory Data Analysis
devashreereddy
No ratings yet
07 Box Plots, Variance and Standard Deviation
Document5 pages
07 Box Plots, Variance and Standard Deviation
Alex Childs
No ratings yet
Outliers in Machine Learning
Document13 pages
Outliers in Machine Learning
Sushma M
No ratings yet
Boxplot Outlier
Document3 pages
Boxplot Outlier
Vivian Ling
No ratings yet
Outlier
Document12 pages
Outlier
kaleidosky
No ratings yet
Measures of Dispersion
Document11 pages
Measures of Dispersion
Kristen Imie Dungog Lacapag
No ratings yet
Quantitative Methods in Management
Document67 pages
Quantitative Methods in Management
manish gupta
No ratings yet
Aalto Test
Document6 pages
Aalto Test
Sahas Shah
No ratings yet
Psychological Statistics
Document4 pages
Psychological Statistics
20-0078-887
No ratings yet
Z Score Distribution: Presented by Harisa Tajammul
Document27 pages
Z Score Distribution: Presented by Harisa Tajammul
Khadijah Taifoor
No ratings yet
Fulgar - 9040 Standard Normal Distribution
Document29 pages
Fulgar - 9040 Standard Normal Distribution
eugene louie ibarra
No ratings yet
Answer Report (Preditive Modelling)
Document29 pages
Answer Report (Preditive Modelling)
Shweta Lakhera
100% (1)
Outlier Analysis in Data Mining
Document5 pages
Outlier Analysis in Data Mining
Diksha Gupta
No ratings yet
1.07 Z-Scores
Document2 pages
1.07 Z-Scores
Aisha Chohan
No ratings yet
Z Score
Document7 pages
Z Score
akshay
No ratings yet
Maths IA - Maria
Document16 pages
Maths IA - Maria
Maria Griesser
No ratings yet
How To Calculate Outliers
Document7 pages
How To Calculate Outliers
Celina Borillo
No ratings yet
Ester Paksuniemi Assignment5
Document9 pages
Ester Paksuniemi Assignment5
ester.paksuniemi
No ratings yet
Mind - How To Build A Neural Network (Part One)
Document9 pages
Mind - How To Build A Neural Network (Part One)
Marcos Moreira Alves
No ratings yet
OUTLIERS
Document5 pages
OUTLIERS
Rana Arslan Munir
100% (1)
Human Resource Analytics
Document9 pages
Human Resource Analytics
Princy
No ratings yet
Detecting and Treating Outliers - Treating The Odd One Out!: Data Science Blogathon
Document6 pages
Detecting and Treating Outliers - Treating The Odd One Out!: Data Science Blogathon
Narendra Singh
No ratings yet
Data Science Essentials: Visualizing Statistics
Document4 pages
Data Science Essentials: Visualizing Statistics
ThubtenDorje
No ratings yet
Data Analyzing by Using Z-Score Method and PCA: W.M.Safras Sc/2018/10464
Document13 pages
Data Analyzing by Using Z-Score Method and PCA: W.M.Safras Sc/2018/10464
mohamed safras
No ratings yet
Lesson 2.3 Standard Normal Curve and Z Scores
Document19 pages
Lesson 2.3 Standard Normal Curve and Z Scores
Klarence Timothy Pineda Bundang
No ratings yet
Data Mining Project
Document22 pages
Data Mining Project
Ranadip Guha
No ratings yet
BS-chapter3-2021-Z - Score or STD Score
Document17 pages
BS-chapter3-2021-Z - Score or STD Score
farwa
No ratings yet
Descriptive Statistics Theory
Document10 pages
Descriptive Statistics Theory
Layan Mohammad
No ratings yet
Normal Distribution1
Document21 pages
Normal Distribution1
Rabin Baniya
No ratings yet
Week 1: Lecture 2: Significant Figures
Document10 pages
Week 1: Lecture 2: Significant Figures
Amirul Fadlin
No ratings yet
719 Final Syllabus Merged
Document200 pages
719 Final Syllabus Merged
mraza17
No ratings yet
An Explanation of Z-Scores (Standardized Values)
Document2 pages
An Explanation of Z-Scores (Standardized Values)
JG
No ratings yet
Chapter 02-Describing Distributions With Numbers
Document21 pages
Chapter 02-Describing Distributions With Numbers
amandaluvsanimals
No ratings yet
Logit R101
Document27 pages
Logit R101
Thiago Rocha
No ratings yet
ANN6
Document19 pages
ANN6
ARPIT SANJAY AVASARMOL R2566003
No ratings yet
Assignment 2
Document22 pages
Assignment 2
Apurva Negi
No ratings yet
Eda 2
Document21 pages
Eda 2
Riya Singh
No ratings yet
Data Preprocessing
Document56 pages
Data Preprocessing
Raksa Kun
No ratings yet
Module Using The Empirical Rule
Document14 pages
Module Using The Empirical Rule
Izyle Cabriga
No ratings yet
Normal 30
Document2 pages
Normal 30
Kerdid Simbolon
No ratings yet
Averages and Allegations
Document8 pages
Averages and Allegations
tanya1780
No ratings yet
lecture 3
Document23 pages
lecture 3
ghania azhar
No ratings yet
Lesson 2.3 Standard Normal Curve and Z Scores
Document18 pages
Lesson 2.3 Standard Normal Curve and Z Scores
Klarence Timothy Pineda Bundang
No ratings yet
Detecting Data Outliers
Document7 pages
Detecting Data Outliers
Judy Ann Galleno
No ratings yet
Z-Score: Definition, Calculation and Interpretation
Document5 pages
Z-Score: Definition, Calculation and Interpretation
ppkuldeep4
No ratings yet
Topic 3. Normal Distribution
Document33 pages
Topic 3. Normal Distribution
Rodel Camposo
100% (1)
Measures of Dispersion
Document26 pages
Measures of Dispersion
yaminis.0223
No ratings yet
Visual Presentation of Data: by Means of Box Plots
Document4 pages
Visual Presentation of Data: by Means of Box Plots
hutomorezky
No ratings yet
Variance and Standard Deviation
Document15 pages
Variance and Standard Deviation
Srikirupa V Muraly
100% (3)
Applied Statistics Outliers Chapter 2
Document12 pages
Applied Statistics Outliers Chapter 2
khadija
No ratings yet
Advanced Search On Linear Data Structures: Li Yin February 8, 2020
Document17 pages
Advanced Search On Linear Data Structures: Li Yin February 8, 2020
mark
No ratings yet
Lecture 5
Document25 pages
Lecture 5
Luna eukharis
No ratings yet
STA1013 Study Guide
Document3 pages
STA1013 Study Guide
Julia
No ratings yet
Descriptive Statistics: 由Nordridesign提供
Document21 pages
Descriptive Statistics: 由Nordridesign提供
مي عديسان
No ratings yet
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
Document24 pages
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
Manoj Paliwal
No ratings yet
Gre Formula Book
From Everand
Gre Formula Book
Saifuddin Kamran
No ratings yet
Box Plot Conspect
Document2 pages
Box Plot Conspect
Ana Chikovani
No ratings yet
Distributions Normal Binominal ..
Document1 page
Distributions Normal Binominal ..
Ana Chikovani
No ratings yet
What Can I Write in Mid Term
Document5 pages
What Can I Write in Mid Term
Ana Chikovani
No ratings yet
Box Plot Consect
Document2 pages
Box Plot Consect
Ana Chikovani
No ratings yet
Histograms Conspect
Document1 page
Histograms Conspect
Ana Chikovani
No ratings yet
Distributions Normal Binominal
Document1 page
Distributions Normal Binominal
Ana Chikovani
No ratings yet
MBA5112 - Project 4
Document2 pages
MBA5112 - Project 4
Ana Chikovani
No ratings yet
MBA5112 - Project 5
Document2 pages
MBA5112 - Project 5
Ana Chikovani
No ratings yet
MBA5112 - Project 3
Document2 pages
MBA5112 - Project 3
Ana Chikovani
No ratings yet
MBA5112 - Project 2
Document1 page
MBA5112 - Project 2
Ana Chikovani
No ratings yet
Final Project - Group 1
Document6 pages
Final Project - Group 1
Ana Chikovani
No ratings yet
Project 1 - Descriptive Statistics
Document11 pages
Project 1 - Descriptive Statistics
Ana Chikovani
No ratings yet
Supplemental Data Co.1943-7862.0000673 Yates
Document7 pages
Supplemental Data Co.1943-7862.0000673 Yates
smitupatil
No ratings yet
Topclean S CPC 30 Ti235cen
Document32 pages
Topclean S CPC 30 Ti235cen
Vicho Tronico
No ratings yet
I Nternal Combustion Engine Fundamentals PDF
Document78 pages
I Nternal Combustion Engine Fundamentals PDF
sub
No ratings yet
Sonnets
Document8 pages
Sonnets
Hazel Catapang
No ratings yet
Beryl Gemstone Poster x2
Document2 pages
Beryl Gemstone Poster x2
Nqobile Simphiwe
No ratings yet
NS-EN 447 - 2007 Grout For Prestressing Tendons-Basic Requirements
Document16 pages
NS-EN 447 - 2007 Grout For Prestressing Tendons-Basic Requirements
FATİH ÖZTÜRK
No ratings yet
Laurente, Nieva A. Bsed Ii-Filipino
Document4 pages
Laurente, Nieva A. Bsed Ii-Filipino
Nieva Aldiano Laurente
No ratings yet
Tensile Properties of Plastics by Use of Microtensile Specimens
Document5 pages
Tensile Properties of Plastics by Use of Microtensile Specimens
Srikanth Srikanti
100% (1)
DLL Mathematics-1 Q1 W6
Document5 pages
DLL Mathematics-1 Q1 W6
Lea Versoza
No ratings yet
Flowchart
Document3 pages
Flowchart
Sarah Sazali
No ratings yet
Humanities Module 1
Document5 pages
Humanities Module 1
Sunshine Glimada
No ratings yet
Paper 2 Nov 2007 Physics
Document12 pages
Paper 2 Nov 2007 Physics
solarixe
No ratings yet
Definition:: Retention
Document39 pages
Definition:: Retention
tirathram
No ratings yet
Business World (Jan. 14, 2016)
Document27 pages
Business World (Jan. 14, 2016)
Peter Rojas
No ratings yet
Product Carbon Footprint Study
Document307 pages
Product Carbon Footprint Study
Gianandrea Rizzi
No ratings yet
15210-Article Text-47244-2-10-20170330
Document9 pages
15210-Article Text-47244-2-10-20170330
Aminda Makrufatul Fadilah
No ratings yet
Quiz Aviation 101
Document5 pages
Quiz Aviation 101
Brayan Villalba
No ratings yet
Servo Motor 1
Document3 pages
Servo Motor 1
Amin Shaik
No ratings yet
Armored Fiber Optic OM3
Document2 pages
Armored Fiber Optic OM3
Daniel Fernandez
No ratings yet
Stahl Family History Selinsgrove, PA
Document15 pages
Stahl Family History Selinsgrove, PA
bzgfb6d5mp
No ratings yet
Mineral Resource Classification. How The Viability of Your Project May Hang On A Qualified Person'S Judgement
Document9 pages
Mineral Resource Classification. How The Viability of Your Project May Hang On A Qualified Person'S Judgement
Alexandra Paola Rodriguez Caicedo
No ratings yet
HDPE Catalog
Document56 pages
HDPE Catalog
Salwa Alsamneh
No ratings yet
Week-2 Module-2 Basis of Remote Sensing Image Representation
Document18 pages
Week-2 Module-2 Basis of Remote Sensing Image Representation
Trambak Bhattacharya
No ratings yet
00000330
Document64 pages
00000330
Bahman Matouri
No ratings yet
Agriculture Under President Gloria Macapagal Arroyo
Document32 pages
Agriculture Under President Gloria Macapagal Arroyo
Vin Lava
60% (10)
Brochure PDF
Document4 pages
Brochure PDF
SUJITH KRISHNAN
No ratings yet
FinalHistoricalAtlas June9 ForUpload
Document124 pages
FinalHistoricalAtlas June9 ForUpload
Xandelyn Racel B. Reyes
No ratings yet
Muti Stage Amplifier
Document28 pages
Muti Stage Amplifier
haitham78h
100% (1)