3 views

Uploaded by niyati25

WHAT ARE OUTLIERS82.pptx

save

- What Are Outliers176
- What Are Outliers239
- What Are Outliers168
- What Are Outliers138
- WHAT ARE OUTLIERS72.pptx
- What Are Outliers242
- What Are Outliers245
- What Are Outliers76
- What Are Outliers169
- What Are Outliers106
- What Are Outliers261
- What Are Outliers227
- What Are Outliers86
- What Are Outliers195
- What Are Outliers222
- 1583665_67383352_Comparing+Methods+Assignmentf17
- Matlab2014a Data Analysis
- Prepositions.pptx
- Reserch Methodology Course Outline
- AsaRopert and LizbethZuniga
- Cosmides 1985 Intro
- Table of Contents
- Cluster
- en
- IUPAC - Determination of Tocopherols and Tocotrienols in Vegetable Oils and Fats by High Performance Liquid Chromatography
- The Smart Recruit
- Class 3 - Theory and Hypotheses
- Mind on Statistics 5th Edition Utts Test Bank
- Math FNBE0115 - Statistic Brief
- An Innovative Idea to Discover the Trend on Multi-dimensional Spatio-temporal Datasets
- 9.e
- 98qwerty
- Cs453 d HTML Javascript 1
- What Are Outliers271
- BhavanaRam Paper
- Computer Graphics1
- ch27.pdf
- 184890998 CEH v8 Labs Module 13 Hacking Web Applications PDF
- Ieee Test Plan
- FTKManual
- 2 D Transformations 2
- chap 7
- 9.e
- 4.80 SY IT
- Computer Graphics December 2010 - Old(Vtuplanet.com)
- What Are Outliers260
- What Are Outliers268
- What Are Outliers269
- What Are Outliers270
- What Are Outliers255
- What Are Outliers257
- What Are Outliers258
- What Are Outliers262
- What Are Outliers266
- What Are Outliers272
- What Are Outliers261
- What Are Outliers264
- What Are Outliers259
- What Are Outliers263
- What Are Outliers267

You are on page 1of 15

• Outliers can be caused by measurement or execution error. These data objects are outliers. • the outliers may be of particular interest .• A database may contain data objects that do not comply with the general behavior or model of the data.

Applications: • Fraud detection • Medicine • Public health • Sports statistics • Detecting measurement errors .

OUTLIER DETECTION METHODS • Statistical Distribution-Based Outlier Detection • Distance-Based Outlier Detection • Density-Based Local Outlier Detection • Deviation-Based Outlier Detection .

.Statistical Distribution-Based Outlier Detection • assumes a distribution for the given data set • identifies outliers with respect to the model using a discordancy test • requires knowledge of the data set parameters • knowledge of distribution parameters • expected number of outliers.

How does the discordancy testing work? • This test examines two hypotheses: • working hypothesis • alternative hypothesis .

is a statement that the entire data set of n objects comes from an initial distribution model. • H : oi E F. n. … . is evaluated • If SP(vi) is small H is rejected . that is.• A working hypothesis. F. where i = 1. H. 2. • Verifies whether oi is <> in relation to F • Assume T is some statistic used as discordancy test • Assume value of the statistic for object oi is vi • Then distribution T is constructed • SP(vi)=Prob(T > vi).

. which states that oi comes from another distribution model. is adopted. • The result is very much dependent on which model F is chosen because oi may be an outlier under one model and a perfectly valid value under another.• An alternative hypothesis. H. G.

H : oi E (1-mu)F +muG. where i = 1. • Inherent alternative distribution H’ : oi E G. 2. n • Mixture alternative distribution G. : : : . where i = 1. n.• kinds of alternative distributions. • Slippage alternative distribution . : : : . 2.

11 that is. of the objects in D lie at a distance greater than dmin from o. pct. . a DB(pct. if at least a fraction. o. D.Distance-Based Outlier Detection • An object. is a distancebased (DB) outlier with parameters pct and dmin. in a data set.dmin)-outlier.

algorithms for mining distance-based outliers • Index-based algorithm • Nested-loop algorithm • Cell-based algorithm .

Density-Based Local Outlier Detection • Distance-based outlier detection is based on global distance distribution • It encounters difficulties to identify outliers if data is not uniformly distributed. .

Deviation-Based Outlier Detection • it identifies outliers by examining the main characteristics of objects in a group • two techniques for deviation-based outlier detection • Sequential Exception Technique • OLAP Data Cube Technique .

Sequential Exception Technique .

OLAP Data Cube Technique .

- What Are Outliers176Uploaded byniyati25
- What Are Outliers239Uploaded byniyati25
- What Are Outliers168Uploaded byniyati25
- What Are Outliers138Uploaded byniyati25
- WHAT ARE OUTLIERS72.pptxUploaded byniyati25
- What Are Outliers242Uploaded byniyati25
- What Are Outliers245Uploaded byniyati25
- What Are Outliers76Uploaded byniyati25
- What Are Outliers169Uploaded byniyati25
- What Are Outliers106Uploaded byniyati25
- What Are Outliers261Uploaded byniyati25
- What Are Outliers227Uploaded byniyati25
- What Are Outliers86Uploaded byniyati25
- What Are Outliers195Uploaded byniyati25
- What Are Outliers222Uploaded byniyati25
- 1583665_67383352_Comparing+Methods+Assignmentf17Uploaded byAli Nasar
- Matlab2014a Data AnalysisUploaded byLeke Ogunranti
- Prepositions.pptxUploaded byNionios Speed
- Reserch Methodology Course OutlineUploaded byGaurav Singh Chauhan
- AsaRopert and LizbethZunigaUploaded byfvijayan
- Cosmides 1985 IntroUploaded byMario Cacasenno
- Table of ContentsUploaded byAnonymous XgWlFZLLwX
- ClusterUploaded bySanjay Nath
- enUploaded byDana Madalina
- IUPAC - Determination of Tocopherols and Tocotrienols in Vegetable Oils and Fats by High Performance Liquid ChromatographyUploaded byagomezlo
- The Smart RecruitUploaded bybobby
- Class 3 - Theory and HypothesesUploaded byPhillip Wininger
- Mind on Statistics 5th Edition Utts Test BankUploaded bya848562378
- Math FNBE0115 - Statistic BriefUploaded byMuhammad Nazmi
- An Innovative Idea to Discover the Trend on Multi-dimensional Spatio-temporal DatasetsUploaded byesatjournals

- 9.eUploaded byniyati25
- 98qwertyUploaded byniyati25
- Cs453 d HTML Javascript 1Uploaded byniyati25
- What Are Outliers271Uploaded byniyati25
- BhavanaRam PaperUploaded byniyati25
- Computer Graphics1Uploaded byniyati25
- ch27.pdfUploaded byniyati25
- 184890998 CEH v8 Labs Module 13 Hacking Web Applications PDFUploaded byniyati25
- Ieee Test PlanUploaded byniyati25
- FTKManualUploaded byniyati25
- 2 D Transformations 2Uploaded byniyati25
- chap 7Uploaded byniyati25
- 9.eUploaded byniyati25
- 4.80 SY ITUploaded byniyati25
- Computer Graphics December 2010 - Old(Vtuplanet.com)Uploaded byniyati25
- What Are Outliers260Uploaded byniyati25
- What Are Outliers268Uploaded byniyati25
- What Are Outliers269Uploaded byniyati25
- What Are Outliers270Uploaded byniyati25
- What Are Outliers255Uploaded byniyati25
- What Are Outliers257Uploaded byniyati25
- What Are Outliers258Uploaded byniyati25
- What Are Outliers262Uploaded byniyati25
- What Are Outliers266Uploaded byniyati25
- What Are Outliers272Uploaded byniyati25
- What Are Outliers261Uploaded byniyati25
- What Are Outliers264Uploaded byniyati25
- What Are Outliers259Uploaded byniyati25
- What Are Outliers263Uploaded byniyati25
- What Are Outliers267Uploaded byniyati25