3 views

Uploaded by niyati25

WHAT ARE OUTLIERS259.pptx

save

- HEC-SSP User's Manual.pdf
- IEEE C57.104 Minutes of Meeting
- Midterm1-Sol-Fall10
- Brave Diggers Skill Trigger Chance
- Brain Development
- Evaluation of Museum Collection
- handout1.docx
- COMPUSOFT, 3(1), 487-490.pdf
- s
- Ameli Todorov
- jet eng
- Part 1
- managing-stats-for-optimal-query-performance
- COLLECTING AND PLOTTING DATA
- Practical Concepts of Quality Control
- Research Methodology_original
- Research proposal
- learning models matrix document inquiry
- Geographical Skills Glossary
- A Comparative Study of Students Performance in Technology Based Theory and Practical Subjects in Nigerian Universities IJSTR
- Output bj
- Excel Box and Whisker Diagrams (Box Plots) - Peltier Tech Blog
- Normal Table
- finalproject7-31-141
- chem math pow
- 8th_meanmedian and Mode
- Print Bolton Richard
- Reflection
- RM(6)Hypothesis
- assmt5
- 184890998 CEH v8 Labs Module 13 Hacking Web Applications PDF
- Ieee Test Plan
- 2 D Transformations 2
- FTKManual
- chap 7
- 9.e
- 4.80 SY IT
- Computer Graphics December 2010 - Old(Vtuplanet.com)
- 9.e
- Cs453 d HTML Javascript 1
- 98qwerty
- What Are Outliers271
- Computer Graphics1
- BhavanaRam Paper
- ch27.pdf
- What Are Outliers261
- What Are Outliers264
- What Are Outliers263
- What Are Outliers267
- What Are Outliers260
- What Are Outliers269
- What Are Outliers268
- What Are Outliers257
- What Are Outliers255
- What Are Outliers270
- What Are Outliers266
- What Are Outliers262
- What Are Outliers258
- What Are Outliers272

You are on page 1of 15

• A database may contain data objects that do not comply with the general behavior or model of the data. These data objects are outliers. • the outliers may be of particular interest . • Outliers can be caused by measurement or execution error.

Applications: • Fraud detection • Medicine • Public health • Sports statistics • Detecting measurement errors .

OUTLIER DETECTION METHODS • Statistical Distribution-Based Outlier Detection • Distance-Based Outlier Detection • Density-Based Local Outlier Detection • Deviation-Based Outlier Detection .

Statistical Distribution-Based Outlier Detection • assumes a distribution for the given data set • identifies outliers with respect to the model using a discordancy test • requires knowledge of the data set parameters • knowledge of distribution parameters • expected number of outliers. .

How does the discordancy testing work? • This test examines two hypotheses: • working hypothesis • alternative hypothesis .

• Verifies whether oi is <> in relation to F • Assume T is some statistic used as discordancy test • Assume value of the statistic for object oi is vi • Then distribution T is constructed • SP(vi)=Prob(T > vi).• A working hypothesis. is evaluated • If SP(vi) is small H is rejected . that is. 2. where i = 1. is a statement that the entire data set of n objects comes from an initial distribution model. F. • H : oi E F. … . n. H.

• An alternative hypothesis. H. which states that oi comes from another distribution model. is adopted. • The result is very much dependent on which model F is chosen because oi may be an outlier under one model and a perfectly valid value under another. G. .

where i = 1. H : oi E (1-mu)F +muG. n • Mixture alternative distribution G.• kinds of alternative distributions. • Slippage alternative distribution . • Inherent alternative distribution H’ : oi E G. : : : . where i = 1. 2. n. : : : . 2.

o. D. a DB(pct.Distance-Based Outlier Detection • An object. in a data set. . is a distancebased (DB) outlier with parameters pct and dmin. if at least a fraction.dmin)-outlier.11 that is. pct. of the objects in D lie at a distance greater than dmin from o.

algorithms for mining distance-based outliers • Index-based algorithm • Nested-loop algorithm • Cell-based algorithm .

.Density-Based Local Outlier Detection • Distance-based outlier detection is based on global distance distribution • It encounters difficulties to identify outliers if data is not uniformly distributed.

Deviation-Based Outlier Detection • it identifies outliers by examining the main characteristics of objects in a group • two techniques for deviation-based outlier detection • Sequential Exception Technique • OLAP Data Cube Technique .

Sequential Exception Technique .

OLAP Data Cube Technique .

- HEC-SSP User's Manual.pdfUploaded byDritan Bratko
- IEEE C57.104 Minutes of MeetingUploaded bySreeram Panigrahi
- Midterm1-Sol-Fall10Uploaded byKevin Sun
- Brave Diggers Skill Trigger ChanceUploaded byJancolin Yani
- Brain DevelopmentUploaded byscribd4tavo
- Evaluation of Museum CollectionUploaded byFabricio Da Costa Caxias
- handout1.docxUploaded bybambang
- COMPUSOFT, 3(1), 487-490.pdfUploaded byIjact Editor
- sUploaded byevasvdasdv
- Ameli TodorovUploaded byMoisés Sáez Beltrán
- jet engUploaded byveljss007
- Part 1Uploaded byAnkur Upadhyay
- managing-stats-for-optimal-query-performanceUploaded bywetwilly123
- COLLECTING AND PLOTTING DATAUploaded byTeka Kam
- Practical Concepts of Quality ControlUploaded bySchreiber_Dieses
- Research Methodology_originalUploaded byRizwan Shameem
- Research proposalUploaded bySuhail Khan
- learning models matrix document inquiryUploaded byapi-239373469
- Geographical Skills GlossaryUploaded byDavid Drake
- A Comparative Study of Students Performance in Technology Based Theory and Practical Subjects in Nigerian Universities IJSTRUploaded byIJSTR Research Publication
- Output bjUploaded byZainalMustofa
- Excel Box and Whisker Diagrams (Box Plots) - Peltier Tech BlogUploaded byMargarita Rodriguez Duran
- Normal TableUploaded byNasir Mehmood Aryani
- finalproject7-31-141Uploaded byapi-231635350
- chem math powUploaded byapi-298364374
- 8th_meanmedian and ModeUploaded byYetz
- Print Bolton RichardUploaded byapi-3806285
- ReflectionUploaded bymatang_lawin
- RM(6)HypothesisUploaded bymarysruthyanto89
- assmt5Uploaded byklaymen292

- 184890998 CEH v8 Labs Module 13 Hacking Web Applications PDFUploaded byniyati25
- Ieee Test PlanUploaded byniyati25
- 2 D Transformations 2Uploaded byniyati25
- FTKManualUploaded byniyati25
- chap 7Uploaded byniyati25
- 9.eUploaded byniyati25
- 4.80 SY ITUploaded byniyati25
- Computer Graphics December 2010 - Old(Vtuplanet.com)Uploaded byniyati25
- 9.eUploaded byniyati25
- Cs453 d HTML Javascript 1Uploaded byniyati25
- 98qwertyUploaded byniyati25
- What Are Outliers271Uploaded byniyati25
- Computer Graphics1Uploaded byniyati25
- BhavanaRam PaperUploaded byniyati25
- ch27.pdfUploaded byniyati25
- What Are Outliers261Uploaded byniyati25
- What Are Outliers264Uploaded byniyati25
- What Are Outliers263Uploaded byniyati25
- What Are Outliers267Uploaded byniyati25
- What Are Outliers260Uploaded byniyati25
- What Are Outliers269Uploaded byniyati25
- What Are Outliers268Uploaded byniyati25
- What Are Outliers257Uploaded byniyati25
- What Are Outliers255Uploaded byniyati25
- What Are Outliers270Uploaded byniyati25
- What Are Outliers266Uploaded byniyati25
- What Are Outliers262Uploaded byniyati25
- What Are Outliers258Uploaded byniyati25
- What Are Outliers272Uploaded byniyati25