3 views

Uploaded by niyati25

WHAT ARE OUTLIERS82.pptx

save

- Aapt Guelph 00 Pdfnya David
- Brett Analysis
- What Are Outliers32
- What Are Outliers158
- Exercise 2
- Exam1a.Fall02
- Scientific Method
- Good Documentation Practice
- A Statisical Approach to Machinery Condition Monitoring - Blake
- 1Preparing Data
- nrcc47014
- ML003739956.pdf
- What Are Outliers39
- Effective and Efficient Approach for Detecting Outliers
- 42 Ermco 2015 Assessment of Concrete Compressive Strength in Structures
- Modified Rubric for Evaluating Shs Prostat Culminating Project
- zoominweek8illuminatinggroupproject
- Online Poker - Rigged or Not? A case study: Pokerstars
- 20110113scientificmethod-110120030748-phpapp02
- Tax Avoidance and Evasion as a Factor Influencing ‘Creative Accounting Practice’ Among Companies in Kenya
- Phys Eei Hints
- Draft Summary Essay 1
- Elad 6773 Resource Document 08 Mertler6
- How to Write a Deadly EEI
- Madeeha Hypothesis Types
- Research Methods for Science
- Social Science-Course Outline
- Quartely Journal of Economics - Vol 128 Feb 2013
- Plan
- A hoard of astragals discovered in the Copper Age settlement at Iepures¸ti, Giurgiu County, Romania
- Ieee Test Plan
- 184890998 CEH v8 Labs Module 13 Hacking Web Applications PDF
- 2 D Transformations 2
- FTKManual
- 9.e
- chap 7
- Computer Graphics December 2010 - Old(Vtuplanet.com)
- 4.80 SY IT
- Cs453 d HTML Javascript 1
- 98qwerty
- 9.e
- Computer Graphics1
- BhavanaRam Paper
- What Are Outliers271
- ch27.pdf
- What Are Outliers261
- What Are Outliers264
- What Are Outliers259
- What Are Outliers267
- What Are Outliers263
- What Are Outliers260
- What Are Outliers269
- What Are Outliers268
- What Are Outliers257
- What Are Outliers270
- What Are Outliers255
- What Are Outliers272
- What Are Outliers266
- What Are Outliers258
- What Are Outliers262

You are on page 1of 15

• A database may contain data objects that do not comply with the general behavior or model of the data. These data objects are outliers. • the outliers may be of particular interest . • Outliers can be caused by measurement or execution error.

Applications: • Fraud detection • Medicine • Public health • Sports statistics • Detecting measurement errors .

OUTLIER DETECTION METHODS • Statistical Distribution-Based Outlier Detection • Distance-Based Outlier Detection • Density-Based Local Outlier Detection • Deviation-Based Outlier Detection .

.Statistical Distribution-Based Outlier Detection • assumes a distribution for the given data set • identifies outliers with respect to the model using a discordancy test • requires knowledge of the data set parameters • knowledge of distribution parameters • expected number of outliers.

How does the discordancy testing work? • This test examines two hypotheses: • working hypothesis • alternative hypothesis .

• Verifies whether oi is <> in relation to F • Assume T is some statistic used as discordancy test • Assume value of the statistic for object oi is vi • Then distribution T is constructed • SP(vi)=Prob(T > vi). is evaluated • If SP(vi) is small H is rejected . 2. n. H. … . • H : oi E F. F. where i = 1. that is.• A working hypothesis. is a statement that the entire data set of n objects comes from an initial distribution model.

• An alternative hypothesis. H. which states that oi comes from another distribution model. G. . is adopted. • The result is very much dependent on which model F is chosen because oi may be an outlier under one model and a perfectly valid value under another.

2. 2. : : : . • Slippage alternative distribution . H : oi E (1-mu)F +muG. where i = 1.• kinds of alternative distributions. • Inherent alternative distribution H’ : oi E G. : : : . n • Mixture alternative distribution G. where i = 1. n.

Distance-Based Outlier Detection • An object. D. is a distance-based (DB) outlier with parameters pct and dmin.dmin)-outlier. if at least a fraction. pct.11 that is. in a data set. . o. of the objects in D lie at a distance greater than dmin from o. a DB(pct.

algorithms for mining distancebased outliers • Index-based algorithm • Nested-loop algorithm • Cell-based algorithm .

Density-Based Local Outlier Detection • Distance-based outlier detection is based on global distance distribution • It encounters difficulties to identify outliers if data is not uniformly distributed. .

Deviation-Based Outlier Detection • it identifies outliers by examining the main characteristics of objects in a group • two techniques for deviation-based outlier detection • Sequential Exception Technique • OLAP Data Cube Technique .

Sequential Exception Technique .

OLAP Data Cube Technique .

- Aapt Guelph 00 Pdfnya DavidUploaded byAl-akbar Septian
- Brett AnalysisUploaded byBrett Petzer
- What Are Outliers32Uploaded byniyati25
- What Are Outliers158Uploaded byniyati25
- Exercise 2Uploaded bybookmoon
- Exam1a.Fall02Uploaded byAhmed M T
- Scientific MethodUploaded byYan Lean Dollison
- Good Documentation PracticeUploaded byarif18539
- A Statisical Approach to Machinery Condition Monitoring - BlakeUploaded bytylerdurdane
- 1Preparing DataUploaded byUkky
- nrcc47014Uploaded byEduardo Gabriel Luchetti
- ML003739956.pdfUploaded byCristina Castillo
- What Are Outliers39Uploaded byniyati25
- Effective and Efficient Approach for Detecting OutliersUploaded byEditor IJRITCC
- 42 Ermco 2015 Assessment of Concrete Compressive Strength in StructuresUploaded byIoan Sosa
- Modified Rubric for Evaluating Shs Prostat Culminating ProjectUploaded byrr
- zoominweek8illuminatinggroupprojectUploaded byapi-297846009
- Online Poker - Rigged or Not? A case study: PokerstarsUploaded byIonut Apahideanu
- 20110113scientificmethod-110120030748-phpapp02Uploaded byAlan Sobrevilla
- Tax Avoidance and Evasion as a Factor Influencing ‘Creative Accounting Practice’ Among Companies in KenyaUploaded byCharles Guandaru Kamau
- Phys Eei HintsUploaded bycheyennel71
- Draft Summary Essay 1Uploaded byg_tsoukalas
- Elad 6773 Resource Document 08 Mertler6Uploaded byArta Koxha
- How to Write a Deadly EEIUploaded byChristoph Kirch
- Madeeha Hypothesis TypesUploaded byMadeeha Qamar
- Research Methods for ScienceUploaded byAlfarezaLazuardy
- Social Science-Course OutlineUploaded bykenha2000
- Quartely Journal of Economics - Vol 128 Feb 2013Uploaded byLinh Tran
- PlanUploaded byKyle Barrow
- A hoard of astragals discovered in the Copper Age settlement at Iepures¸ti, Giurgiu County, RomaniaUploaded byMonica Mărgărit

- Ieee Test PlanUploaded byniyati25
- 184890998 CEH v8 Labs Module 13 Hacking Web Applications PDFUploaded byniyati25
- 2 D Transformations 2Uploaded byniyati25
- FTKManualUploaded byniyati25
- 9.eUploaded byniyati25
- chap 7Uploaded byniyati25
- Computer Graphics December 2010 - Old(Vtuplanet.com)Uploaded byniyati25
- 4.80 SY ITUploaded byniyati25
- Cs453 d HTML Javascript 1Uploaded byniyati25
- 98qwertyUploaded byniyati25
- 9.eUploaded byniyati25
- Computer Graphics1Uploaded byniyati25
- BhavanaRam PaperUploaded byniyati25
- What Are Outliers271Uploaded byniyati25
- ch27.pdfUploaded byniyati25
- What Are Outliers261Uploaded byniyati25
- What Are Outliers264Uploaded byniyati25
- What Are Outliers259Uploaded byniyati25
- What Are Outliers267Uploaded byniyati25
- What Are Outliers263Uploaded byniyati25
- What Are Outliers260Uploaded byniyati25
- What Are Outliers269Uploaded byniyati25
- What Are Outliers268Uploaded byniyati25
- What Are Outliers257Uploaded byniyati25
- What Are Outliers270Uploaded byniyati25
- What Are Outliers255Uploaded byniyati25
- What Are Outliers272Uploaded byniyati25
- What Are Outliers266Uploaded byniyati25
- What Are Outliers258Uploaded byniyati25
- What Are Outliers262Uploaded byniyati25