Uploaded by niyati25

WHAT ARE OUTLIERS255

• A database may contain data objects that do not comply with the general behavior or model of the data. These data objects are outliers.
• Outliers can be caused by measurement or execution error.
• The outliers may be of particular interest.

Applications:
• Fraud detection
• Medicine
• Public health
• Sports statistics
• Detecting measurement errors

OUTLIER DETECTION METHODS
• Statistical Distribution-Based Outlier Detection
• Distance-Based Outlier Detection
• Density-Based Local Outlier Detection
• Deviation-Based Outlier Detection

Statistical Distribution-Based Outlier Detection
• Assumes a distribution for the given data set
• Identifies outliers with respect to the model using a discordancy test
• Requires knowledge of the data set parameters, the distribution parameters, and the expected number of outliers

How does the discordancy testing work?
• This test examines two hypotheses:
• Working hypothesis
• Alternative hypothesis

• A working hypothesis, $H$, is a statement that the entire data set of $n$ objects comes from an initial distribution model, $F$; that is, $H : o_i \in F$, where $i = 1, 2, \ldots, n$.
• The discordancy test verifies whether $o_i$ is significantly large (or small) in relation to $F$.
• Assume $T$ is some statistic used as the discordancy test.
• Assume the value of the statistic for object $o_i$ is $v_i$.
• The distribution of $T$ is then constructed.
• The significance probability $SP(v_i) = \mathrm{Prob}(T > v_i)$ is evaluated.
• If $SP(v_i)$ is small, $H$ is rejected.
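The test above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method of any particular library: F is assumed to be a normal distribution with known mean and standard deviation, T is taken to be the absolute standardized deviation from the mean, and the function names and the threshold alpha are hypothetical.

```python
import math

def significance_probability(value, mean, std):
    """SP(v) = Prob(T > v) for T = |X - mean| / std, assuming X ~ Normal(mean, std)."""
    v = abs(value - mean) / std          # value of the statistic for this object
    return math.erfc(v / math.sqrt(2.0)) # two-sided normal tail probability

def is_discordant(value, mean, std, alpha=0.01):
    """Reject the working hypothesis H for this object when SP(v) is below alpha."""
    return significance_probability(value, mean, std) < alpha

# Hypothetical readings; F is assumed to be Normal(10.0, 0.2).
readings = [9.8, 10.1, 10.0, 9.9, 10.2, 14.5]
flagged = [x for x in readings if is_discordant(x, mean=10.0, std=0.2)]
print(flagged)
```

Changing the assumed F (for example a different mean or standard deviation) changes which objects are flagged, which is why the choice of model matters.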

• When $H$ is rejected, an alternative hypothesis, $H'$, which states that $o_i$ comes from another distribution model, $G$, is adopted.
• The result depends heavily on which model $F$ is chosen, because $o_i$ may be an outlier under one model and a perfectly valid value under another.

Kinds of alternative distributions
• Inherent alternative distribution: $H' : o_i \in G$, where $i = 1, 2, \ldots, n$
• Mixture alternative distribution: $H' : o_i \in (1 - \mu)F + \mu G$, where $i = 1, 2, \ldots, n$
• Slippage alternative distribution
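To make the mixture alternative concrete, here is a minimal sketch that draws objects from (1 - mu)F + mu G: each object comes from G with probability mu and from F otherwise. The helper name, the two normal distributions, and the value of mu are assumptions chosen only for illustration.

```python
import random

def sample_mixture(n, mu, draw_f, draw_g, rng):
    """Draw n objects from the mixture (1 - mu) * F + mu * G."""
    # rng.random() is in [0, 1), so mu = 0 always picks F and mu = 1 always picks G.
    return [draw_g(rng) if rng.random() < mu else draw_f(rng) for _ in range(n)]

rng = random.Random(0)                      # fixed seed so the run is repeatable
draw_f = lambda r: r.gauss(0.0, 1.0)        # assumed F: Normal(0, 1)
draw_g = lambda r: r.gauss(8.0, 1.0)        # assumed G: Normal(8, 1), the contaminant
data = sample_mixture(1000, 0.05, draw_f, draw_g, rng)
print(len(data), max(data))
```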

Distance-Based Outlier Detection
• An object, o, in a data set, D, is a distance-based (DB) outlier with parameters pct and dmin, that is, a DB(pct, dmin)-outlier, if at least a fraction, pct, of the objects in D lie at a distance greater than dmin from o.
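The DB(pct, dmin) definition can be checked directly with a naive double loop over the data set. The sketch below assumes Euclidean distance and uses a hypothetical function name; it compares every pair of objects, so it only illustrates the definition and is not one of the efficient mining algorithms.

```python
import math

def db_outliers(points, pct, dmin):
    """Return the objects o for which at least a fraction pct of D lies farther than dmin."""
    n = len(points)
    outliers = []
    for o in points:
        # Count objects in D whose distance from o exceeds dmin (o itself is at distance 0).
        far = sum(1 for p in points if math.dist(o, p) > dmin)
        if far / n >= pct:
            outliers.append(o)
    return outliers

# Hypothetical 2-D data: a tight group plus one distant object.
D = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10)]
print(db_outliers(D, pct=0.75, dmin=3.0))
```

Here four of the five objects lie farther than 3.0 from (10, 10), so it meets the 0.75 fraction, while each object in the tight group has only one distant object.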

Algorithms for mining distance-based outliers
• Index-based algorithm
• Nested-loop algorithm
• Cell-based algorithm

Density-Based Local Outlier Detection
• Distance-based outlier detection is based on the global distance distribution.
• It has difficulty identifying outliers when the data are not uniformly distributed.
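The difficulty with a single global distance can be illustrated with a simplified relative-density score: the mean distance from an object to its k nearest neighbours, divided by the same quantity averaged over those neighbours. This is an assumption-laden sketch in the spirit of local density comparison, not the full local outlier factor definition; the function names, k, and the data are hypothetical.

```python
import math

def knn(points, i, k):
    """Indices of the k nearest neighbours of points[i], excluding itself."""
    others = [j for j in range(len(points)) if j != i]
    others.sort(key=lambda j: math.dist(points[i], points[j]))
    return others[:k]

def mean_knn_distance(points, i, k):
    """Average distance from points[i] to its k nearest neighbours."""
    return sum(math.dist(points[i], points[j]) for j in knn(points, i, k)) / k

def local_scores(points, k=2):
    """Score near 1: density similar to the neighbours'. Much larger: locally sparse."""
    avg = [mean_knn_distance(points, i, k) for i in range(len(points))]
    scores = []
    for i in range(len(points)):
        neighbour_avg = sum(avg[j] for j in knn(points, i, k)) / k
        scores.append(avg[i] / neighbour_avg)
    return scores

dense = [(0, 0), (0, 0.1), (0.1, 0), (0.1, 0.1)]   # tightly packed cluster
sparse = [(5, 5), (5, 7), (7, 5), (7, 7)]          # loosely packed cluster
points = dense + sparse + [(0.6, 0.6)]             # object just outside the dense cluster
scores = local_scores(points, k=2)
print(max(range(len(points)), key=lambda i: scores[i]))
```

The object at (0.6, 0.6) is closer to its neighbours than the sparse-cluster objects are to theirs, so a single global dmin cannot isolate it without also flagging the sparse cluster; relative to its own neighbourhood, however, it receives the highest score.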

Deviation-Based Outlier Detection
• Identifies outliers by examining the main characteristics of objects in a group
• Two techniques for deviation-based outlier detection:
• Sequential Exception Technique
• OLAP Data Cube Technique

Sequential Exception Technique

OLAP Data Cube Technique
