Outlier Analysis
Data Mining:
Data Mining Methods
with Dr. Qin Lv
Learning objective: Apply techniques for outlier analysis and explain how they work. Evaluate and compare methods.
Outlier/Anomaly
Ø General/normal patterns
• Frequent patterns, classification, clustering
Ø Abnormal: differs from the norm
Ø Outlier analysis/anomaly detection
• “Outliers are errors and should be removed?”
• Outliers may be significant events, fraud, …
Types of Outliers/Anomalies
Ø Global
Ø Contextual
Ø Collective
[Figure: example plot, y-axis in Kelvin]
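The difference between a global and a collective outlier can be sketched in a few lines of Python. This is a minimal illustration, not the method used in the lecture: the function names, the thresholds, the assumed "normal" baseline (`mu`, `sigma`) and the two sample series are all hypothetical. A global outlier is flagged with a robust z-score (median and MAD); a collective outlier is flagged when the mean of a window is abnormal even though no single point in it is.

```python
from math import sqrt
from statistics import median


def global_outliers(values, threshold=3.5):
    """Indices of points far from the bulk of the data (robust z-score)."""
    med = median(values)
    mad = median(abs(v - med) for v in values)
    # 1.4826 * MAD approximates the standard deviation for normal data;
    # fall back to 1.0 if MAD is zero to avoid dividing by zero.
    scale = 1.4826 * mad if mad > 0 else 1.0
    return [i for i, v in enumerate(values) if abs(v - med) / scale > threshold]


def collective_outliers(values, mu, sigma, window=4, threshold=3.0):
    """(start, end) index pairs of windows whose mean is abnormal
    although no single point inside the window is."""
    hits = []
    for start in range(len(values) - window + 1):
        chunk = values[start:start + window]
        if any(abs(v - mu) / sigma > threshold for v in chunk):
            continue  # contains a point outlier, so not purely collective
        # The mean of `window` independent points has std sigma / sqrt(window).
        z = (sum(chunk) / window - mu) / (sigma / sqrt(window))
        if abs(z) > threshold:
            hits.append((start, start + window - 1))
    return hits


# Hypothetical sample series (made-up values, not taken from the slides).
spike = [200, 201, 199, 200, 202, 198, 200, 201, 260, 199, 200, 201]
drift = [200, 201, 199, 200, 202, 198, 203, 203, 203, 203, 199, 200, 201, 200]

print(global_outliers(spike))                           # single extreme point
print(global_outliers(drift))                           # no single extreme point
print(collective_outliers(drift, mu=200.0, sigma=1.5))  # abnormal run of points
```

With these made-up values, the spike series yields one global outlier (index 8), while the drift series yields no global outlier but one collective window (indices 6 to 9).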
Example: Remote Sensing Data
Ø Spatial-temporal anomaly
[Figure: values in Kelvin plotted against date (yyyy-mm-dd)]
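A contextual outlier in a dated series like this one can be sketched by comparing each reading with others from the same context, here the calendar month. The sketch below is only an assumption about how such a check might look: the function name, the fixed tolerance `tol` (in Kelvin) and the readings are hypothetical and are not taken from the figure.

```python
from collections import defaultdict
from datetime import date
from statistics import median


def contextual_outliers(readings, tol):
    """Readings that deviate by more than `tol` from the median of
    all readings taken in the same calendar month (the context)."""
    by_month = defaultdict(list)
    for day, kelvin in readings:
        by_month[date.fromisoformat(day).month].append(kelvin)
    baseline = {month: median(vals) for month, vals in by_month.items()}
    return [
        (day, kelvin)
        for day, kelvin in readings
        if abs(kelvin - baseline[date.fromisoformat(day).month]) > tol
    ]


# Hypothetical readings (made-up values): cold Januaries, warm Julys.
readings = [
    ("1998-01-15", 182.0), ("1999-01-15", 180.0),
    ("2000-01-15", 181.0), ("2001-01-15", 221.0),
    ("1998-07-15", 222.0), ("1999-07-15", 220.0),
    ("2000-07-15", 221.0), ("2001-07-15", 219.0),
]

print(contextual_outliers(readings, tol=10.0))
```

The value 221.0 lies well inside the overall range of the series, so a global check would not flag it. It is abnormal only for January, which is what makes it contextual.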
Example: Wind Farm
Ø Fault prediction
Ø Fault diagnosis
Example: Solar Farm (1)
Ø Hierarchical anomaly
[Figure: PV Panel, PV String, Combiner Box, Inverter, Utility Grid]
Example: Solar Farm (2)
Ø Collaborative fault detection
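One plausible reading of collaborative fault detection is that neighbouring units act as references for each other: all PV strings see similar weather, so a string is suspicious only when it falls well below its peers at the same moment. The sketch below assumes that reading. The function name, the `ratio` parameter, the string identifiers and the current values are all hypothetical.

```python
from statistics import median


def underperforming_strings(currents, ratio=0.8):
    """Names of strings whose current is below `ratio` times the
    median current of all *other* strings measured at the same time."""
    if len(currents) < 2:
        return []  # no peers to compare against
    flagged = []
    for name, value in currents.items():
        peers = [v for other, v in currents.items() if other != name]
        if value < ratio * median(peers):
            flagged.append(name)
    return flagged


# Hypothetical string currents in amperes (made-up values).
sunny = {"s1": 8.1, "s2": 8.0, "s3": 7.9, "s4": 5.2, "s5": 8.2}
cloudy = {"s1": 3.0, "s2": 3.1, "s3": 2.9}

print(underperforming_strings(sunny))   # one string lags its peers
print(underperforming_strings(cloudy))  # all low together, so none flagged
```

Comparing against peers rather than a fixed threshold means a cloudy interval, where every string drops together, is not reported as a fault.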
Summary: Outlier Analysis
Ø Types of outliers/anomalies
• Global, contextual, collective
Ø Methods
• Unsupervised, (semi-)supervised, context, structure
Ø Interpretation
• Errors or significant events