PPT Presentation
DATA WAREHOUSING AND DATA MINING (PEC-IT602B)
Even Semester 2024
Outlier analysis
PRESENTED BY-
Introduction to Outlier Analysis
Outlier analysis involves the identification and examination of data points
that significantly differ from the majority of the dataset. Understanding
and addressing outliers is crucial in data analysis and decision-making.
Types of Outliers
1 Data Integrity
Outlier detection ensures the accuracy and reliability of data stored in warehouses,
thereby maintaining data integrity.
Data Preprocessing
Applying normalization and transformation techniques to prepare the data for
statistical analysis.
Identification Techniques
Using measures like median absolute deviation to identify and label outliers
within the dataset.
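The median absolute deviation (MAD) approach can be sketched as follows. The function name, the sample `readings`, the scaling constant 0.6745, and the cutoff of 3.5 are assumptions: the constant and cutoff follow a commonly used "modified z-score" convention and are not taken from the slides.

```python
from statistics import median

def mad_outliers(values, threshold=3.5):
    """Return (value, modified z-score) pairs whose score exceeds the threshold."""
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        return []  # at least half the values equal the median; score is undefined
    # 0.6745 rescales the MAD so the score is comparable to a z-score
    # when the data are roughly normal
    scores = [0.6745 * (v - med) / mad for v in values]
    return [(v, s) for v, s in zip(values, scores) if abs(s) > threshold]

# Hypothetical sensor readings with one suspicious value
readings = [10.1, 9.8, 10.3, 10.0, 9.9, 10.2, 25.0]
flagged = mad_outliers(readings)
flagged_values = [v for v, _ in flagged]
```

Because both the center (median) and the spread (MAD) are robust statistics, the single extreme reading does not inflate the spread estimate the way it would inflate a standard deviation.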
3 One-class SVM
Assumes that most of the data belongs to a single class, learns a boundary
around that class, and flags observations that fall outside it as outliers.
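A minimal sketch of this idea using scikit-learn's `OneClassSVM`, assuming that library is installed. The training points, the candidate points, and the parameter values (`nu=0.05`, RBF kernel) are illustrative assumptions, not settings given in the slides.

```python
import random
from sklearn.svm import OneClassSVM

rng = random.Random(0)
# Hypothetical "normal" data: 200 two-dimensional points clustered around the origin
train = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(200)]

# nu is an upper bound on the fraction of training points treated as outliers
model = OneClassSVM(kernel="rbf", gamma="scale", nu=0.05)
model.fit(train)

# One point near the cluster and one far away from it
candidates = [[0.1, -0.2], [8.0, 8.0]]
labels = model.predict(candidates)  # +1 = inlier, -1 = outlier
far_point_label = int(labels[1])
```

The model is trained only on data assumed to be mostly normal; no labeled outliers are needed, which suits settings where anomalies are rare or unknown in advance.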
Challenges and Limitations in Outlier Analysis
Noisy Data
Dealing with noisy and irrelevant data points that may be mistaken as
outliers, posing challenges in accurate identification.
High-Dimensional Data
Challenges in detecting outliers in datasets with numerous dimensions, as it
increases the complexity of analysis.
Scalability
The need for efficient outlier detection methods that can handle large and
dynamic datasets without significant performance degradation.
Applications of Outlier Analysis in Data Warehousing and Data Mining
Financial Fraud Detection
Identifying unusual patterns in transaction data to detect potential
fraudulent activities.
2 Collaborative Efforts
Highlighting the need for cross-functional collaboration to effectively address outliers
in complex datasets.
3 Value of Insights
Underlining the potential business value derived from uncovering valuable insights
through outlier analysis processes.
THANK YOU