Professional Documents
Culture Documents
Unit – II set, Mining frequent item-sets using vertical data, Evaluation of Association
Patterns, From Association Analysis to Correlation Analysis
Self-Study: Sequential Pattern Mining Algorithms, Pattern mining in multi-
level, multi-dimensional space Data Integration: different types of digital data
and their sources, ETL (extract transform and load)Tools
Classification and Prediction
Classification: Decision Tree Classifier, Lazy Learner: KNN Classifier,
Unit –
Classifier Accuracy Measures, techniques for Evaluating Classifier Accuracy.
III
Prediction: Linear, Non-Linear Regression.
Self-Study: Case-Based Reasoning, Associative Classification
Clustering and Outlier Detection
Cluster Analysis:
Categories of Clustering methods, Different Types of Clusters, Partitioning
methods: k-Means, k-Medoids; Hierarchical Clustering Methods:
Unit – Agglomerations Grid Based Methods: STING, Cluster Evaluation
IV Outlier Analysis:
Types of outlier, Proximity based approach: distance based, Density based
approach
Self-Study: Grid Based Methods: CLIQUE, Density based Clustering: