Professional Documents
Culture Documents
Methods can be viewed from different perspectives, data mining methods include:
Market Basket Analysis
Classification analysis
Clustering analysis
Regression of various forms
AI:
Artificial Neural Network (ANN)
Rule induction (decision trees)
Genetic algorithms (supplement)
Techniques
5
Statistical
Market-Basket Analysis - find groups of items
Memory-Based Reasoning- case based
Cluster Detection - undirected (quantitative)
Artificial Intelligence
Link Analysis - MCI’s Friends & Family
Decision Trees, Rule Induction - production rule
Neural Networks - automatic pattern detection
Genetic Algorithms - keep best parameters
Models
6
Regression: Y = a + bX
Classification: assign new record to class
Predictive: assign value to new record
Clustering: groups for data
Time-series: assign future value
Links: patterns in data
Fitting
7
Underfitting: not enough detail
leave out important variables
Overfitting: too much detail
memorizes training set, but doesn’t help with
new data
data set too small
redundancy in data
Comparison of Features
8