Why do we mine?
What do we mine?
How do we mine?
What is Data Mining
➤ Economics
➤ Unprecedented affordability of MIPS and MB
➤ Parallel computing
➤ Enormous amounts of data can be processed
➤ Defining a study
➤ Supervised: articulating a goal, choosing a dependent
variable or output, and specifying data fields
➤ Unsupervised: grouping similar types of data or identifying
exceptions
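The two kinds of study above can be contrasted with a minimal sketch. It is an illustration only, not a method taken from the course: the records, field names, and the choice of a 1-nearest-neighbour rule and a standard-deviation threshold are all assumptions made for the example.

```python
from statistics import mean, pstdev

# Hypothetical labeled records: ((monthly_spend, visits), label).
# The label plays the role of the dependent variable; all values are made up.
labeled = [
    ((20.0, 1), "low"),
    ((25.0, 2), "low"),
    ((90.0, 8), "high"),
    ((110.0, 9), "high"),
]

def predict_label(record, training):
    """Supervised: return the label of the closest training record (1-nearest neighbour)."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    _, label = min(training, key=lambda pair: sq_dist(pair[0], record))
    return label

def find_exceptions(values, threshold=2.0):
    """Unsupervised: flag values more than `threshold` standard deviations from the mean."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        return []  # all values identical, so nothing stands out
    return [v for v in values if abs(v - mu) / sigma > threshold]
```

In the supervised function the output field is named in advance and learned from labeled history; in the unsupervised function no output is specified and the data alone determine what counts as an exception.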
CS753 Dr. Mary Ann Robbert
Steps in Data Mining (cont'd)
➤ Prediction
➤ Choose the best outcome based on historical data
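As a rough sketch of prediction from historical data, the snippet below picks the action with the highest past success rate. The function name `best_outcome`, the (action, succeeded) record layout, and the sample history are hypothetical; a real study would use a proper model and far more data.

```python
from collections import defaultdict

def best_outcome(history):
    """Return the action with the highest historical success rate.

    `history` is an iterable of (action, succeeded) pairs.
    """
    counts = defaultdict(lambda: [0, 0])  # action -> [successes, trials]
    for action, succeeded in history:
        counts[action][1] += 1
        if succeeded:
            counts[action][0] += 1
    if not counts:
        raise ValueError("history is empty")
    return max(counts, key=lambda a: counts[a][0] / counts[a][1])

# Hypothetical campaign history (made-up values).
history = [
    ("email", True), ("email", False),
    ("phone", True), ("phone", True),
    ("mail", False),
]
```

Calling `best_outcome(history)` on this sample selects the action whose past trials succeeded most often.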
➤ Genetic Algorithms
➤ Neural Nets
➤ Agents
➤ Statistics
➤ Visualization
➤ Example
➤ Distinguishing different chemical
compounds
➤ Detecting anomalies in human tissue
that may signify disease
➤ Reading handwriting
➤ Detecting fraud in credit card use
➤ Tasks
➤ automating repetitive tasks
➤ finding and filtering information
➤ summarizing complex data
➤ SAS, SPSS
➤ Pros - Established technology
➤ Cons - Needs assumptions, nominal
variable handling, management
acceptance?
➤ http://www.kdnuggets.com/software/
➤ http://www.attar.com/ download
➤ http://www.cs.bham.ac.uk/~anp/software.html software listing
➤ http://www.rulequest.com/gritbot-info.html