Professional Documents
Culture Documents
AND
DATA MINING
ICS426: Introduction
Course Overview
Introduction
Data Preporcessing
DW and OLAP
Data Mining
ICS426: Introduction
Motivation
Data flood
ICS426: Introduction
Motivation
ICS426: Introduction
ICS426: Introduction
Data mining
ICS426: Introduction
ICS426: Introduction
ICS426: Introduction
Information
Data
February 22, 2009
ICS426: Introduction
10
Evolution
ICS426: Introduction
11
Walmart -- 24 Terabytes
Weather images
ICS426: Introduction
12
ICS426: Introduction
13
Data Warehouse
A data warehouse is a
subject-oriented
integrated
time-varying
non-volatile
ICS426: Introduction
14
Extraction
Cleansing
Data Warehouse
Engine
Purchased
Data
Legacy
Data
February 22, 2009
Analyze
Query
Metadata Repository
ICS426: Introduction
15
ICS426: Introduction
16
Decision Support
ICS426: Introduction
17
ICS426: Introduction
18
Fraud detection
ICS426: Introduction
19
What
Whatproduct
productpromprom-otions
-otionshave
havethe
thebiggest
biggest
impact
impacton
onrevenue?
revenue?
Who
Whoare
aremy
mycustomers
customers
and
what
products
and what products
are
arethey
theybuying?
buying?
Which
Whichcustomers
customers
are
most
are mostlikely
likelyto
togo
go
to
tothe
thecompetition
competition??
What
Whatimpact
impactwill
will
new
newproducts/services
products/services
have
haveon
onrevenue
revenue
and
andmargins?
margins?
ICS426: Introduction
20
Alternative names
Many Definitions
ICS426: Introduction
21
ICS426: Introduction
22
Predictive:
Regression
Classification
Collaborative Filtering
Descriptive:
Deviation detection
ICS426: Introduction
23
Applications
Targeted marketing:
ICS426: Introduction
24
Applications
ICS426: Introduction
25
The course
(4)
DS
DS
OLAP
(2)
(3)
DP
DW
DM
Association
DS
(5)
Classification (6)
Clustering
(7)
DS = Data source
DW = Data warehouse
DM = Data Mining
DP = Data processing
February 22, 2009
ICS426: Introduction
26
END
ICS426: Introduction
27