• Data Warehouse
• Benefits of Data Warehouse
• Characteristics of a Data Warehouse
• Data Mining
• Kinds of Data That Can Be Mined and
Discovered
CT107-3.-2 Enterprise Systems Data Warehousing & Data Mining Slide ‹2› of 9
Learning Outcomes
Data Mining
Type of data
Data Warehouse
• Main Difference
– Database: designed and optimized to store data.
– Data Warehouse: designed and optimized to respond to
analysis questions that are critical for a business.
• Sales performance
– Use the data to determine sales profitability
and productivity for all territories and regions;
can obtain and analyze results by geography,
product, sales group, or individual.
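A minimal sketch of the kind of roll-up described above, assuming a small in-memory list of sales rows. The field names (`region`, `product`, `revenue`, `cost`) and the figures are hypothetical and only for illustration; a real warehouse would typically answer this with a query over its own tables.

```python
from collections import defaultdict

# Hypothetical sales rows; field names and values are illustrative only.
sales = [
    {"region": "North", "product": "A", "revenue": 120.0, "cost": 80.0},
    {"region": "North", "product": "B", "revenue": 200.0, "cost": 150.0},
    {"region": "South", "product": "A", "revenue": 90.0, "cost": 70.0},
]

def profit_by(rows, key):
    """Sum profit (revenue minus cost) for each distinct value of `key`."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key]] += row["revenue"] - row["cost"]
    return dict(totals)

by_region = profit_by(sales, "region")    # profit grouped by geography
by_product = profit_by(sales, "product")  # profit grouped by product
print(by_region)
print(by_product)
```

The same function can group by any other column, such as a sales group or an individual, provided the rows carry that field.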
• Time variant
– Data are kept for many years so they can be
used for identifying trends, forecasting, and
making comparisons over time.
• Relational
– Uses a relational structure.
• Web-based
– Designed to provide an efficient computing
environment for Web-based applications.
• Real-time
– It is possible to arrange for real-time
capabilities.
• Characterization
• Discrimination
• Association analysis
• Classification
• Prediction
• Clustering
• Outlier analysis
• Evolution and deviation analysis
Characterization
– Target Class
– Contrasting Class
• Association rules.
• Commonly used for market basket analysis.
• It studies the frequency of items occurring
together in transactional databases, based on:
– Support: identifies the frequent item sets.
– Confidence: conditional probability that an
item appears in a transaction when another
item appears.
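The two measures above can be sketched in a few lines. This is an illustrative example on a hypothetical toy basket data set, not a full frequent-itemset algorithm; the function names `support` and `confidence` are chosen here for clarity.

```python
# Hypothetical market-basket transactions, each a set of purchased items.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)

def confidence(antecedent, consequent, transactions):
    """Estimate P(consequent | antecedent) as support(A and C) / support(A)."""
    base = support(antecedent, transactions)
    if base == 0:
        return 0.0  # the rule never applies, so report zero confidence
    both = set(antecedent) | set(consequent)
    return support(both, transactions) / base

print(support({"bread", "milk"}, transactions))       # share of baskets with both
print(confidence({"bread"}, {"milk"}, transactions))  # rule: bread -> milk
```

In practice, a minimum support threshold is applied first to keep only the frequent item sets, and confidence is then computed for rules built from those sets.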
Summary of Main Teaching Points
Question and Answer Session
Q&A
What we will cover next