Data Warehousing, Access, Analysis,
Mining, and Visualization
MSS foundation
Many new concepts
Object-oriented databases
Intelligent databases
Data warehouse
Data mining
Online analytical processing
Multidimensionality
Internet / Intranet / Web
Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition
Copyright 2001, Prentice Hall, Upper Saddle River, NJ
Data Warehousing, Access, Analysis,
and Visualization
What to do with all the data that organizations collect,
store, and use?
(Information overload!)
Solution
Data warehousing
Data access
Data mining
Online analytical processing (OLAP)
Data visualization
Data sources
The Nature and Sources of Data
Data: Raw
DSS Data Items
Documents
Pictures
Maps
Sound
Animation
Video
Data Sources
Internal
External
Personal
Data Collection, Problems, and
Quality
Data Quality Issues in Data
Warehousing
Uniformity
Version
Completeness check
Conformity check
Genealogy check (drill down)
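The completeness and conformity checks listed above can be illustrated with a minimal Python sketch. The record layout, the field names, and the set of allowed region codes are all hypothetical, chosen only to show the idea; a real warehouse load would define its own rules.

```python
# Hypothetical records being loaded into a warehouse; field names are assumptions.
REQUIRED_FIELDS = ("customer_id", "region", "amount")
ALLOWED_REGIONS = {"NORTH", "SOUTH", "EAST", "WEST"}


def completeness_errors(record):
    """Return the required fields that are missing or empty."""
    return [f for f in REQUIRED_FIELDS if record.get(f) in (None, "")]


def conformity_errors(record):
    """Return descriptions of values that break the expected code set or range."""
    errors = []
    region = record.get("region")
    if region is not None and region not in ALLOWED_REGIONS:
        errors.append(f"region {region!r} not in allowed set")
    amount = record.get("amount")
    if amount is not None and (not isinstance(amount, (int, float)) or amount < 0):
        errors.append(f"amount {amount!r} is not a non-negative number")
    return errors


records = [
    {"customer_id": 1, "region": "NORTH", "amount": 120.0},
    {"customer_id": 2, "region": "north", "amount": 75.5},  # nonconforming code
    {"customer_id": 3, "region": "EAST", "amount": None},   # incomplete
]

# Map each customer_id to the list of problems found for that record.
report = {
    r["customer_id"]: completeness_errors(r) + conformity_errors(r)
    for r in records
}
```

Records with an empty problem list pass both checks; the others would be corrected or rejected before loading.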
The Internet and
Commercial Database Services
The Internet and
Commercial Database Services
Use Web Browsers to
Database Management Systems in DSS
Database Organization and
Structure
Relational databases
Hierarchical databases
Network databases
Object-oriented databases
Multimedia-based databases
Document-based databases
Intelligent databases
Data Warehousing
Physical separation of operational and decision support
environments
Purpose: to establish a data repository making operational data
accessible
Transforms operational data to relational form
Only data needed for decision support come from the TPS
Data are transformed and integrated into a consistent structure
Data warehousing (information warehousing): solves the data
access problem
End users perform ad hoc queries, reporting, analysis, and
visualization
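The flow described above (operational data kept separate, only the needed fields extracted, then transformed into a consistent structure) can be sketched in Python with the standard sqlite3 module. The table names, columns, and region codes are hypothetical, and two in-memory databases merely stand in for the separate environments.

```python
import sqlite3

# Two in-memory databases stand in for the physically separate
# operational (TPS) and decision support environments.
operational = sqlite3.connect(":memory:")
warehouse = sqlite3.connect(":memory:")

operational.execute(
    "CREATE TABLE orders (id INTEGER, cust TEXT, region TEXT, cents INTEGER, clerk TEXT)"
)
operational.executemany(
    "INSERT INTO orders VALUES (?, ?, ?, ?, ?)",
    [
        (1, "Acme", "n", 125000, "jd"),
        (2, "Bolt", "North", 80000, "mk"),
        (3, "Acme", "S", 40000, "jd"),
    ],
)

warehouse.execute(
    "CREATE TABLE sales_fact (order_id INTEGER, customer TEXT, region TEXT, amount REAL)"
)

# Inconsistent operational codes are mapped to one consistent form.
REGION_CODES = {"n": "NORTH", "north": "NORTH", "s": "SOUTH", "south": "SOUTH"}


def transform(row):
    """Keep only decision support fields, unify region codes, convert cents to dollars."""
    order_id, cust, region, cents, _clerk = row  # clerk is dropped on purpose
    return (order_id, cust, REGION_CODES[region.lower()], cents / 100.0)


rows = operational.execute("SELECT id, cust, region, cents, clerk FROM orders")
warehouse.executemany(
    "INSERT INTO sales_fact VALUES (?, ?, ?, ?)", (transform(r) for r in rows)
)
warehouse.commit()

# An ad hoc query now runs against the warehouse, not the operational system.
totals = warehouse.execute(
    "SELECT region, SUM(amount) FROM sales_fact GROUP BY region ORDER BY region"
).fetchall()
```

The final query touches only the warehouse copy, which is the sense in which warehousing insulates operational databases from ad hoc processing.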
Data Warehousing Benefits
Increase in knowledge worker productivity
Supports all decision makers’ data requirements
Provides ready access to critical data
Insulates operational databases from ad hoc processing
Provides high-level summary information
Provides drill down capabilities
Yields
– Improved business knowledge
– Competitive advantage
– Enhanced customer service and satisfaction
– Easier decision making
– Streamlined business processes
Data Warehouse Architecture
and Process
Two-tier architecture
Three-tier architecture
Data Warehouse Components
DW Suitability
Characteristics of Data
Warehousing
1. Subject-oriented data
2. Integrated data
3. Time-variant data
4. Non-volatile data
OLAP: Data Access and Mining,
Querying, and Analysis
OLAP Activities
Generating queries
OLAP uses the data warehouse
and a set of tools, usually with
multidimensional capabilities
Query tools
Spreadsheets
Using SQL for Querying
Example:
SELECT Name, Salary
FROM Employees
WHERE Salary > 2000;
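To try the query, the following Python sketch builds a small Employees table in an in-memory SQLite database and runs it. The employee rows are invented for illustration, and an ORDER BY clause is added here only to make the result order predictable.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Employees (Name TEXT, Salary REAL)")

# Hypothetical sample rows; only salaries strictly above 2000 should be returned.
conn.executemany(
    "INSERT INTO Employees VALUES (?, ?)",
    [("Ada", 3200), ("Ben", 1800), ("Cy", 2000), ("Dee", 2600)],
)

high_paid = conn.execute(
    "SELECT Name, Salary FROM Employees WHERE Salary > 2000 ORDER BY Name"
).fetchall()

for name, salary in high_paid:
    print(name, salary)
```

Note that the employee earning exactly 2000 is excluded, because the condition uses a strict greater-than comparison.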
Data Mining for
Major Data Mining
Characteristics and Objectives
Data are often buried deep
Client/server architecture
Sophisticated new tools, including advanced visualization
tools, help to extract the information “ore”
End-user miners with little or no programming skill are
empowered by data drills and other power query tools
Often involves finding unexpected results
Tools are easily combined with spreadsheets, etc.
Parallel processing for data mining
Data Mining Application Areas
Marketing
Banking
Retailing and sales
Manufacturing and production
Brokerage and securities trading
Insurance
Computer hardware and software
Government and defense
Airlines
Health care
Broadcasting
Law enforcement
Intelligent Data Mining
Use intelligent search to discover information within data
warehouses that queries and reports cannot effectively reveal
Main Tools Used in
Intelligent Data Mining
Case-based Reasoning
Neural Computing
Intelligent Agents
Other Tools
– Decision trees
– Rule induction
– Data visualization
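As one concrete illustration of rule induction, the Python sketch below applies a very simple scheme often called 1R (one rule): for each attribute it maps every value to its majority class, then keeps the attribute whose rules make the fewest errors. This is a stand-in chosen for brevity, not necessarily the method any particular data mining tool uses, and the customer records are invented.

```python
from collections import Counter, defaultdict

# Hypothetical customer records: (attributes, responded_to_offer)
DATA = [
    ({"age": "young", "income": "high"}, "yes"),
    ({"age": "young", "income": "low"}, "no"),
    ({"age": "old", "income": "high"}, "yes"),
    ({"age": "old", "income": "low"}, "no"),
    ({"age": "young", "income": "high"}, "yes"),
    ({"age": "old", "income": "low"}, "yes"),
]


def one_rule(data):
    """Return (attribute, value -> class rules, training errors) for the best attribute."""
    best = None
    for attr in data[0][0]:
        # Count class labels separately for each value of this attribute.
        counts = defaultdict(Counter)
        for features, label in data:
            counts[features[attr]][label] += 1
        # Each value predicts its majority class.
        rules = {value: c.most_common(1)[0][0] for value, c in counts.items()}
        # Errors are the records that disagree with their value's majority class.
        errors = sum(sum(c.values()) - max(c.values()) for c in counts.values())
        if best is None or errors < best[2]:
            best = (attr, rules, errors)
    return best


attribute, rules, errors = one_rule(DATA)
print(attribute, rules, errors)
```

On this toy data the induced rules read as "if income is high then yes, if income is low then no", which misclassifies one of the six records.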
Data Visualization and
Multidimensionality
Data Visualization Technologies
Digital images
Geographic information systems
Graphical user interfaces
Multidimensions
Tables and graphs
Virtual reality
Presentations
Animation
Multidimensionality
3-D + Spreadsheets (OLAP has this)
Data can be organized the way managers like to see them, rather
than the way that systems analysts do
Different presentations of the same data can be arranged easily
and quickly
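The idea of arranging different presentations of the same data can be sketched in a few lines of Python. The sales facts and the three dimension names (product, region, quarter) are hypothetical; the point is only that one set of facts yields several two-dimensional views.

```python
from collections import defaultdict

# Hypothetical sales facts: (product, region, quarter, units sold)
SALES = [
    ("widgets", "east", "Q1", 100),
    ("widgets", "west", "Q1", 150),
    ("widgets", "east", "Q2", 120),
    ("gadgets", "east", "Q1", 80),
    ("gadgets", "west", "Q2", 60),
]
DIMENSIONS = ("product", "region", "quarter")


def view(facts, rows, cols):
    """Summarize units by two chosen dimensions, summing over the remaining one."""
    r, c = DIMENSIONS.index(rows), DIMENSIONS.index(cols)
    table = defaultdict(lambda: defaultdict(int))
    for fact in facts:
        table[fact[r]][fact[c]] += fact[3]
    return {row: dict(cells) for row, cells in table.items()}


# Two presentations of the same underlying data.
by_product_region = view(SALES, "product", "region")
by_region_quarter = view(SALES, "region", "quarter")
```

Switching views requires no change to the stored facts, only a different choice of row and column dimensions.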
Multidimensionality Limitations
Geographic Information
Systems (GIS)
A computer-based system for capturing, storing,
checking, integrating, manipulating, and displaying
data using digitized maps
Spatially-oriented databases
Useful in marketing, sales, voting estimation, and planned
product distribution
Available via the Web
Can be used with GPS
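A small Python sketch of one spatial query a GIS might support when combined with GPS: finding the store nearest to a given position. It uses the standard haversine great-circle formula with a mean Earth radius of 6371 km; the store names and coordinates are hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, an approximation


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometers between two latitude/longitude points."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


# Hypothetical store locations as (latitude, longitude) in degrees.
STORES = {
    "downtown": (40.7128, -74.0060),
    "uptown": (40.8610, -73.8900),
}


def nearest_store(lat, lon):
    """Return the name of the store closest to the given GPS position."""
    return min(STORES, key=lambda name: haversine_km(lat, lon, *STORES[name]))


print(nearest_store(40.72, -74.0))
```

A production GIS would use spatial indexes and map projections rather than a linear scan, but the distance calculation is the same in spirit.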
Virtual Reality
Business Intelligence
on the Web
Can capture and analyze data from the Web
Tools deployed on the Web
Summary
Data for decision making come from internal and external
sources
The database management system is one of the major
components of most management support systems
Familiarity with the latest developments is critical
Data contain a gold mine of information if organizations can
dig it out
Organizations are warehousing and mining data
Multidimensional analysis tools and new enterprise-wide
system architectures are useful
OLAP tools are also useful
Summary (cont’d.)