Professional Documents
Culture Documents
By
Department of CSE&S
Balochistan University of Engineering and Technology
Khuzdar
2
Data Warehousing & Mining 3
Lecture # 3-4
Introduction & Background
Why a Data
Warehousing?
The Need for a Data
Warehouse
Historical Overview
Data Warehouse
Definition
Bill Inmon’s view of a
DWH
What is Data
Warehousing?
How is different?
Comparison of SDLC &
CLDS
Data Warehouse Vs.
OLTP
Architectural view
Reading Materials
8
How is Different?
Fundamentally different
Business user
needs info
Answers result
User requests
in more questions
IT people
?
Business user
may get answers
IT people do
system analysis
and design
IT people
send reports to IT people
business user create reports
9
Data Warehousing & Mining
(CLO-1, PLO-1)
How is Different?
• Does not follows the traditional development
model
Requirements
Program
Classical SDLC
Requirements gathering
Analysis
Design
Programming
Testing
Integration
Implementation
Data Warehousing & Mining 10
(CLO-1, PLO-1)
How is Different?
• Does not follows the traditional development
model
DWH
Program
Requirements
DWH SDLC (CLDS)
Implement warehouse
Integrate data
Test for biasness
Program w.r.t data
Design DSS system
Analyze results
Understand requirement
11
Data Warehouse Vs. OLTP
DWH
Select balance, age, sal, gender from
customer_table, tx_table
Where age between (30 and 40) and
Education = ‘graduate’ and
CustID.customer_table =
Customer_ID.tx_table;
OLTP DWH
Primary key used Primary key NOT used
No concept of Primary Index Primary index used
Few rows returned Many rows returned
May use a single table Uses multiple tables
High selectivity of query Low selectivity of query
Indexing on primary key Indexing on primary index
(unique) (non-unique)
Semistructured
MOLAP
Sources Query/Reporting
www data
Meta
Data
Extract
Data
Analysis
Archived
Transform
Load Warehouse
data
(ETL) ROLAP Business
IT Data Mining
Users
Users
Operational
Data Bases
Data sources Data Marts Tools
Business Users