You are on page 1of 18

‫ن‬ ‫ش‬ ‫ن‬

‫ہللا کے ام سے روع ج و ب ڑا مہرب اننہای ت رحم‬


‫‪Data Warehousing & Mining‬‬ ‫‪1‬‬
‫کرے واال ے‬
Data Warehousing & Mining
(CS-411)
Credit Hours: 3 + 1

By

Engr.Shakeel Ahmed Shaikh


Lecturer
Email: shakeel.sheikh@buetk.edu.pk

Department of CSE&S
Balochistan University of Engineering and Technology
Khuzdar
2
Data Warehousing & Mining 3
Lecture # 3-4
Introduction & Background

Data Warehousing & Mining 4


Previous Lecture Review

 Why a Data
Warehousing?
 The Need for a Data
Warehouse
 Historical Overview
 Data Warehouse
Definition
 Bill Inmon’s view of a
DWH

Data Warehousing & Mining


5
(CLO-1, PLO-1)
Today’s Lecture Outlines

 What is Data
Warehousing?
 How is different?
 Comparison of SDLC &
CLDS
 Data Warehouse Vs.
OLTP
 Architectural view
 Reading Materials

Data Warehousing & Mining 6


(CLO-1, PLO-1)
What is a Data Warehouse ?
It is a blend of many technologies, the basic
concept being:

 Take all data from different operational systems.


 If necessary, add relevant data from industry.

 Transform all data and bring into a uniform format.

 Integrate all data as a single entity.

Data Warehousing & Mining


7
(CLO-1, PLO-1)
What is a Data Warehouse ? (cont..)
It is a blend of many technologies, the basic
concept being:

Store data in a format supporting easy access for


decision support.
 Create performance enhancing indices.

 Implement performance enhancement joins.

 Run ad-hoc queries with low selectivity.

8
How is Different?
 Fundamentally different
Business user
needs info

Answers result
User requests
in more questions
IT people

?
Business user
may get answers
 IT people do
system analysis
and design

IT people
send reports to IT people
business user create reports

9
Data Warehousing & Mining
(CLO-1, PLO-1)
How is Different?
• Does not follows the traditional development
model
Requirements

 Program

Classical SDLC

 Requirements gathering
 Analysis
 Design
 Programming
 Testing
 Integration
 Implementation
Data Warehousing & Mining 10
(CLO-1, PLO-1)
How is Different?
• Does not follows the traditional development
model
DWH

Program

 Requirements
DWH SDLC (CLDS)

 Implement warehouse
 Integrate data
 Test for biasness
 Program w.r.t data
 Design DSS system
 Analyze results
 Understand requirement
11
Data Warehouse Vs. OLTP

OLTP (On Line Transaction Processing)


Select tx_date, balance from tx_table
Where account_ID = 23876;

Data Warehousing & Mining


12
(CLO-1, PLO-1)
Data Warehouse Vs. OLTP

DWH
Select balance, age, sal, gender from
customer_table, tx_table
Where age between (30 and 40) and
Education = ‘graduate’ and
CustID.customer_table =
Customer_ID.tx_table;

Data Warehousing & Mining


13
(CLO-1, PLO-1)
Data Warehouse Vs. OLTP

OLTP DWH
Primary key used Primary key NOT used
No concept of Primary Index Primary index used
Few rows returned Many rows returned
May use a single table Uses multiple tables
High selectivity of query Low selectivity of query
Indexing on primary key Indexing on primary index
(unique) (non-unique)

Data Warehousing & Mining


14
(CLO-1, PLO-1)
Putting the pieces together

Data Data Warehouse Server OLAP Servers Clients


(Tier 0) (Tier 1) (Tier 2) (Tier 3)


Semistructured
MOLAP
Sources Query/Reporting

www data
Meta
Data 
 Extract
Data 


 Analysis



 Archived
Transform
Load Warehouse 
 data
(ETL) ROLAP Business
IT Data Mining
Users
Users
Operational
Data Bases  
Data sources Data Marts  Tools
Business Users

Data Warehousing & Mining


15
(CLO-1, PLO-1)
Reading Materials

• Data Ware Housing Fundamentals: A


Comprehensive Guide for I.T Professionals by
Paulraj Ponnjah
• W. H. Inmon, Building the Data Warehouse
(Second Edition), John Wiley & Sons Inc., NY.

Data Warehousing & Mining


16
(CLO-1, PLO-1)
Data Warehousing & Mining
17
(CLO-1, PLO-1)
Data Warehousing & Mining
18
(CLO-1, PLO-1)

You might also like