Professional Documents
Culture Documents
Lecture 2
Lecture 2
Data Warehousing
Lecture-2
Introduction and Background
Ahsan Abdullah
Assoc. Prof. & Head
Center for Agro-Informatics Research
www.nu.edu.pk/cairindex.asp
FAST National University of Computers & Emerging Sciences, Islamabad
1
DWH-Ahsan Abdullah
Introduction and Background
2
DWH-Ahsan Abdullah
Why a Data Warehouse (DWH)?
Data recording and storage is growing.
3
DWH-Ahsan Abdullah
Reason-1: Why a Data Warehouse?
Data Sets are growing.
How Much Data is that?
1 MB 220 or 106 bytes Small novel – 31/2 Disk
Paper rims that could fill the back of
1 GB 230 or 109 bytes
a pickup van
50,000 trees chopped and converted
1 TB 240 or 1012 bytes
into paper and printed
Academic research libraries across
2 PB 1 PB = 250 or 1015 bytes
the U.S.
All words ever spoken by human
5 EB 1 EB = 260 or 1018 bytes
beings
4
DWH-Ahsan Abdullah
Reason-1: Why a Data Warehouse?
Size of Data Sets are going up .
Cost of data storage is coming down .
The amount of data average business collects
and stores is doubling every year
6
DWH-Ahsan Abdullah
Caution!
A Warehouse of Data
is NOT a
Data Warehouse
7
DWH-Ahsan Abdullah
Caution!
Size
is NOT
Everything
8
DWH-Ahsan Abdullah
Reason-2: Why a Data Warehouse?
9
DWH-Ahsan Abdullah
Reason-2: Why a Data Warehouse?
DBMS Approach
List of all items that were sold last
month?
What is happening?
What do you want to happen?
12
DWH-Ahsan Abdullah
What is a Data Warehouse?
13
DWH-Ahsan Abdullah
What is a Data Warehouse?
Complete repository
History
Transaction System
Ad-Hoc access
Knowledge workers
14
DWH-Ahsan Abdullah
What is a Data Warehouse?
Transaction System
Management Information System (MIS)
Could be typed sheets (NOT transaction
system)
Ad-Hoc access
Dose not have a certain access pattern.
Queries not known in advance.
Difficult to write SQL in advance.
Knowledge workers
Typically NOT IT literate (Executives, Analysts, Managers).
NOT clerical workers. 15
Subject
Oriented
Integrated
Time
Variant
Non
Volatile
16
DWH-Ahsan Abdullah