Professional Documents
Culture Documents
Cap 2
Cap 2
Objectives
2-2
Lesson Agenda
2-3
Characteristics of a Data Warehouse
2-4
OLTP Systems Versus Data Warehouses
2-5
Data Warehouse Architectures:
Basic Data Warehouse
Operational
system Analysis
Metadata
2-6
Data Warehouse Architectures:
Basic Data Warehouse with Staging Area
Operational
system Analysis
Metadata
2-7
Data Warehouse Architectures:
Basic Data Warehouse with Staging Area and Data Marts
Operational
system Analysis
Metadata
Sales
Purchasing
Summaries Raw data
Operational Staging Inventory
system area Reporting
2-8
Lesson Agenda
2-9
Data Warehouse Design
2 - 10
Data Warehouse: Design Phases
2 - 11
Data Warehouse Physical Design
Entities
Attributes
Unique
identifiers Indexes Materialized Dimensions
views
2 - 12
Data Warehouse Physical Structures
Materialized views (MVs) • Are query results that have been stored in
advance
• Are used transparently (similar to indexes)
and improve performance
2 - 13
Data Warehousing Objects
2 - 14
Characteristics of Fact Tables
2 - 15
Dimensions and Hierarchies
CUSTOMERS
PRODUCTS
#prod_id Unique identifier #cust_id
...
Fact table cust_last_name
SALES cust_city
cust_state_province
cust_id
Relationship prod_id
Hierarchy
...
TIMES CHANNELS
... PROMOTIONS ...
...
Dimension table Dimension table
Dimension table
2 - 17
Using Hierarchies for Drill
on Data and Aggregate Data
Market Hierarchy
Group
Region 1 Region 2
City 1 City 2
2 - 18
Data Warehousing Schemas
2 - 19
Schema Characteristics
• Star schema:
– It is characterized by one or more large fact tables and a
number of much smaller dimension tables.
– Each dimension table is joined to the fact table by using a
primary key to foreign key join.
• Snowflake schema:
– Dimension data is grouped into multiple tables instead of one
large table.
– The number of dimension tables are increased, requiring
more foreign key joins.
• Third normal form (3NF) schema:
– It is a classical relational-database model that minimizes
data redundancy through normalization.
2 - 20
Star Schema Model: Central Fact Table
and Denormalized Dimension Tables
Product Dimension Table Store Dimension Table
Product_id Store_id
Product_disc District_id
... ...
Sales Fact Table
Product_id
Store_id
Item_id
Day_id
Sales_amount
Sales_units
...
Time Dimension Table
Item Dimension Table
Day_id
Month_id Item_id
Year_id Item_desc
... ...
2 - 21
Star Dimensional Schema: Advantages
2 - 22
Snowflake Schema Model
Store Table
Product Table District Table
Store_id
Product_id District_id
Store_desc
Product_desc District_desc
District_id
Item_id
Store_id
Product_id
Week_id
Sales_amount
Sales_units
2 - 23
Snowflake Schema Model
2 - 24
Summary
2 - 25