Professional Documents
Culture Documents
• Volume
• Velocity
• Variety
5C
Clean
• Consistent
• Conformed -> across division
• Current
• Comprehensive -> all data they need
Datawarehouse:
• Integrated
• Subject oriented
• Time-variant
• Non volatile
Framework architecture:
1. The information architecture defines the business context necessary for successful BI
solutions to be built on a sustaining basis.
2. The data architecture defines the data along with the schemas, integration, transformations,
storage, and workflow
3. The technical architecture defines the technologies that are used to implement and support
a BI solution
4. The product architecture defines the products, their configurations, and how they are
interconnected
Information Architecture
1. Data Preparation
2. Data Franchising
Data Architecture
Ralph Kimball vs Bill Inmon (hub and spoke)
• Focus on business oriented data mart vs EDW as the core of architecture
• Top-Down design creating vs bottom-up business requirements
• ETL performs 5’C data vs subject oriented EDW with detail data
• Dimensional model for data marts vs 3NF for EDW
1. Additive
•Summarized by addition across all dimensions
• Common measures such as sales, cost, and profit
2. Semi-Additive
• Summarized by addition in some but not all dimensions such as time
• Periodic measurements such as account balances and inventory levels
3. Non-Additive
• Cannot be summarized by addition through any dimension
• Historical facts such as unit price for a sale
Datawarehouse Design
1. Top down approach
2. Bottom up approach
Methodological scenarios
1. Supply-driven approach
2. Demand-driven approach
Three different logical model that can be used to represent a multi dimensional data structure,
are:
1. The relational Model / Relational OLAP Systems(ROLAP) -> multidimensional (star
schema & snowflake)
2. Multidimensional Model / Multidimensional OLAP Systems(MOLAP)
3. Hybrid Model / Hybrid OLAP(HOLAP)
Views : to simplify
1. Primary -> corresponds to finest
2. Secondary -> aggregate
Temporal Scenarios
Conceptual design:
1. Yesterday for today
2. Today for yesterday
3. Today or yesterday
4. Today and yesterday
Dynamic hierarchies:
1. Type 1 : today for yesterday