You are on page 1of 4

3V:

• Volume
• Velocity
• Variety

5C
Clean
• Consistent
• Conformed -> across division
• Current
• Comprehensive -> all data they need

Datawarehouse:
• Integrated
• Subject oriented
• Time-variant
• Non volatile

Core building blocks:


Data integration -> Data warehousing -> Business Intelligence

Framework architecture:
1. The information architecture defines the business context necessary for successful BI
solutions to be built on a sustaining basis.
2. The data architecture defines the data along with the schemas, integration, transformations,
storage, and workflow
3. The technical architecture defines the technologies that are used to implement and support
a BI solution
4. The product architecture defines the products, their configurations, and how they are
interconnected
Information Architecture

1. Data Preparation

2. Data Franchising

Data Architecture
Ralph Kimball vs Bill Inmon (hub and spoke)
• Focus on business oriented data mart vs EDW as the core of architecture
• Top-Down design creating vs bottom-up business requirements
• ETL performs 5’C data vs subject oriented EDW with detail data
• Dimensional model for data marts vs 3NF for EDW

Analytical Data Architecture (ADA)


There are three layers of data that enable BI:
1. Systems of record (SOR)—source systems where data is created, updated and deleted.
This layer is the source of data for systems of integration (SOI) layer but is managed
outside theADA.
2. Systems of integration (SOI)—data structures used to integrate data to enable the 5C’s
of information: consistent, clean, comprehensive, conformed and current.
3. Systems of analytics (SOA)—data structures used by BI application to enable analytics

1. Additive
•Summarized by addition across all dimensions
• Common measures such as sales, cost, and profit
2. Semi-Additive
• Summarized by addition in some but not all dimensions such as time
• Periodic measurements such as account balances and inventory levels
3. Non-Additive
• Cannot be summarized by addition through any dimension
• Historical facts such as unit price for a sale

Datawarehouse Design
1. Top down approach
2. Bottom up approach

Methodological scenarios
1. Supply-driven approach
2. Demand-driven approach

Three different logical model that can be used to represent a multi dimensional data structure,
are:
1. The relational Model / Relational OLAP Systems(ROLAP) -> multidimensional (star
schema & snowflake)
2. Multidimensional Model / Multidimensional OLAP Systems(MOLAP)
3. Hybrid Model / Hybrid OLAP(HOLAP)

Views : to simplify
1. Primary -> corresponds to finest
2. Secondary -> aggregate
Temporal Scenarios
Conceptual design:
1. Yesterday for today
2. Today for yesterday
3. Today or yesterday
4. Today and yesterday

Dynamic hierarchies:
1. Type 1 : today for yesterday

You might also like