Professional Documents
Culture Documents
UC Berkeley
Peter Cava
Manager Data Warehouse Services
October 5, 2006
EDW UC Berkeley
EDW UC Berkeley
EDW UC Berkeley
Traditional Database
Application-specific Data
Current Data
Organized for Data Entry
Updated Data
Normalized Data
Encoded Data
Detailed/Summarized Data
Raw Data
Clerical User
EDW UC Berkeley
EDW UC Berkeley
EDW UC Berkeley
Scientific Databases
Administrative
Systems
World
Wide
Web
Digital Libraries
Different interfaces
Different data representations
Duplicate and inconsistent information
Vertical fragmentation of informational systems (vertical stove pipes)
Result of application (user)-driven development of operational systems
EDW UC Berkeley
World
Wide
Web
Digital Libraries
Scientific Databases
Personal
Databases
EDW UC Berkeley
Key Concepts
Data Warehouse vs. Data Mart
Data Mart Single subject area
Data Warehouse Integrated Data Marts
Star Schema
Dimensions
Hierarchies
Descriptive Attributes
Fact Tables
Metadata
EDW UC Berkeley
tables
z Fact tables
Place where numerical measurements of business are
stored
EDW UC Berkeley
EDW UC Berkeley
Reporting
z
EDW UC Berkeley
Strengths of OLAP
z It
EDW UC Berkeley
EDW UC Berkeley
Comparison
z Budget vs. Expenses
z Ranking
z Access to detailed and aggregate data
z Complex criteria specification
z Visualization
EDW UC Berkeley
Reporting
EDW UC Berkeley
INTERACTIVE REPORTING
EDW UC Berkeley
ANALYTICS
EDW UC Berkeley
Dashboard
EDW UC Berkeley
EDW UC Berkeley
Scorecard
EDW UC Berkeley
EDW Today
z
z
z
Foundation
Student Pilot (In development)
EDW UC Berkeley
Technical Environment
z Oracle
9i
z ETL Tool Informatica
z Reporting Tool Hyperion Brio
z Data Modeling Tool - ERwin
EDW UC Berkeley
Report Request
EDW UC Berkeley
EDW UC Berkeley
Data Extract
Get Data from source
Data Transformation
Data Cleansing - Data Quality Assurance
Data Scrubbing - Removing errors and inconsistencies
Processing Calculations
Applying Business Rules
Changing Data Types
Making the Data More Readable
Replacing Codes with Actual Values
Summarizing the Data
Data Load
Load data into Warehouse
z
z
z
z
EDW UC Berkeley
EDW UC Berkeley
EDW UC Berkeley
Workforce analysis
Financial analysis of operations.
Financial analysis research and projects
Curriculum and enrollment analysis
Student pipeline analysis (pre-registration to alumni)
Graduate support analysis
Private support analysis (donors and gifts)
Purchasing analysis
Recharge analysis
Classroom and facilities utilization analysis
Absenteeism analysis
Enterprise risk assessment
EDW UC Berkeley
Slice and dice many different ways. Tool support for this.
Documentation of available data.
Enough history for trend analysis.
Reconcile alternative views of same information; e.g., as-reported vs. final.
EDW UC Berkeley
Metadata
Data about Data
z Business Meta Data
z Technical Meta Data
z Operational Meta Data
z Project Meta Data
EDW UC Berkeley
and Prioritization
z Decentralized reporting systems
z Meta Data management
z Resource availability
z Central vs. local reporting needs
z Funding BAI vs. Campus enterprise