<Insert Picture Here>

S292621 Overview: Approaches to Oracle Data Warehousing
Robert Stackowiak Vice President, Business Intelligence, Oracle Technology Business Unit

The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle.

Best Practices from….

And you!!!

Starting Out: On the Same Page?
Different Agendas Typical in IT & Lines of Business

Information Technology

Lines of Business

It is about… managing and delivering quality data consistently and at low cost

It is about… immediate access to information needed for business decisions

Finding Benefits
IT Benefits are only Somewhat Motivating • ROI = Present Value of Benefits - Costs • Increasing ROI for information producers by
• Reducing current costs of analysis for LOBs • Empower LOBs with tools, applications, and automated analysis rather than manually creating custom reports • Reducing ongoing costs of analysis for IT • Faster and simpler development of analytic applications • Centralized management

Finding Benefits
Lines of Business produce the Big Wins • ROI = Present Value of Benefits - Costs • Increasing ROI for information consumers (LOBs) by
• Enabling top-line growth • New customers/markets, most profitable customers, … • React faster than the competition • Enabling bottom-line savings • Optimize operations, financial management, …

Typical Business Drivers
• Financial
• Cross-sell, targeted marketing, more efficient processes

• Manufacturing
• Supply chain analysis / just in time inventory

• Retail
• Supply chain analysis / inventory control

• Media & Telecommunications
• Revenue across multiple channels

Typical Business Drivers
• Transportation
• Optimized logistics management

• Healthcare
• Cost containment / quality of care

• Civilian Government
• Budget spend and reporting to constituents

• Military & Intelligence
• Logistics management, threat analysis

Business Strategy Map

Deploy within Scope Quickly
• Pick a strategic business area
• Should have potentially highly visible ROI

• Build business / IT partnership
• Limit scope, deliver projects incrementally

• Assemble team with the right skills
• Engage systems integrators as if employees

Organization Strategy

Assessment & Scope

Discovery

Solution Development & Presentation

Approval & Budget

Your Infrastructure?
MULTIPLE ROLES DIFFERENT METRICS AD-HOC PROCESS DISPARATE BI TOOLS AND REPORTING SYSTEMS FRAGMENTED DATA SOURCES

GOAL

FINANCE

EBITDA

FINANCE

? ?
HUMAN RESOURCES HIRING QUALITY

Multiple ERP

PLAN

?
MARKETING BRAND

MODEL

OPERATIONS

Multiple SUPPLY CHAIN Multiple DATA WAREHOUSES Multiple LEGACY APPLICATIONS

?
CUSTOMER SERVICE SERVICE LEVELS

ANALYZE

PLANNING

?
EXECUTIVE COMPANY HEALTH

Multiple POINT APPLICATIONS

REPORT

A Re-architecture Approach
Orchestration
Sales Marketing

Applications

Data Warehouse Master Data & Data Warehouse Database
EAI

Data Marts BI Tools Reporting & Dashboards

Call Center Financials Providers Eligibility Claims

Master Data Hub DW DW DW Data Warehouse
EAI ETL ETL

Analytical Data

Business Intelligence

Self Service

Plan for Data Growth
1,487% Increase in Size from 1998 to 2005 245% Increase just from 2003 to 2005
100

80

Database Size (TB)

60

Size of the largest data warehouse in TopTen Programs

40

20

0 1998

1999

2000

2001

2002

2003

2004

2005

Source: 2005 TopTen Program, November 2005 © Winter Corporation, Waltham, MA, USA

Time to Consider Grid Deployment?

Fusion Middleware

Real Application Clusters Grid Control

Automatic Storage Management

Configuring the Platform
Rule of Thumb: 200MB/s per CPU → Number of HBA per node = number CPUs per node Maximal Number of Switches = Number of HBAs Number of HBAs + No. of Controllers

CPU,Node

Host Bus Adapter Switch
Number of nodes <=8 → GigE, otherwise infiniband

Controller Disk Interconnect

* 2Gbit based

Minimum number of disks = number of controller x 4

Typical Platform Sizing Questions
• Highest level basic questions
• How much data needs to be stored in the system? • What is the mix of query types (e.g. simple, intermediate, complex)? • How many concurrent users are there?

• Technical questions
• What is the expected read / query rate from the system in GB / second? • What is the expected read / query rate from the system in GB / second? • Do queries run concurrently with the data load?

Goal: Consistent Sustained Performance
Component
CPU core 1/2 Gbit HBA 16 port switch Fiber channel Disk controller GigE NIC (interconnect) Disk (spindle)

Hardware spec
1/2 Gbit/s 8 * 2 Gbit/s 2 Gbit/s 2 Gbit/s 2 Gbit/s

Sustained
100 - 200 MB/s 100/200 MB/s 1,200 MB/s 200 MB/s 200 MB/s 80 MB/s* 30 - 50 MB/s

* Limited by the protocol

Simplify: Use Optimized Warehouses
• Oracle Optimized Warehouse Reference Configurations
• Documented best-practice balanced configurations for data warehousing • Available Today! • Dell / EMC, HP, IBM, Sun

• Oracle Optimized Warehouses
• Scalable systems pre-installed and preconfigured: ready to run out-of-the-box • Recent announcements

Leverage Oracle Database Flexibility
Materialized Views, Hybrid Schema…
SQL Query Region Date

Materialized Views
Sales by Region Sales by Date

Query Rewrite

Sales by Product

Sales by Channel

Product Relational Star Schema

Channel

… and New Database Capabilities
Cube Organized Materialized Views
SQL Query Region Date

Materialized Views

Query Rewrite

Product

Channel

Automatic Refresh

OLAP Option Cube

Analyze Large Data Sets in Database
Better Performance for Sophisticated Users
• Analytic Functions in the Database • OLAP Option in the Database • Data Mining Option Algorithms in the Database
• Naïve Bayes, Associations, Adaptive Bayes Network, Clustering algorithms, Support Vector Machines, Nonnegative Matrix Factorization, Decision Trees, Generalized Linear Models You can leverage existing mining tools • SPSS Clementine • InforSense

Optimizing Oracle BI Suite EE Plus
Leverage the BI Server as a Cache

Interactive Dashboards

Ad hoc Analysis

Proactive Detection and Alerts

Microsoft Office

Reporting & Publishing BI Publisher Financial Reporting Interactive Reporting SQR Production Reporting Web Analysis

Common Enterprise Information Model Oracle BI Server
Set Goals Plan Insight Performance Action Report Monitor Analyze Align

OLTP & ODS Systems

Data Warehouse Data Mart

SAP, Oracle PeopleSoft, Siebel, Custom Apps

Files Excel XML

Business Process

Financial Performance Management Applications

Hyperion Essbase

Match Solutions to Needs
Hyperion Essbase Performance Management Oracle Database OLAP Option Enhanced Data Warehousing

Hyperion Essbase

Oracle Database OLAP

Data Mgt. Strategy System Ownership Metadata Mgt. Hot-pluggable

Heterogeneous LoB User Owned Data Sources

Oracle-centric IT IT Managed BI Tools

Still Managing Oracle like it is 1999?
• Consider leaving init.ora parameters set as defaults
• Adaptive degree of parallelism, etc.

• Leverage Enterprise Manager
• Database Resource Manager • CPU allocation by groups of users, proactive query governing, automatic queuing, dynamic re-prioritization • Automatic Database Diagnostics Monitor alerting & advisors • Partitioning, SQL Access, SQL Tuning, Segment, Mean Time to Recovery, and other advisors

Storage Management through ASM
Database 1 File #1 File #2 • Files spread across disks to balance load • Integrated mirroring
• SAME vs. RAID 5

• Portable, high performance, cluster file system & volume manager

File #1 Database 2

• Accessible through Enterprise Manager

Change Management Considerations
Leverage Change Assurance Capabilities
Detect and Tune Performance Changes Capture and Replay Workload Set Up Test Environments Provision - Upgrade or Clone Configure and Maintain Production System

Package Incidents for Support Online ‘Hot’ Patching

Partitioning Strategies
ORDERS ORDERS ORDERS Europe USA Jan Large Table
Difficult to Manage

Feb

Jan

Feb

Partition
Divide and Conquer Easier to Manage Improve Performance

Composite Partition
Higher Performance More flexibility to match business needs

• New: Interval partitioning, more composites (list-list, list-range, list-hash, range-range), partitioning on virtual columns

Data must be Available
Information Lifecycle Management to contain Cost • Management parameters:
• Data classification • Age / date, product, privacy, etc. • Storage Tiers • High performance, low cost, historical • Advanced Compression in tiers • Access Management & Migration • By class

• ILM Assistant simplifies:
• Policy management, data management, compliance & security

The Data Warehouse and Availability
• Disaster Recovery
• Identical RAC Clusters in distance separated sites

• Other Strategies: Active Data Guard, Total Recall
1. Extract from source systems
Extract Servers ETL Manager STG2 (ETL/Staging) Data Warehouse 2

2. Integrate, transform, and denormalize
STG1 (ETL/Staging)

3. Query and analyze

4. Data access and publishing
Users

Data Warehouse 1 BI Tool

Share Data and Protect it?

Audit Vault

Database Vault

Label Security

Advanced Security

Secure Backup

Configuration Mgmt Pack

Monitor, Alert, Multi-factor Data Consolidate DBA Controls Classification

Encrypted Data

Encrypted Secure Data on Tape Deployments

Example: Virtual Private Database
Transparent Security inside the Database • Server dynamically rewrites SQL
• Query modification based on PL/SQL package assigned to an object • “Where” clause appended to SQL statement

SELECT * FROM orders
Western Sales Rep Where territory_id = 12

SELECT * FROM orders
Where product_id - 22 Product Manager

orders

Deploying Pervasive BI
Execute Action
Provide values for the following action parameters:

Begin Marketing Campaign
Region Campaign ID West 42684

Help

Cancel

OK

OBI Suite EE Plus & BPEL

Consider Analytic Applications
Build from Scratch with Traditional BI Tools Training / Roll-out Define Metrics & Dashboards DW Design Training / Rollout Back-end ETL and Mapping Define Metrics & Dashboards DW Design Back-end ETL and Mapping Months or Years Weeks or Months Oracle Analytic Applications

Oracle Analytic Applications solutions approach: • Faster time to value • Lower TCO • Assured business value
Easy to use, easy to adapt Role-based dashboards and thousands of pre-defined metrics Prebuilt DW design, adapts to your EDW Prebuilt Business Adapters for Oracle, PeopleSoft, Siebel, SAP, others

Source: Patricia Seybold Research, Gartner, Merrill Lynch, Oracle Analysis

Oracle BI Applications
Check alignment to Business Requirements
Auto Comms Complex Consumer Sector Mfg & Media Energy Financial Services High Tech Insurance Life & Health Sciences Public Sector Travel & Trans

Sales
Pipeline Analysis Triangulated Forecasting Sales Team Effectiveness Up-sell / Cross-sell Cycle Time Analysis Lead Conversion

Service & Contact Center
Churn Propensity Customer Satisfaction Resolution Rates Service Rep Effectiveness Service Cost Analysis Service Trends

Marketing
Campaign Scorecard Response Rates Product Propensity Loyalty and Attrition Market Basket Analysis Campaign ROI

Order Management & Fulfillment
Order Linearity
Orders vs. Available Inventory

Supply Chain
Supplier Performance Spend Analysis Procurement Cycle Times Inventory Availability Employee Expenses BOM Analysis

Financials
A/R & A/P Analysis

Human Resources
Employee Productivity

GL / Balance Compensation Sheet Analysis Analysis
Customer & Product Profitability

Cycle Time Analysis Backlog Analysis Fulfillment Status Customer Receivables

HR Compliance Reporting Workforce Profile Turnover Trends Return on Human Capital

P&L Analysis Expense Management Cash Flow Analysis

Prebuilt adapters:

Other Operational & Analytic Sources

BI Tools, Database, Complementary Vertical Models / Applications

A BI Applications Suite Spot
Hyperion Financial Performance Management
Strategic Planning
Set Strategic Objectives Treasury Strategies Corporate Development Long-Term Planning Forecasting Allocate Resources

Planning and Budgeting
Cascade Targets Detailed Budgeting

Master Data Business Rules Metrics/KPIs Targets, Plans & Actuals Financial Close Process Cost and Profitability Management
Define Costing Methods Data Cleansing Consolidate Internal Reporting & Analysis External Reporting & Compliance Report & Analyze Perform Allocations Define Cost Drivers

Good News: Improving Technology
• Faster Time to Deployment
• Analytic applications, SOA components, Optimized Warehouses

• Pervasive Business Insight
• Automated business processes, guided decisions

• Right Time Predictive Insight
• Advanced predictive analytics

• Highly Available and Secure Solutions
• On-line data accessible all of the time

• Decreasing TCO
• Lower-cost, self-managed, less tuning needed

Challenges
• Understanding business needs & priorities • Navigating sponsorship & politics • Getting info to design and size properly • Planning for future using corporate vision • Finding architects skilled in business, technology • Unforeseen developments

Presentation Review
• • • • • • • • • • Business and IT must be on the same page Deploy quickly in well-scoped increments Evaluate re-architecture as needed Choose the right tools for the right users Data volume, access determines where to analyze it Use newer database features in design, management Design to meet security, availability needs Insist on Optimized Warehouse configurations Plan ahead: data growth & changing business Evaluate applications as aligned to business

For More Information

search.oracle.com
business intelligence

or oracle.com/bi

Sign up to vote on this title
UsefulNot useful