Professional Documents
Culture Documents
Informatica Agile Virtualization Apr17 2012
Informatica Agile Virtualization Apr17 2012
Kerry Holton
Informatica Senior Sales Engineer
1
Let’s Win Something!!!
A copy of “Lean Integration.”
Tell me which box is the ONLY thing
that data virtualization built on data
federation does – and why???
Sign-Up
JOIN & DISCUSS
2000+ Strong
Business
/ IT “Demands by users of business intelligence
BI (BI) applications to "just get it done" are turning
typical BI relationships, such as business/IT
alignment and the roles that traditional and next-
generation BI technologies play, upside down. As
business users demand more control over BI
applications, IT is losing its once-exclusive
control over BI platforms, tools, and applications.”
– Boris Evelson, Forrester Research, Blog -
“Top 10 BI Predictions for 2012”
• Business-focused BI
• $100M Qtr. in 2011
• 10k+ customers
Informatica Corporation Confidential – Do Not Distribute
6
How Long Does it Take to Deliver New
Critical Data or Reports to the Business?
ETL SOA
Hand Coding ESB/EAI EII
Business Portal BI
IT
(WebSphere) (Cognos)
NO
REUSE
30,000 Data Marts Data Warehouse Facets [Benefits, Products] Product Config Mgmt
(MS Access) (DB2) (Sybase ASE) (MS SQL Server)
The Fundamental Problem(s)…
Typical
Data Integration Process
• It takes too long to explain
1. Design requirements
2. Change
3. Integrate
• It takes months to change a
DW / add new critical data
4. Unit Test
5. Validate • It takes many iterations to
6. Deploy get the right data / reports
Business is
Involved Too Late • Changes can break existing
integrations & impact apps.
As-Is Value Stream Map (LOT OF WAIT & WASTE)
No Reuse
DATA
MART
Data
Abstraction FAST, DIRECT ACCESS TO
CUSTOMER ORDER PRODUCT …
DATA THE BUSINESS TRUSTS
DW
• Addresses specific use cases
• No data movement / no copies / only federation
• Code heavy / not model-based / no reuse
It’s like ONE step forward
• Not tools for business self-service &
• SQL/XQuery-only transformations TWO steps backward
• No data profiling / no data quality
Informatica Corporation Confidential – Do Not Distribute
14
What Are the Top 3 Key Capabilities for a
Project that Needs Data Virtualization?
If Performance is a given…
Dataset - 600
Optimizations Common
& Caching Metadata
CRM Virtual Table Accounts Virtual Table
Merge
Virtual View DW
Prototype Move to DW
First or Instantly Reuse
Access as SQL / WS
DW
• Single environment for both data integration and data federation
• No data movement / no copies – but easily reuse virtual views for batch
• Early & iterative business (analyst) involvement – self-service
• Pre-built library of rich ETL-like advanced data transformations
• Integrated real-time, on-the-fly data profiling & data quality
NEW QUERY
INSTANT REUSE
SELECT *
SELECT *
FROM EXISTING
FROM customer_table
customer_table QUERY
INNER JOIN
support_table ON
SELECT *
customer_table.customer_num =
FROM SUPPORT
support_table.customer_id
WHERE customer_name=‘ACME’
DM
DM
DW
DM
DW
CUSTOMER CustSUPPORT
DW PRODUCT INVOICE
DM ODS
WEB
Data
On-boarding
Complement
Trusted
Virtual
New
Results
Retrieve
Query
quality
query
view
blend
retrieved
historical
is
rules
for
new
can
processed
data
ofreport
applied
be
historical
data
in
architecture
customer
physically
real-time
needing
does
by
on-the-
and
not
operational
materialized
without
virtualization
break
with
fly
dataagainst
virtualization
data
datatxt
integrations
not
data
later
movement
in data
DW
delivered
layer
into DW
Business Portal BI
IT
(WebSphere) (Cognos)
NO
Instant Reuse
DW, BI, SOA & MDM
REUSE
(SQL, Web Services, Batch)
“Virtual Table”
Common Data Model
MEMBER CLAIM PRODUCT ORDER
30,000 Data Marts Data Warehouse Facets [Benefits, Products] Product Config Mgmt
(MS Access) (DB2) (Sybase ASE) (MS SQL Server)
What Does Informatica’s Data
Virtualization Solution Look Like?
NEW New PowerCenter Edition for
PowerCenter
Data Virtualization Edition
AGILITY & PRODUCTIVITY
Partitioning
Combines:
Data integration (PowerCenter SE)
Data Profiling
Data Virtualization (IDS Full Use)
Data Federation
(Data Services) Data Profiling (IDE Full Use)
Developer Tool Business-IT Collaboration (Analyst)
Analyst Tool
Packaged for simplicity and
2 Adapters
(PWX for Relational)
attractively priced
Reuses existing skills and
ETL
(PC Standard Edition) resources
Weeks/Days
Months
2
Virtual View
MDM
HUB
MDM TRANSACTIONAL
SYSTEMS
Deliver a complete view of master & DATA
transactional data in real-time INCOMPLETE VIEW
COMPLETE VIEW
WAREHOUSE
OF CUSTOMER
Applications
3
Registry BPM
SOA
ESB
Biz. Services
Deliver the missing data services
layer to SOA & applications Data Abstraction
Data Sources
Informatica Corporation Confidential – Do Not Distribute
22
What are the Benefits of Informatica’s
Solution?
• Provide fast, direct access to critical
new data & reports in days vs. months
24
BI, MDM, SOA – HealthNow NY Improves
Risk & Pricing Analysis With Data Services
BI (Cognos) Portal
(WebSphere)
SQL, Web Service
Virtual Table
IDS
Data Marts Data Warehouse Facets [Benefits, Products] Product Config Mgmt
(MS Access) (DB2) (Sybase ASE) (MS SQL Server)
25
BI, SOA - Large Latin American Bank
Improves Governance
Microsoft Reporting Services Customized Applications
SQL, Web Service
Virtual Table
Data Virtualization
Transactions Tables Data Warehouse Credit Analysis, Applications, AML Financial Institutions
(Mainframe – Adabas, DB2) (DB2 LUW) (SQL Server) (Flat Files and Messages)
26
BI, MDM – VW Leverages Delivers a
Complete View of Critical Data On-Demand
BI Portal
MDM Hub (Customer, Purchase, Case) DW (Service History) PRD [Campaign History] Transactional Systems (Warranty, Service)
(IBM) (Teradata) (SAGA/Win) (Varied)
27
Data Virtualization in Action
VIRTUAL TABLE
• Define entities & directly access &
merge data to create virtual views
SQL or Portal
Common
• Rapidly profile data sources & Metadata Web Service
logic without more processing
• Quickly find data & rules via
business glossary
• Collaborate, test, validate &
share results
Batch
Developer Tool ETL
• Cuts the wait & the waste in the (Eclipse) Data Warehouse
process
29
The 7 Steps to AGILITY & PRODUCTIVITY
1 2
Customer
Name Virtual Table
Address
Category
Orders 3
7 CRM Accounts
Virtual Table
Optimizations Common
& Caching Metadata
CRM Virtual Table Accounts Virtual Table
30
1. Model
31
31
2. Access and Merge
Turn many data sources into
ONE with Data Virtualization
CUSTOMER SUPPORT PRODUCT INVOICE
32
3. Profile in RT
Rich set of integrated profiling
capability to find data
anomalies and to discover keys
and hidden relationships:
• Midstream or Comparative
Profiling
• Dependency Profiling
33
4. Transform in RT
• Metadata-driven, codeless,
graphical environment
34
5. Reuse Instantly
Batch
METADATA
REPOSITORY
35
6. Move or Federate
Data Federation Data Integration
BI BI
Deliver
Merge
Virtual View DW
Single-click deployment to DW
PowerCenter (batch)
Access
DW Advanced Transform
Extract & Load
Quality
36
7. Scale & Perform
• Leverage the proven, high-
performance Informatica engine
• Optimized SQL Query engine &
graphical Query Plan
• High-performance Web services
server
• Rich set of optimizations &
caching mechanisms
• Rule Based, Cost Based, Push Down,
Early Projection, Early Selection, Semi-
Join, Virtual Table & Result Set Caching
37
Data Virtualization Built On Data
Federation Does 1 Box – Which 1?
1 2
Customer
Name Virtual Table
Address
Category
Orders 3
7 CRM Accounts
Virtual Table
Optimizations Common
& Caching Metadata
CRM Virtual Table Accounts Virtual Table
38
Do it Right – Avoid Costly Mistakes!
Enabling Rapid Analyzing & Integrating Scaling with Leveraging
Development Profiling with Quality Flexibility Investments
Sustain & Get it Right Bake-in Prototype First Re-purpose
Maintain 1st Time Quality & Then Scale Logic & Skills
TIME COST TIME COST RISK TIME COST RISK TIME COST TIME COST
Virtual Table
EII
Optimizations
Model & metadata- Profile data AND Leverage pre-built Virtualize or physically Naturally extend
driven environment logic anywhere logic including quality materialize in 1 tool your infrastructure
SQL EII
XQuery X
Simple Cleansing
Web Service
TIME COST TIME COST RISK TIME COST RISK TIME COST RISK TIME COST
39
Data Virtualization in Action
40
Scenario – Big Company
ISSUES
IMPACT
41
Demo – Big Company
Business needs a new report – NOW vs. months!
Quickly merge data from multiple systems & cleanse
Analysts know the data – want some self-service
Join CUSTOMER (Oracle CRM) & ORDER (file)
Get ORDER TOTAL for ACTIVE customers
42
Why Informatica?
Power of
THE BEST OF
The Platform
THE BEST OF
“DATA INTEGRATION” “DATA VIRTUALIZATION”
(SOPHISTICATION) (AGILITY)
REUSES SKILLS
enhancements, cloud integration, common metadata,
successful data integration strategy.”
and role-specific tools.”
Ted Friedman, VP Distinguished Analyst, Gartner
The Forrester Wave: Data Virtualization, Q1 2012
Transform
Virtual View DW
Prototype Move to DW
First or Instantly Reuse
Access as SQL/WS
DW
Sign-Up
JOIN & DISCUSS
2000+ Strong