
Cloud Data Warehouse and

Data Lake Modernization

MODULE 3
Today’s Agenda

1 Welcome

2 Cloud Data Quality

3 Data Governance for Cloud Data Warehouse & Data Lakes


 

4 Close

2 © Informatica. Proprietary and Confidential.


Cloud Data Quality

Chris Philips
Vice President, Product Management,
Data Governance, Quality and Privacy
Cloud is the Key Enabler of the New Digital Era

81% of enterprise workloads will be in the cloud by 2020
Source: Forbes, Logic Monitor Cloud Adoption

83% of organizations expect workloads to move freely between clouds
Source: Turbonomic’s 2019 State of Multicloud

75% of all databases will be deployed or migrated to a cloud platform by 2022
Source: Gartner, The Future of the DBMS Market Is Cloud by Adam Ronthal, Donald Feinberg, Merv Adrian | June 23, 2019


Data 3.0 Makes Data Quality And Governance Complex

Poor Quality Makes AI Value Realization a Challenge
80% of time spent on search and data preparation
Source: IDC, Advanced Predictive Analytics Survey

Exponential Data Growth Makes Scaling Data Practices a Challenge
61% CAGR in worldwide data, reaching 175ZB by 2025
Source: IDC, Data Age 2025

Data Access & Understanding Makes Democratization a Challenge
44% of data is tagged and cataloged for consumption
Source: IDC, Worldwide Global DataSphere Forecast 2019-2023

Technical Complexity Inhibits Organizational Use of Data
60% of organizations challenged by data quality
Source: IDC, Data Integration and Integrity End User Survey, June 2019


Organizations Need to Empower Use of Trusted Data

Any Pattern Any User Any Data

Companies that empower employees to consistently use data as a basis for their decision making, are nearly twice as likely as others to report reaching their data and analytics objectives.
McKinsey: How Leaders in Data and Analytics Have Pulled Ahead


To Drive Organizational Initiatives
Insurance
• Digital Transformation
• Marketing Analytics
• Demographics Analytics
• Risk Evaluation and Optimization

Retail
• Channel Analysis
• Marketing Analysis
• Product Development
• Supply Chain Management

Financial Services Medical


Gene Analytics
Compliance (GDPR, BCBS 239, etc.)
Medical Imaging Insights
Marketing Analytics
Device and Drug Comparative Effectiveness
Risk Assessment and Fraud Detection
Diagnostic Error Prevention
Data Security and Deidentification
Portfolio Analytics
Personalized Financial Planning Healthcare
Customer Operations Automation
Provider Data Management
Patient Data Analytics
Real time case prioritization
Automotive and Transportation
Engine Monitoring and Autonomous Maintenance Data Pregnancy Management
Personalized Medications and Care
Ride Sharing Analytics and Optimization
Population Health Management
Route Optimization
IDMP Compliance

Energy and Public Sector


Citizen Services
Manufacturing Identity Resolution for Fraud and Abuse
Smart Metering
Crime and Disorder Analysis
Supply Chain and Production Optimization
City Planning
Predictive Infrastructure Maintenance
Initiatives can be Delayed or Fail Because…

• Inability to share data from multiple sources and partners
• Application and data silos cause slow project delivery
• Duplication of effort across projects
• Lack of confidence in your data delays decisions and leads to lost opportunities
• Fragmented and manual data governance processes
• Shape and format of data varies wildly
• Data quality erodes if you do not actively manage it
• Cannot identify where sensitive data is located


Enabling Any User with Self-Service Data Quality

Powerful User Interactions Business Focused

Personas: Architect, IT Specialist, SaaS/EDW Owner, Citizen Integrators, Data Analyst, Citizen Data Scientist, Business Leader

Scalable, Enterprise-ready, Simple

The first and only microservices-based, multi-tenant cloud data quality solution integrated with the industry’s leading iPaaS
Embed Quality & Governance in CDW/DL Architecture

[Architecture diagram: data sources (IoT, machine data, apps, log files, social, mobile, on-premises mainframe, application servers, databases, documents, data warehouse, SaaS, ERP, DRM) feed stream processing and stream storage, data ingestion, and data integration & quality. Data lands in a cloud data lake (landing zone, enrichment zone, enterprise zone on cloud storage with Spark processing) and a cloud data warehouse, and is provisioned to real-time analytics, enterprise analytics, line-of-business self-service analytics, and data science/AI for business users, data analysts, data engineers, and data scientists. Two layers span the whole architecture: Data Quality (cleanse, parse, de-dupe, standardize) and Data Catalog & Data Governance (discovery, lineage, glossary).]
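The four data quality operations named in the architecture (cleanse, parse, de-dupe, standardize) can be illustrated with a minimal Python sketch. This is not Informatica's implementation; the function names, the record layout (`name`, `phone`), and the US-style phone handling are all assumptions made for illustration.

```python
import re

def cleanse(value):
    """Collapse repeated whitespace and trim; return None for empty input."""
    if value is None:
        return None
    cleaned = re.sub(r"\s+", " ", value).strip()
    return cleaned or None

def parse_name(full_name):
    """Split 'Last, First' or 'First Last' into a (first, last) pair."""
    if "," in full_name:
        last, first = [part.strip() for part in full_name.split(",", 1)]
    else:
        first, _, last = full_name.rpartition(" ")
    return first, last

def standardize_phone(phone):
    """Keep digits only and drop a leading US country code (assumed format)."""
    digits = re.sub(r"\D", "", phone)
    if len(digits) == 11 and digits.startswith("1"):
        digits = digits[1:]
    return digits

def dedupe(records, key_fields):
    """Keep the first record seen for each combination of key fields."""
    seen = set()
    unique = []
    for record in records:
        key = tuple(record[field] for field in key_fields)
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique

def clean_record(raw):
    """Apply cleanse, parse, and standardize to one hypothetical raw record."""
    first, last = parse_name(cleanse(raw["name"]) or "")
    return {
        "first": first.title(),
        "last": last.title(),
        "phone": standardize_phone(raw["phone"]),
    }

# Hypothetical sample rows: the first two describe the same person.
raw_rows = [
    {"name": "  lee,   ann ", "phone": "+1 (555) 010-0000"},
    {"name": "Ann  Lee", "phone": "555.010.0000"},
    {"name": "Bob Stone", "phone": "5550101111"},
]
clean_rows = dedupe([clean_record(row) for row in raw_rows],
                    ["first", "last", "phone"])
print(clean_rows)
```

Standardizing before de-duplicating matters here: the two "Ann Lee" rows only match once their names and phone numbers share one format.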
Use a Consistent Quality Process and Methodology

[Cycle diagram: Data Quality & Governance Process, with stages Discover, Define, Apply, and Measure & Monitor]


Automate and Scale Data Discovery and Identification
• Identify data domain types with out of the box and custom created rules
• Profile data, and assess quality status
• Centralized reporting and analysis with trend visibility and drill down into details
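As a rough illustration of profiling a column and identifying its data domain with rules, the sketch below uses plain Python. The two regular-expression rules, the 80% match threshold, and the function name `profile_column` are assumptions for this example and do not describe the product's built-in rules.

```python
import re

# Hypothetical domain rules; a real deployment would have many more.
DOMAIN_RULES = {
    "email": re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$"),
    "us_zip": re.compile(r"^\d{5}(-\d{4})?$"),
}

def profile_column(values, match_threshold=0.8):
    """Return basic profile statistics and the first matching data domain."""
    non_null = [v for v in values if v not in (None, "")]
    profile = {
        "count": len(values),
        "null_ratio": 1 - len(non_null) / len(values) if values else 0.0,
        "distinct": len(set(non_null)),
        "domain": None,
    }
    for domain, pattern in DOMAIN_RULES.items():
        matches = sum(1 for v in non_null if pattern.match(str(v)))
        # Assign the domain when enough non-null values conform to the rule.
        if non_null and matches / len(non_null) >= match_threshold:
            profile["domain"] = domain
            break
    return profile

emails = ["a@x.com", "b@y.org", None, "c@z.net", "d@w.io"]
zips = ["94105", "10001-1234", "94105", ""]
email_profile = profile_column(emails)
zip_profile = profile_column(zips)
print(email_profile)
print(zip_profile)
```

A custom rule would be one more entry in `DOMAIN_RULES`, which is the sense in which out-of-the-box and custom rules can share one identification pass.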


Empower Self-service and Business Ownership
• Define metrics to measure the quality of data within your application’s key data elements
• Apply DQ rules to business processes and applications
• Promote business and IT collaboration
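Defining metrics on key data elements can be sketched as follows. The `Metric` class, the element names (`customer_id`, `country`), and the allowed country codes are hypothetical; the point is only that each metric pairs one data element with one pass/fail check and yields a percentage score.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Metric:
    name: str
    element: str                     # key data element (column) being measured
    check: Callable[[object], bool]  # returns True when a value passes

def score(metric, rows):
    """Percentage of rows whose element passes the metric's check."""
    if not rows:
        return 100.0
    passed = sum(1 for row in rows if metric.check(row.get(metric.element)))
    return round(100.0 * passed / len(rows), 1)

# Hypothetical metrics: completeness of an ID and validity of a country code.
metrics = [
    Metric("customer_id_completeness", "customer_id",
           lambda v: v not in (None, "")),
    Metric("country_validity", "country",
           lambda v: v in {"US", "DE", "FR"}),
]

rows = [
    {"customer_id": "C1", "country": "US"},
    {"customer_id": "C2", "country": "XX"},
    {"customer_id": "", "country": "DE"},
    {"customer_id": "C4", "country": None},
]
scorecard = {metric.name: score(metric, rows) for metric in metrics}
print(scorecard)
```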


Build Once and Re-use Everywhere Across Cloud and On-premises
• Ensure consistency of data quality with centralized rules management and execution
• Embed enforcement of quality rules into data pipelines and business processes
• Automate application of data quality rules across sources in on-premises and multi-cloud hybrid environments
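One way to picture centrally managed rules that are reused by several pipelines is a shared rule registry, sketched below. The registry, the decorator, the rule names, and the two sample sources are assumptions for illustration, not a description of how the product stores or executes rules.

```python
RULES = {}

def rule(name):
    """Register a data quality rule under a shared name."""
    def decorator(func):
        RULES[name] = func
        return func
    return decorator

@rule("trim_upper_country")
def trim_upper_country(record):
    # Standardize the country code without mutating the caller's record.
    record = dict(record)
    record["country"] = record["country"].strip().upper()
    return record

@rule("require_email")
def require_email(record):
    # Reject records whose email lacks an '@' sign.
    if "@" not in record.get("email", ""):
        raise ValueError(f"invalid email: {record.get('email')!r}")
    return record

def apply_rules(records, rule_names):
    """Run the named rules in order; split records into accepted and rejected."""
    accepted, rejected = [], []
    for record in records:
        try:
            for name in rule_names:
                record = RULES[name](record)
            accepted.append(record)
        except ValueError as error:
            rejected.append((record, str(error)))
    return accepted, rejected

# Two hypothetical pipelines share one rule set instead of re-implementing it.
shared_rules = ["trim_upper_country", "require_email"]
on_prem_rows = [{"country": " us ", "email": "a@x.com"}]
cloud_rows = [{"country": "de", "email": "missing"}]

on_prem_ok, on_prem_bad = apply_rules(on_prem_rows, shared_rules)
cloud_ok, cloud_bad = apply_rules(cloud_rows, shared_rules)
print(on_prem_ok, cloud_bad)
```

Because both calls reference rules by name, changing a rule in the registry changes its behavior in every pipeline at once.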


Provide Continuous Insight
• Align data quality activities with data governance and privacy activities
• Track and remediate data quality changes over time
• Enable focus on strategic projects


Discover
Enterprise Discovery and Profiling
• Profile data to examine its structure and context using out-of-the-box templates
• Drill down to see details and filter on results
• Compare profile runs to identify trends over time
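Comparing profile runs to identify trends can be sketched as a difference between two scorecards. The metric names, the scores, and the one-point tolerance below are invented for illustration.

```python
def compare_runs(previous, current, tolerance=1.0):
    """Return metrics whose score moved by more than `tolerance` points."""
    changes = {}
    for metric, new_score in current.items():
        old_score = previous.get(metric)
        if old_score is None:
            continue  # metric is new in this run, so there is no trend yet
        delta = round(new_score - old_score, 1)
        if abs(delta) > tolerance:
            changes[metric] = {
                "from": old_score,
                "to": new_score,
                "delta": delta,
                "trend": "improved" if delta > 0 else "declined",
            }
    return changes

# Hypothetical scorecards from two profile runs of the same data set.
run_january = {"customer_id_completeness": 92.0,
               "country_validity": 88.5,
               "email_validity": 97.0}
run_february = {"customer_id_completeness": 96.5,
                "country_validity": 84.0,
                "email_validity": 97.4}

changes = compare_runs(run_january, run_february)
for metric, change in changes.items():
    print(metric, change["trend"], change["delta"])
```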


Define
Business Rule Definition
• Empower the business to lead data quality initiatives
• Reduce project cycles
• Enable IT to focus on strategic projects


Apply
Centralized Re-usable Rules
• Consistently apply data quality rules across the enterprise in support of data governance
• Reduce cost through re-use of centrally managed data quality rules
• Streamline the resolution of data quality issues


Manage and Monitor
Provide Continuous Insight
• Align data quality and data governance efforts
• Track data quality improvements over time
• Enable IT to focus on strategic projects


Democratize Data Quality for Everyone, Everywhere
• Expanded User Community
• Multi-Cloud, Hybrid, On-Premises
• Apply Data Quality to Any Data

“Having an automated, integrated solution from Informatica is making a difference in our data governance program – because you cannot manage what you cannot see.”
Paul Keller, Sr. Director Enterprise Data Governance, L.A. Care Health Plan


Increase Cloud Data Warehouse Productivity and Value
• Simplify Data Cleansing
• Build a High-quality Data Pipeline
• Deliver Trusted Insights from Your Cloud Data Warehouse

“As data comes in from connected cars, Informatica allows us to profile, normalize, and catalog it so that a business analyst can find the information they need when they need it and take action.”
Christopher Cerruto, VP of Global Enterprise Architecture and Analytics, Avis Budget Group


Accelerate your SaaS Adoption
• Powerful Discovery with Automated Rule Association
• Re-use Data Cleansing Rules
• Surface Data Quality Trends

“While some data quality issues are unfathomably complex, many yield quickly and produce outsize gains. Eliminating a single root cause can prevent thousands of future errors, save millions, and make things better for all involved.”
Redman, Salmon and Nagle, Harvard Business Review
Data Quality for Everyone, Everywhere
• Consistent Quality Process and Methodology
• Multi-cloud, Hybrid, On-Premises
• Self-Service Model
• Cloud Data Quality Market Leader


Demo
Benefits of Cloud Data Quality

• Faster Deployments
• Improved Trust in Results
• Greater Adoption
• Reduced Cost of Data Management

“It costs ten times as much to complete a unit of work when the data are flawed in any way as it does when they are perfect.”
Thomas Redman, Harvard Business Review


Key Takeaways

• Organizations are embracing cloud technologies and investing heavily in modernizing applications, but value creation is still limited due to bad data quality.
• Only Informatica provides market-leading, first-ever microservices-based Cloud Data Quality, and its self-service capabilities are the fuel for transformative new business initiatives.
• A proactive data quality process increases the productivity of business analysts, data stewards, and data scientists, and helps in decision making.


Start Your Free Cloud Trial Today



Data Governance for
Cloud Data Warehouse &
Data Lakes
Susan Wilson
Vice President, Data Governance and Privacy
Managing Data Is Getting Harder

Explosion in Data Volume: 20.6 zettabytes per year in global data center traffic
New Users: 500 million business data users and growing
New Data Types (mobile, social, IoT): 20 billion connected devices
Data in the Cloud: Over 94% of data center traffic will come from the Cloud
Machine Learning/AI: 1 billion workers will be assisted by machine learning or AI
The Role of the CDO is Shifting

DATA
Risk and Compliance Management
GDPR, CCPA, FERC, MIFID, BCBS 239, MDR, IFRS17



The Role of the CDO is Shifting
Driving business value

Business Goals: Revenue Growth, Cost Reduction, Margin Expansion, Talent Management, Corporate Governance
Business Objectives: Product/Service Innovation, Operational Efficiency, Customer Focus, Data Driven Culture, Market Reputation, Regulatory Compliance

DATA

80% of organizations with a high ability to innovate have strong data sharing
practices both internally and beyond company borders.
MIT Sloan: Data & Analytics Global Executive Study



Data Democratization Drives Business Value
Broad, consistent, and easy consumption of data for all employees
SELF-SERVICE ACCESS for:
• EXECUTIVE TEAMS: Management Reporting
• DATA SCIENCE TEAMS: DataOps & Model Training
• FINANCE TEAMS: Financial Planning & Analysis
• MANUFACTURING TEAMS: Overall Equipment Effectiveness
• MARKETING TEAMS: Personalized Next Best Offer
• PROCUREMENT TEAMS: Spend Under Management
Drivers of Data Governance

1. Insights & Analytics
• Self Service
• Data Quality & Privacy
• Predictive Analytics
• Data Driven Decisions
• Data Lake Reporting
• Data Definition & Understanding
• Data Lineage

2. Regulations
• GDPR
• CCPA
• FERC
• MIFID
• BCBS 239
• MDR

3. Business Enablement
• Change Management
• Customer Experience
• Customer Satisfaction
• Market Penetration
• Business Decision
• Data Ownership
• Elimination of Silos
• Enterprise Collaboration

4. Digital Transformation
• Cloud Migration
• Data Warehousing
• Data Lake Management
• Data Explosion
• Data as an Asset


Organization Landscape

Data Management
Managing the growing complexity of data in multiple systems & applications. Creating a digitalized data platform to support business consumability.
IT Personas: Data Architects, Data Engineers, System Owners, Database Admins, Technical Stewards

Business
Looking to define, understand and trust data to support regulatory compliance, analytics, risk management, customer experience, etc.
Business Personas: Data Analyst, Data Scientist, Privacy Officer, Business Stewards, Governance Managers

[Diagram center: Data Governance]


Data Needs to Be Clean, Prepared and Trusted

80% of time is spent on search and data preparation *1
25% of useful data created in 2018 was tagged *2
60% of organizations are challenged by data quality *1
13% of tagged data was used for analysis *2

Sources
*1 - IDC, End-User Survey Results: Deployment and Data Intelligence in 2019, #US45652419, Nov 2019
*2 - IDC, Worldwide Global DataSphere Forecast, 2019-2023, #US44615319, Jan 2019


And it Needs to be Easy to Access and Consume

[Chart: population plotted against technical skill, running from technical to business users: Data Scientist, ETL Developer, Data Engineer, Citizen Integrator, Citizen Analysts, Citizen Consumer. The chart highlights a New Self-Service Model.]


Intelligent Data Governance
[Graphic keywords: Value, Quality, Collaboration, Agility, AI/Automation, Trust, Democratize, Business Relevance, Lineage, Protection, Enterprise, Reduce Risk, Adoption, Privacy, End to End, Connected]


What Makes Our Approach Different?

Automated: Leveraging the Power of AI / ML to Ensure Technology Does the Heavy Lifting
Scalable: Enterprise Grade Solution Designed to Grow with You and Your Program
Extensible: Fully Integrated platform solution to support Enterprise Data Governance
Agile: Carefully Curated Model to Ensure Quick And Sustained Success
Informatica Enterprise Data Governance Solution
The Core, Modular, Unified Platform for All Data Governance Use Cases

Quality: Informatica Data Quality
Measure data quality metrics and scorecards

Catalog: Enterprise Data Catalog
Discover what’s being defined, e.g. schemas, tables, columns, etc.

Master: Master Data Management
Repository of trusted master and reference data

Privacy: Data Privacy Management
Enforce policies, report on risk, search subject registry, breach analysis, etc.

Data Governance: Axon Data Governance
Capture the business content of data, define processes, policies, and ownership/stewardship, and give your non-technical consumers the ability to understand and access data.
Build a Data-Driven Culture
Build (Data Architects): How can I find data to onboard into the CDW/L?
Operate (Operations Leader, Data Engineer): How do I manage changes in the CDW/L?
Control (Quality, Privacy Steward): How do I ensure our policies are adhered to?
Consume (Citizen Consumer, Data Scientist): Where can I find, discover, understand data required for my analysis?
Measure (Data Steward): Is my domain performing? What actions can we take to improve quality?
Data Governance Lead: Are we achieving our adoption targets?
Data Management Leader: Are we achieving our efficiency targets?

[Diagram: the Build, Operate, Control, Consume, Measure cycle surrounds Data Democratization (Automation, Scale, Intelligence, Powered by CLAIRE), on top of the Cloud Data Warehouse and Data Lake]
Intelligent Data Governance
Key Enablers for Data Governance 3.0

• AI/ML Domain & Lineage Discovery
• Contextual Graph Search
• Operationalized Data Privacy
• Business Adoption
• Intelligent Glossary Associations
• Data Quality Automation
• Automated Workflow Collaboration
• Comprehensive Impact Assessment


Informatica delivers the industry’s only integrated, intelligent, enterprise-scale, governed Data Marketplace


Demo
Benefits of Data Governance for Cloud DW/L
• Greater Collaboration Across Enterprise
• Find Relevant Data Assets
• Reduce Cost of Operations
• Greater Trust and Compliance

Companies that empower employees to consistently use data as a basis for their decision making, are nearly twice as likely as others to report reaching their data and analytics objectives.
McKinsey: How Leaders in Data and Analytics Have Pulled Ahead

[Diagram: Data Quality, Enterprise Data Catalog, Data Governance & Privacy, and Data Privacy support the Build, Operate, Control, Consume, Measure cycle around Data Democratization (Automation, Scale, Intelligence, Powered by CLAIRE), on top of the Cloud Data Warehouse and Data Lake]


Call to Action: Cloud Data Warehouse Modernization
https://www.informatica.com/solutions/move-to-the-cloud

CTA for DG
Learn & Prepare: Whitepapers & Workbooks
Deep Dive: Reference Architecture Guides
Get Started: Free Trials (free 30 day trial)


Partner Resources - PARC
For assistance, contact the PARC Helpdesk: partners@informatica.com

Log into PARC first, and then click these resources:
• IU Product Training/Certifications
• Implementation Workshops
• Partner Learning Series
• Sales Training
• Competitive
• Technical Webinars
• Informatica Network
• Success Portal


Don’t Forget the Quiz!
Thank You
