
DELIVERING DATA GOVERNANCE WITH A YES


MARCH, 2019
ABOUT US
Stewart Bond, IDC @StewartLBond

• Research Director of IDC’s Data Integration and Integrity Software service.
• Delivers industry best practice, market research and analysis
• Author of the research report “Data Intelligence Software for Data Governance”
  https://www.talend.com/resources/data-intelligence-software-for-data-governance/

Jean-Michel Franco, Talend, @jmichel_franco

• Sr Product Marketing Director, Data governance
• 25 years of experience in Data Management and BI
• Authored 4 books, and regular publications
• Talend is a next-generation leader in cloud and big data integration software that helps companies make data a strategic asset.
Data Enablement through Data Intelligence
Infusing Trust with Knowledge
Stewart Bond, Research Director
March 2019
© IDC
Agenda

▪ Data in the era of digital transformation

▪ Symptoms illustrating a lack of data intelligence and trust

▪ Prescribing data intelligence software

▪ Applying data intelligence in the organization

Data is the lifeblood of digital transformation (DX)

Key Takeaway: Digital transformation environments are different, dynamic and diverse
Hybrid and multi-cloud data environments are the new normal

Q. Thinking of your IT environment, select the platforms where data is, or will be persisted on when applying data
integration and integrity functions.

n = 289
Source: Data Integration and Integrity End User Survey 2017, IDC, November, 2017

Key Takeaway: Data is becoming more distributed
Data is more distributed and the technologies we use to manage it are more diverse, resulting in distributed technical complexity
Data management technologies that are being used to store data, cross referenced with the platforms where data is.

[Figure: Data Technology by Deployment. Bars for On-Premises Only, Hybrid and Cloud Only, in % of respondents; series: Flat Files, Hadoop, Relational, NoSQL, Analytical, In-Memory]

Select all the types of data that are or will be processed by your data integration / integrity solutions; Now, within 6 months, 12 months, 18 months.

[Figure: Variety of Data Integrated by 2020. Number of data types integrated (One to Ten), in % of respondents; series: Today, By 2020]



n = 300
Source: Data Integration and Integrity End User Survey 2017, IDC, November, 2017

Key Takeaway: Diversity of data and technologies drives complexity
Custom code, community open source software and spreadsheets are
prevalent in data integration
What approximate percent of the DII solutions in your organization have been or will be deployed within the next 6 months using each of the following methods:

% Distribution of Alternatives, 2017 (n = 300)
Commercial: 49.3%
Custom Code: 22.6%
Other Enterprise Software: 15.3%
Community Open Source: 12.8%

How often do you use spreadsheets for the following activities? How often do you use each of the following spreadsheet functions?

Frequency of Spreadsheet Activity, Number of Times per Week (n = 207)
Data Sorting: 4.25
Data Shaping: 4.01
Data Prep for Presentations: 3.90
Data Augmentation: 3.74
Data Prep for BI Software: 3.73
Data Visualization: 3.71
What-If Analysis: 3.66
Data Cleansing: 3.63
Pivot Tables: 3.57

51% use copy / paste to import data into spreadsheets!!

Source: Data Integration and Integrity End User Survey 2017, IDC, November, 2017

Key Takeaway: Spreadsheets are the shadow IT of distributed data integration, degrading trust
Data in the era of DX

1 Data is the lifeblood of DX and data integrity is critical for the success of DX initiatives

2 Data in the era of DX is dynamic, diverse, and distributed

3 Data governance and integrity are being challenged by uncontrolled persistence, access and consumption

The 80/20 rule is still in effect: 80% of time is being spent on searching,
preparing and protecting data with only 20% being spent on analysis

How many hours per week on average do you spend on each of the following data related activities?

% of Time Spent on Data Activities (weekly)

Managing Data: 81%
  Searching: 20%
  Preparing: 37%
  Protecting: 24%
Analyzing: 19%

n = 300
Source: Data Integration and Integrity End User Survey 2017, IDC, November, 2017

Key Takeaway: Despite advancements in technology, the complexity and diversity of data continue to drive inefficiencies
On average, 25% of the time people spend searching, preparing and protecting data is wasted

How often are you successful in data asset searching, preparation, and protection?

Up to another 10 hours is wasted on re-creating existing assets

n = 225
Source: Data Integration and Integrity End User Survey 2017, IDC, November, 2017

Key Takeaway: The state of data complexity is resulting in wasted time and money
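
The figures on the two preceding slides can be combined into a rough estimate. The sketch below is a back-of-the-envelope calculation, not an IDC result: it assumes the 25% average waste applies evenly to searching, preparing and protecting, and the 40-hour week of data work is a hypothetical input chosen only for illustration.

```python
# Share of weekly data time per activity, in percent, as shown on the slide.
TIME_SHARE = {"searching": 20, "preparing": 37, "protecting": 24, "analyzing": 19}

# Average fraction of searching/preparing/protecting time reported as wasted.
WASTED_FRACTION = 0.25


def managing_share(shares):
    """Percent of data time spent on everything except analysis."""
    return sum(pct for activity, pct in shares.items() if activity != "analyzing")


def wasted_share(shares, wasted_fraction=WASTED_FRACTION):
    """Percent of all data time that is wasted (assumes uniform waste)."""
    return managing_share(shares) * wasted_fraction


def wasted_hours(hours_per_week, shares=TIME_SHARE):
    """Hours wasted per week for a hypothetical number of data-work hours."""
    return hours_per_week * wasted_share(shares) / 100


if __name__ == "__main__":
    print(f"Managing data: {managing_share(TIME_SHARE)}% of data time")
    print(f"Wasted: {wasted_share(TIME_SHARE):.2f}% of data time")
    # 40 hours is an assumed figure, not a number from the survey.
    print(f"Wasted hours in a 40-hour week: {wasted_hours(40):.1f}")
```

Under these assumptions roughly a fifth of all time spent working with data is lost before any analysis happens. The separate "up to another 10 hours" spent re-creating existing assets is not included in this estimate.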
Organizations feel the pain of these inefficiencies, knowing there is value in trusting and understanding data

Please indicate the importance of the following as it relates to your work with data and information assets

[Figure: Importance of Data and Information Asset Attributes. Bars show % of respondents rating each attribute Very Important or Important, grouped under "Attributes that Infuse Trust" and "Data Intelligence Attributes". Attributes shown: Assets are Consistent and Complete; Knowing if Assets can be Trusted; Knowing who is Using the Asset; Access to Timely Assets; Knowing Asset Business Terms; Asset Ownership and Responsibility; Understanding Relationships and Lineage; Understanding Asset Context; Ability to Find Assets; Separate Governed from Ad-Hoc Assets]

n = 225


Source: Data Integration and Integrity End User Survey 2017, IDC, November, 2017

Key Takeaway: How an organization is enabled by data is a differentiator
Symptoms illustrating a lack of data intelligence

1 The current state of data intelligence is costing organizations time and money

2 Trust in data can be improved through data enablement supported by intelligence

3 There is potential to invert the 80/20 rule with data intelligence

It’s elementary: Data intelligence software answers the 5 W’s of data
The who, what, where, when, why and how of data and data relationships
The why, what, who and how of data intelligence software in data governance:

WHY: Organizations lack data knowledge for efficient and effective data governance activities; 30% of the time spent on governance is wasted.

WHAT: Data intelligence software is used for data discovery, cataloging, profiling, mastering, and lineage; uncovering the data supply chain.

WHO: Chief Data Officer; Chief Information Officer; IT Director; Line of Business IT Managers; Data Security Officer; Data Stewards and Owners.

HOW: Data intelligence informs data professionals with the knowledge required to govern data assets, and enables the organization with data.

Key Takeaway: Data intelligence enables organizations with data, infusing trust with knowledge
IDC’s view of the Data Intelligence Software Market

[Figure: market diagram. Segments shown: Data Cataloging, Data Lineage, Master Data Intelligence, Data Stewardship and Profiling, Data Quality, Self-Service Data Preparation. Label: Other DII Segments. Use cases: Data Governance, Data Quality Management, Self-Service Data]

Key Takeaway: Data governance, self-service and quality management enabled by data intelligence
Growth is being driven by regulations, recognition of data asset value, and because
of the current state of data intelligence

How do you currently perform data discovery and cataloging in your organization?

n = 225
Source: Data Integration and Integrity End User Survey 2017, IDC, November, 2017

Key Takeaway: The current state of data intelligence is mostly manual and likely irrelevant
The current state of data intelligence is also reflected in the maturity of
data governance in the US

▪ Too much focus on technology

▪ Governance = No

▪ Resource constraints

▪ Lack of data intelligence

▪ Lack of data literacy

Source: IDC, IDC MaturityScape Benchmark: Data Governance in the United States, 2017, #US41714617

Key Takeaway: It’s time to turn the No of Data Governance into the Yes of Data Enablement
Identify the key stakeholders of data intelligence – and the roles required
to implement

Data Enablement with Data Intelligence RACI Chart

                           CDO  CIO  CSO  CPO  ITD  LOBIT  DS  DO  DU
Data Dictionary             A    C    I    I    C     C    R   C   C
Business Glossary           A    C    I    I    C     C    R   R   C
Catalog                     A    C    I    I    C     C    R   R   C
Lineage                     A    C    I    I    C     R    C   C   C
Stewardship and Profiling   A    C    I    I    C     C    R   C   C
Master Data Intelligence    A    C    I    I    C     R    C   R   C

Notes:
R - Responsible, A - Accountable, C - Consulted, I - Informed

▪ CDO – Chief Data Officer
▪ CIO – Chief Information Officer
▪ CSO – Chief Security Officer
▪ CPO – Chief Protection Officer
▪ ITD – IT Directors
▪ LOBIT – Line of Business IT
▪ DS – Data Stewards
▪ DO – Data Owners
▪ DU – Data Users

Key Takeaway: Data intelligence isn’t an IT-only project; it requires collaboration with the business and roles that are accountable for data and data outcomes
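
A RACI chart like the one on this slide can also be held as data so that it can be checked when it is adapted to another organization. The sketch below is illustrative only: the assignments are copied from the slide, while the helper names and the rule being checked (exactly one Accountable role and at least one Responsible role per deliverable) are assumptions drawn from common RACI practice, not something the slides state.

```python
# Role columns in the order they appear on the slide.
ROLES = ["CDO", "CIO", "CSO", "CPO", "ITD", "LOBIT", "DS", "DO", "DU"]

# One row per deliverable; letters follow the ROLES order above.
RACI = {
    "Data Dictionary":           "A C I I C C R C C",
    "Business Glossary":         "A C I I C C R R C",
    "Catalog":                   "A C I I C C R R C",
    "Lineage":                   "A C I I C R C C C",
    "Stewardship and Profiling": "A C I I C C R C C",
    "Master Data Intelligence":  "A C I I C R C R C",
}


def roles_with(code, assignments):
    """Return the roles that hold the given RACI letter in one row."""
    return [role for role, c in zip(ROLES, assignments.split()) if c == code]


def check_chart(chart):
    """Return a list of problems; an empty list means the chart passes."""
    problems = []
    for deliverable, assignments in chart.items():
        if len(assignments.split()) != len(ROLES):
            problems.append(f"{deliverable}: wrong number of columns")
            continue
        if len(roles_with("A", assignments)) != 1:
            problems.append(f"{deliverable}: needs exactly one Accountable role")
        if not roles_with("R", assignments):
            problems.append(f"{deliverable}: needs at least one Responsible role")
    return problems


if __name__ == "__main__":
    for deliverable, assignments in RACI.items():
        print(f"{deliverable}: A={roles_with('A', assignments)}, "
              f"R={roles_with('R', assignments)}")
    print("Problems:", check_chart(RACI))
```

Read this way, the chart places accountability with the CDO for every deliverable, while responsibility sits with Data Stewards, Data Owners or Line of Business IT depending on the deliverable.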
ROI can build a business case for data intelligence

▪ Regulatory fines

▪ Reduced Risk

▪ Employee productivity

▪ Better business outcomes

Key Takeaway: Compliance reduces risk, improves productivity and outcomes
Prescribing and applying data intelligence

1 Use data intelligence software to answer the 5 W’s + Relationship of data

2 Assess where your organization is on the data governance maturity curve

3 Build a business case for further investment and plan your enablement program

DELIVERING TRUSTED DATA AT THE SPEED OF BUSINESS WITH TALEND
JEAN-MICHEL FRANCO
BUSINESS NEEDS DATA AT SPEED

“Speed is the critical competitive advantage”
- Reid Hoffman

Time to value
Time to adapt to change

BUSINESS NEEDS DATA YOU CAN TRUST

Quality proofed. Governed.

DATA: 47% of data has integrity issues
PEOPLE: 81% of time spent looking for trusted data
ORGANIZATION: +70% increase in the cost of bad data per year

ORGANIZATIONS ARE FORCED TO MAKE A CHOICE BETWEEN SPEED AND TRUST

SPEED: Hand coding, Point solution (unscalable, ungoverned)
TRUST: Legacy enterprise solutions (expensive, slow, restrictive)

TALEND DELIVERS BOTH SPEED AND TRUST WITHOUT COMPROMISE

FROM REALITY TO THE PROMISED LAND

A THREE-STEP PLAN
TO DELIVER DATA YOU CAN TRUST

#1 DISCOVER & CLEAN

DISCOVER & CLEAN

• EXPLORE ANY DATA
• HIGHLIGHT DATA QUALITY ISSUES
• DELEGATE CLEANSING IN THE CLOUD

#1 DISCOVER & CLEAN
#2 ORGANIZE & EMPOWER

ORGANIZE & EMPOWER

• CREATE A SINGLE SOURCE OF TRUST
• ENCOURAGE PEOPLE WITH DATA CURATION
• ORCHESTRATE STEWARDSHIP

#1 DISCOVER & CLEAN
#2 ORGANIZE & EMPOWER
#3 AUTOMATE & ENABLE

AUTOMATE & ENABLE

• LEARN WITH ML FOR REMEDIATION
• ENABLE EVERYONE WITH CLOUD APPS
• PUBLISH TRUSTED DATA WITH API SERVICES

DELIVER DATA YOU CAN TRUST

#1 DISCOVER & CLEAN
#2 ORGANIZE & EMPOWER
#3 AUTOMATE & ENABLE

FROM DATA INTEGRATION TO INTELLIGENCE

Axis label: Trust

DATA INTELLIGENCE (+ Talend Data Catalog)
• DATA CATALOGING
• DATA LINEAGE
• METADATA MGMT

DATA INTEGRITY (e.g. Cloud Data Management Platform)
• DATA PREPARATION
• DATA STEWARDSHIP
• DATA QUALITY

DATA INTEGRATION (e.g. Stitch, Cloud Data Integration)
• APPLICATION INTEGRATION
• DATA INTEGRATION
• DATA LOADING

4 USE CASES FOR DATA GOVERNANCE

Governed Analytics
• Reach a wider audience
• Enforce control
• Crowdsource knowledge

Data Compliance & Privacy
• GDPR, PDPA, CCPA…
• BCBS 239, IFRS, IDMP…
• ACORD, CDISC…

IT Modernization & Change Mgmt
• Change analysis & migrations
• Establish a single point of Mgmt
• Data Auditing

The Data Marketplace
• Self-Service Analytics
• Customer 360
• Data Monetization

PROVIDING SUSTAINABLE ENERGY WITH A DIGITAL PLATFORM

• Accelerated time for analytics by up to 70%
• Cost of integrating data reduced by 80%
• Up to 30% additional revenue in Trading & Generation

70% Faster, 80% Cheaper, 100% compliant
YOUR SPEEDWAY TO TRUSTED DATA
OUR DEFINITIVE GUIDE
https://info.talend.com/definitiveguidedatagovernance.html

Questions?
