You are on page 1of 29

Data Governance

Micr
oso

ORIGINAL
Microsoft 365

DATA GOVERNANCE

Electronic Version is Current—Uncontrolled Hard Copy Valid Only at the Time of Printing

Document Number …..


Document Title
Version
Issuing Department
Legacy

Security Class
Retention
Next Review Date
Storage Type
OMF

Country Legal Discipline Document Type


ID GEN (General) Information
Technology (IT)

Automatic Electronic Approvals


Action Name Date
Created/Updated Hairistryan 16 February 2022
Approved
Published
Comment

Approved

Data Governance confidential


Page 2 of 29
CHANGE SHEET

All major changes to the document at this current revision are recorded below.
(Does not include correction of typos or formatting changes)

Revision: Date: Originator: Description of Change:

0.1 Hari ini Infimedia 1.0

0.2

0.3

0.3

0.4

Data Governance confidential


Page 3 of 29
TABLE OF CONTENTS

WHAT IS DATA GOVERNANCE AND WHY DOES IT MATTER?.....................................5


WHY DATA GOVERNANCE MATTERS..........................................................................................................................
DATA GOVERNANCE GOALS AND BENEFITS...............................................................................................................
WHO'S RESPONSIBLE FOR DATA GOVERNANCE?......................................................................................................
COMPONENTS OF A DATA GOVERNANCE FRAMEWORK..........................................................................................
DATA GOVERNANCE IMPLEMENTATION.....................................................................................................................
BEST PRACTICES FOR MANAGING DATA GOVERNANCE INITIATIVES.....................................................................
DATA GOVERNANCE CHALLENGES...........................................................................................................................
KEY DATA GOVERNANCE PILLARS.............................................................................................................................
DATA GOVERNANCE VENDORS AND TOOLS...........................................................................................................
WHAT IS THE BEST STRUCTURE FOR DATA GOVERNANCE PROGRAMS?...............16
WHY ORGANIZATIONS NEED A SOLID DATA GOVERNANCE STRATEGY................17
THE DAWN OF DATA GOVERNANCE........................................................................................................................
DATA MANAGEMENT STRATEGY BREAKDOWNS.....................................................................................................
THE BENEFITS OF EFFECTIVE DATA GOVERNANCE..................................................................................................
BREAKING DOWN DATA SILOS WITH STRONG DATA GOVERNANCE.....................22
WHAT IS A DATA SILO?.............................................................................................................................................
WHY ARE DATA SILOS BAD?.....................................................................................................................................
DATA GOVERNANCE AND BREAKING DOWN DATA SILOS.....................................................................................
BUILDING A DATA GOVERNANCE POLICY................................................................................................................

Data Governance confidential


Page 4 of 29
WHAT IS DATA GOVERNANCE AND WHY DOES IT MATTER?

Data governance (DG) is the process of managing the availability, usability, integrity and
security of the data in enterprise systems, based on internal data standards and policies
that also control data usage. Effective data governance ensures that data is consistent
and trustworthy and doesn't get misused. It's increasingly critical as organizations face
new data privacy regulations and rely more and more on data analytics to help optimize
operations and drive business decision-making.

A well-designed data governance program typically includes a governance team, a


steering committee that acts as the governing body, and a group of data stewards. They
work together to create the standards and policies for governing data, as well as
implementation and enforcement procedures that are primarily carried out by the data
stewards. Executives and other representatives from an organization's business
operations take part, in addition to the IT and data management teams.

While data governance is a core component of an overall data management strategy,


organizations should focus on the desired business outcomes of a governance program
instead of the data itself, Gartner analyst Andrew White wrote in a December 2019 blog
post. This comprehensive guide to data governance further explains what it is, how it
works, the business benefits it provides and the challenges of governing data. You'll also
find an overview of data governance software and related tools. Click through the
hyperlinks to get expert advice and read about data governance trends and best
practices.

WHY DATA GOVERNANCE MATTERS

Without effective data governance, data inconsistencies in different systems across an


organization might not get resolved. For example, customer names may be listed
differently in sales, logistics and customer service systems. That could complicate data
integration efforts and create data integrity issues that affect the accuracy of business
intelligence (BI), enterprise reporting and analytics applications. In addition, data errors
might not be identified and fixed, further affecting BI and analytics accuracy.

Data Governance confidential


Page 5 of 29
Reasons to have a data governance program

Poor data governance can also hamper regulatory compliance initiatives, which could
cause problems for companies that need to comply with new data privacy and
protection laws, such as the European Union's GDPR and the California Consumer
Privacy Act (CCPA). An enterprise data governance program typically results in the
development of common data definitions and standard data formats that are applied in
all business systems, boosting data consistency for both business and compliance uses.

DATA GOVERNANCE GOALS AND BENEFITS

A key goal of data governance is to break down data silos in an organization. Such silos
commonly build up when individual business units deploy separate transaction
processing systems without centralized coordination or an enterprise data architecture.
Data governance aims to harmonize the data in those systems through a collaborative
process, with stakeholders from the various business units participating.

Another data governance goal is to ensure that data is used properly, both to avoid
introducing data errors into systems and to block potential misuse of personal data
about customers and other sensitive information. That can be accomplished by creating
uniform policies on the use of data, along with procedures to monitor usage and

Data Governance confidential


Page 6 of 29
enforce the policies on an ongoing basis. In addition, data governance can help to strike
a balance between data collection practices and privacy mandates.

Besides more accurate analytics and stronger regulatory compliance, the benefits that
data governance provides include improved data quality; lower data management costs;
and increased access to needed data for data scientists, other analysts and business
users. Ultimately, data governance can help improve business decision-making by giving
executives better information. Ideally, that will lead to competitive advantages and
increased revenue and profits. Read more about the benefits of a successful data
governance strategy and how to build one in an article by data management consultant
Andy Hayler.

WHO'S RESPONSIBLE FOR DATA GOVERNANCE?

In most organizations, various people are involved in the data governance process. That
includes business executives, data management professionals and IT staffers, as well as
end users who are familiar with relevant data domains in an organization's systems.
These are the key participants and their primary governance responsibilities.

Chief data officer The chief data officer (CDO), if there is one, often is the senior
executive who oversees a data governance program and has high-level responsibility for
its success or failure. The CDO's role includes securing approval, funding and staffing for
the program, playing a lead role in setting it up, monitoring its progress and acting as
an advocate for it internally. If an organization doesn't have a CDO, another C-suite
executive usually will serve as an executive sponsor and handle the same functions.

Data governance manager and team In some cases, the CDO or an equivalent
executive -- a director of enterprise data management, for example -- may also be the
hands-on data governance program manager. In others, organizations appoint a data
governance manager or lead specifically to run the program. Either way, the program
manager typically heads a data governance team that works on the program full time.
Sometimes more formally known as the data governance office, it coordinates the

Data Governance confidential


Page 7 of 29
process, leads meetings and training sessions, tracks metrics, manages internal
communications and carries out other management tasks.

Data governance committee the governance team usually doesn't make policy or
standards decisions, though. That's the responsibility of the data governance committee
or council, which is primarily made up of business executives and other data owners.
The committee approves the foundational data governance policy and associated
policies and rules on things like data access and usage, plus the procedures for
implementing them. It also resolves disputes, such as disagreements between different
business units over data definitions and formats.

Data stewards the responsibilities of data stewards include overseeing data sets to
keep them in order. They're also in charge of ensuring that the policies and rules
approved by the data governance committee are implemented and that end users
comply with them. Workers with knowledge of particular data assets and domains are
generally appointed to handle the  data stewardship role. That's a full-time job in
some companies and a part-time position in others; there can also be a mix of IT and
business data stewards.

Key participants in the data governance process

Data Governance confidential


Page 8 of 29
Data architects, data modelers and data quality analysts and engineers are also part of
the governance process. In addition, business users and analytics teams must be trained
on data governance policies and data standards so they can avoid using data in
erroneous or improper ways. You can learn more about data governance roles and
responsibilities and how to structure a governance program in a related article.

COMPONENTS OF A DATA GOVERNANCE FRAMEWORK

A data governance framework consists of the policies, rules, processes, organizational


structures and technologies that are put in place as part of a governance program. It
also spells out things such as a mission statement for the program, its goals and how its
success will be measured, as well as decision-making responsibilities and accountability
for the various functions that will be part of the program. An organization's governance
framework should be documented and shared internally to show how the program will
work, so that's clear to everyone involved upfront.

On the technology side, data governance software can be used to automate aspects of


managing a governance program. While data governance tools aren't a mandatory
framework component, they support program and workflow management,
collaboration, development of governance policies, process documentation, the creation
of data catalogs and other functions. They can also be used in conjunction with data
quality, metadata management and master data management (MDM) tools.

DATA GOVERNANCE IMPLEMENTATION

The initial step in implementing a data governance framework involves identifying the
owners or custodians of the different data assets across an enterprise and getting them
or designated surrogates involved in the governance program. The CDO, executive
sponsor or dedicated data governance manager then takes the lead in creating the
program's structure, working to staff the data governance team, identify data stewards
and formalize the governance committee.

Data Governance confidential


Page 9 of 29
Once the structure is finalized, the real work begins. The data governance policies and
data standards must be developed, along with rules that define how data can be used
by authorized personnel. Moreover, a set of controls and audit procedures are needed
to ensure ongoing compliance with internal policies and external regulations and
guarantee that data is used in a consistent way across applications. The governance
team should also document where data comes from, where it's stored and how it's
protected from mishaps and security attacks.

Data governance initiatives usually also include the following elements.

Data mapping and classification. Mapping the data in systems helps document data
assets and how data flows through an organization. Different data sets can then be
classified based on factors such as whether they contain personal information or other
sensitive data. The classifications influence how data governance policies are applied to
individual data sets.

Business glossary. A business glossary contains definitions of business terms and


concepts used in an organization -- for example, what constitutes an active customer. By
helping to establish a common vocabulary for business data,  business glossaries can aid
governance efforts.

Data catalog. Data catalogs collect metadata from systems and use it to create an
indexed inventory of available data assets that includes information on data lineage,
search functions and collaboration tools. Information about data governance policies
and automated mechanisms for enforcing them can also be built into catalogs.
Consultant Anne Marie Smith details the key steps for building a data catalog.

BEST PRACTICES FOR MANAGING DATA GOVERNANCE INITIATIVES

To the extent that data governance may impose strictures on how data is handled and
used, it can become controversial in organizations. A common concern among IT and
data management teams is that they'll be seen as the "data police" by business users if
they lead data governance programs. To promote user buy-in and avoid resistance to

Data Governance confidential


Page 10 of 29
governance policies, experienced data governance managers and industry consultants
recommend that programs be business-driven, with data owners involved and the
governance committee making the decisions on standards, policies and rules.

"Only by agreeing to corporate-wide data governance with responsibility by business


units will the foundations be laid for successful data management across the enterprise,"
Hayler wrote in an article about the need to eliminate incompatible data silos.

Training and education on data governance is a necessary component of initiatives,


particularly to familiarize business users and data analysts with data usage rules, privacy
mandates and their responsibility for helping to keep data sets consistent. Ongoing
communication with corporate executives, business managers and end users about the
progress of a data governance program is also a must, via a combination of reports,
email newsletters, workshops and other outreach methods.

In a report published in October 2019, Gartner analyst Saul Judah listed these seven
foundations for successfully governing data and analytics applications:

 a focus on business value and organizational outcomes.

 internal agreement on data accountability and decision rights.

 a trust-based governance model that relies on data lineage and curation.

 transparent decision-making that hews to a set of ethical principles.

 risk management and data security included as core governance components.

 ongoing education and training, with mechanisms to monitor their


effectiveness. and

 a collaborative culture and governance process that encourages broad


participation.

Professional associations that promote best practices in data governance processes


include DAMA International and the Data Governance Professionals Organization. The

Data Governance confidential


Page 11 of 29
Data Governance Institute, an organization founded in 2004 by then-consultant Gwen
Thomas, published a data governance framework template and a variety of guidance on
governance best practices. It's no longer active, but the information is still available on
its website. Similar guidance is also available elsewhere -- for example, in the Data
Management University online library maintained by consultancy EWSolutions.

DATA GOVERNANCE CHALLENGES

Often, the early steps in data governance efforts can be the most difficult because it's
characteristic that different parts of an organization have diverging views of key
enterprise data entities, such as customers or products. These differences must be
resolved as part of the data governance process -- for example, by agreeing on
common data definitions and formats. That can be a fraught and fractious undertaking,
which is why the data governance committee needs a clear dispute-resolution
procedure.

Other common challenges that organizations face on data governance include the
following.

Demonstrating its business value. That often starts at the very beginning: "It can be a
real struggle to get your data governance initiative approved in the first place," data
governance consultant and trainer Nicola Askham wrote in a September 2019 blog post.
To help build a business case for a data governance program, Askham recommended
that proponents document data quality horror stories and tie the expected outcomes of
the program to specific corporate priorities.

On an ongoing basis, demonstrating business value requires the development of


quantifiable metrics, particularly on data quality improvements. That could include the
number of data errors resolved on a quarterly basis and the revenue gains or cost
savings that result from them. Other common data quality metrics measure accuracy
and error rates in data sets and related attributes such as data completeness and
consistency. Read more about the close ties between data governance and data quality,

Data Governance confidential


Page 12 of 29
plus other kinds of metrics that can also be used to show the value of a governance
program.

Supporting self-service analytics. The self-service BI and analytics movement has


created new data governance challenges by putting data in the hands of more users in
organizations. Governance programs must make sure data is accurate and accessible for
self-service users, while also ensuring that those users -- business analysts, executives
and citizen data scientists, among others -- don't misuse data or run afoul of data
privacy and security restrictions. Streaming data that's used for real-time analytics
further complicates those efforts.

Governing big data. The deployment of big data systems also adds new governance
needs and challenges. Data governance programs traditionally focused on structured
data stored in relational databases, but now they must deal with the mix of structured,
unstructured and semi-structured data that big data environments typically contain, as
well as a variety of data platforms, including Hadoop and Spark systems, NoSQL
databases and cloud object stores. Also, sets of big data are often stored in raw form in
data lakes and then filtered as needed for analytics uses. A related article offers more
details on the challenges and advice on best practices for big data governance.

KEY DATA GOVERNANCE PILLARS

Data governance programs are underpinned by several other facets of the overall data
management process. Most notably, that includes the following:

 Data stewardship. As discussed earlier, an essential responsibility of the data


steward is to be accountable for a portion of an organization's data, with job
duties in areas such as data quality, security and usage. Teams of data
stewards typically are formed to help guide and execute the implementation
of data governance policies. Often, they're data-savvy business users who are
subject matter experts in their domains, although data steward can also be an
IT position. Data stewards collaborate with data quality analysts, database
administrators and other data management professionals, while also working

Data Governance confidential


Page 13 of 29
with business units to identify data requirements and issues. In his December
2019 blog post, Gartner's White also pointed to an emerging need for
analytics stewardship that would handle similar functions specifically for
analytics systems, calling it "a missing link in analytics, BI and data science."

 Data quality. Data quality improvement is one of the biggest driving forces


behind data governance activities. Data accuracy, completeness and
consistency across systems are crucial hallmarks of successful governance
initiatives. Data cleansing, also known as data scrubbing, is a common data
quality element. It fixes data errors and inconsistencies and also correlates
and removes duplicate instances of the same data elements, thus
harmonizing the various ways in which the same customer or product may be
listed in systems. Data quality tools provide those capabilities through data
profiling, parsing and matching functions, among other features. Get tips on
managing data quality efforts in an article by managed services strategist and
consultant Chris Foot.

 Master data management. MDM is another data management discipline


that's closely associated with data governance processes. MDM initiatives
establish a master set of data on customers, products and other business
entities to help ensure that the data is consistent in different systems across
an organization. As a result, MDM naturally dovetails with data governance.
Like governance programs, though, MDM efforts can create controversy in
organizations because of differences between departments and business units
on how to format master data. In addition, MDM's complexity has limited its
adoption compared to data governance. But the combination of the two has
led to a shift toward smaller-scale MDM projects driven by data governance
goals, as explained in a separate article.

 Data governance use cases. Effective data governance is at the heart of


managing the data used in operational systems and the BI and analytics
applications fed by data warehouses, data marts and data lakes. It's also a
particularly important component of digital transformation initiatives, and it
can aid in other corporate processes, such as risk management, business

Data Governance confidential


Page 14 of 29
process management, and mergers and acquisitions. As data uses continue to
expand and new technologies emerge, data governance is likely to gain even
wider application. For example, efforts are underway to apply data
governance processes to machine learning algorithms and other AI tools.
Also, high-profile data breaches and laws like GDPR and the California
Consumer Privacy Act have made data protection and privacy more central to
governance efforts. Compliance with the GDPR and CCPA privacy directives is
another new use case for data governance -- Hayler offers advice on building
privacy protections into governance policies to meet those requirements.

Some of the core elements of data governance initiatives

DATA GOVERNANCE VENDORS AND TOOLS

Data governance tools are available from various vendors. That includes major IT
vendors, such as IBM, Informatica, Information Builders, Oracle, SAP and SAS Institute, as
well as data management specialists like Adaptive, ASG Technologies, Ataccama,
Collibra, Erwin, Infogix and Talend. In most cases, the governance tools are offered as
part of larger suites that also incorporate metadata management features and data
lineage functionality.

Data Governance confidential


Page 15 of 29
Data catalog software is included in many of the data governance and metadata
management platforms, too. It's also available as a stand-alone product from vendors
such as Alation, Alteryx, Boomi, Cambridge Semantics and Data.world. Learn more about
the features that data catalog software offers, including its governance-related
capabilities.

Data Governance confidential


Page 16 of 29
WHAT IS THE BEST STRUCTURE FOR DATA GOVERNANCE
PROGRAMS?

The optimal approach to a data governance framework includes a program team, a data
governance council and stewards. Expert Anne Marie Smith explains what role each
group plays.

Many organizations struggle with the design and implementation of their data
governance programs for a variety of reasons. They may not understand the need for an
enterprise approach to data governance and may attempt to implement policies via a
line of business, an application or a functional area division.

This is a recipe for failure, as all successful data governance programs are enterprise-


wide; they span all lines of business, all applications and all functional areas. Without
this holistic view and the support of all the enterprise's areas, the goals of data
governance will not be met.

Data governance is the development and implementation of policies, practices,


standards and procedures that manage data and information across an organization. If
all of the above are not applied across the organization, then the existing data anomaly
collisions will remain and be reinforced.

There is an optimal organizational structure that successful data governance programs


use. The structure includes:

 A data governance program team comprising data management and data


governance professionals. The team members develop the policies and
practices in conjunction with the lead business data stewards and set the
standards for data and metadata to be used throughout the organization.

 A data governance council consisting of senior leadership representatives


from all the business areas and IT. This council approves the policies, practices

Data Governance confidential


Page 17 of 29
and standards for the program team. They also appoint the data stewards and
select the lead business data stewards for each subject area.

 A set of data stewardship teams from each business area that helps design
the policies and practices with the data governance team and implement
them in conjunction with the data custodians from IT. Each team of data
stewards is directed by a lead business data steward. The leads then form a
data stewardship coordinating group that settles cross-group differences with
policies and practices and refines definitions and other metadata as needed.

Although these groups may use different names in other organizations, this structure
has been proven to be the most successful approach to implementing enterprise data
governance programs.

WHY ORGANIZATIONS NEED A SOLID DATA GOVERNANCE


STRATEGY
The flood of data flowing into data warehouses, data lakes and other systems makes
effective data governance a must for successful business analytics initiatives.

When I started working as a database administrator in 1983, IT was all about


centralization. Data was safely kept on the corporate mainframe, and only those
programmers who had the skills to navigate prerelational databases could access it.
Nearly four decades later, it's all about data democratization -- and the
corresponding need for a strong data governance strategy.

Back in the day, business analysts had to go cap in hand to the IT department because
they didn't know how to navigate an IMS or IDMS database and wouldn't have been
granted access even if they could. The IT department printed out monthly reports and
distributed them, like Moses descending from the mountain with tablets of stone. There
was no real notion of, or requirement for, data governance.

With the advent of the PC, the balance of power shifted radically. Suddenly, business
users had access to spreadsheets and could create their own calculations and analyses,

Data Governance confidential


Page 18 of 29
even if corporate data was still mostly out of reach. Then came client-server computing
and a rush to decentralize that data, which brought giddying new possibilities but also
confusion, as different departments used different versions of data and fought over
whose version was correct.

Data analytics could now be done by business analysts and executives. But without
agreement on the legitimacy of the data sources, chaos ensued. The mushrooming data
silos hampered business intelligence (BI) and analytics efforts, and the first glimmers of
a need to better govern data came to light.

THE DAWN OF DATA GOVERNANCE

IT initially responded to those challenges with the data warehouse, which gathered up


data from disconnected transaction systems for the sole purpose of enabling analytics.
In addition, clever BI and reporting tools appeared that made it easier to manipulate,
join and summarize raw tables of transaction data for analysis -- maybe even by
downloading the data to spreadsheets.

Sure, the original data was still stored in different applications and formats. But with
enough effort, a data warehouse could be coaxed into making sense of it all by
providing consolidated data in multiple dimensions, such as customer, product, asset
and location. However, to actually produce consistent sets of data, the inconsistencies of
the underlying systems had to be resolved.

Data Governance confidential


Page 19 of 29
Survey results on enterprise data governance programs

Master data management (MDM) was born and, alongside it, the concept of a data
governance strategy arose. Business users were encouraged or cajoled into deciding
which classifications of customers and products were "golden records" to be held aloft
across the enterprise and which were to be cast into the wilderness of department-
specific, local terminology. This frequently was -- and still is -- an acrimonious process,
with different departments and data owners arguing over the best way to classify data.

Organizations also began creating enterprise data governance programs, typically with a


data governance council that sets data policies and data stewards who oversee data and
ensure that the policies are implemented. Some corporate cultures suit this approach

Data Governance confidential


Page 20 of 29
more than others. Highly centralized companies are used to having things dictated from
on high, but decentralized ones often rail against that and struggle to keep within
ordained data governance boundaries. Analysts in such companies think of themselves
as freedom fighters, whereas IT managers may regard them as data terrorists.

DATA MANAGEMENT STRATEGY BREAKDOWNS

It seems clear that, in a lot of companies, the freedom fighters are now in the ascendant.
A sign of this is the growing market for self-service data preparation tools. These
products can access data from a wide variety of sources, including traditional databases,
big data systems, packaged business applications, Excel and systems outside the
corporate firewall. They enable some data quality techniques, such as data profiling, and
empower business users and data scientists to set up data transformations and
manipulate data to their heart's content. Extracts, transformations and data scrubbing
can also be automated via repeatable workflow processes.

Such a market simply wouldn't exist if corporate data warehouses and MDM systems
were doing their job. Data quality checks and transformations are supposed to happen
before data is fed into a data warehouse. The trouble is that the data warehouse has
been stretched beyond its natural limits. Data now comes from so many sources and in
such volumes that traditional data management approaches are breaking down.

E-commerce systems may generate web traffic logs of such size that normal databases
can't handle the processing. Sensors on vehicles and machinery -- airplanes, cars, smart
meters, elevators, pipelines and so forth -- transmit huge volumes of streaming data. All
this is in addition to corporate transaction data, plus data from business partners and
data brokers.

With that much data coming at you, who has time for meetings to discuss the merits of
different customer classification hierarchies or the attributes of a particular data asset as
part of an MDM project?

Data Governance confidential


Page 21 of 29
THE BENEFITS OF EFFECTIVE DATA GOVERNANCE

Nonetheless, corporations need to take back control of this fast-flowing stream of data
if they are to make sense of it for enterprise business analytics uses -- also to help boost
data security and compliance with GDPR, the California Consumer Privacy Act and other
new laws that regulate the use of personal data by companies.

Developing a data governance strategy may not be a sexy topic, but it's at the heart of
what needs to be done from data collection and processing to analytics
applications. Data lakes can become data swamps if there's no way to peer into their
depths and bring some order to the data pooled there. And all the pretty charts that
analysts create with BI and analytics tools mean nothing if you can't agree on what the
underlying data means -- and whether it's trustworthy.

In the absence of some governance structure, we'll just be back to the old days, with
analysts waving their charts at each other and arguing over whose data is correct.
Putting the data genie back in the bottle is difficult, but in all too many organizations,
things now feel chaotic rather than managed. It's not about imposing rules from the top
down but about embedding data management and analytics discipline throughout the
layers of the organization. Otherwise, valuable business insights may be overlooked, and
potential competitive advantages lost.

A 2018 McKinsey report found that high-performing companies were twice as likely as


others to say they had a strong data governance strategy, and more than twice as likely
to say they had a clear and well-understood data strategy overall. The survey-based
report also reckoned that the gap between the high performers and the pack was
growing rapidly. Time is of the essence if you are to fully exploit analytics opportunities
and gain business benefits from them.

Data Governance confidential


Page 22 of 29
BREAKING DOWN DATA SILOS WITH STRONG DATA
GOVERNANCE

Not having a single source of truth can be a huge issue for data professionals. But
strong data governance policies can prevent data silos and lead to better-quality data.

One of the leading causes of data quality problems is siloed data. It produces duplicate
data and, in many cases, different understandings of important business processes.
That's why it's important to make breaking down data silos a central component of
any data quality initiative.

WHAT IS A DATA SILO?

A data silo is a collection of information that isn't effectively shared across an


organization. Although the business unit that owns the siloed system and perhaps some
more entities know the data exists, other parts of the organization are unaware of its
availability and its usage by those in the know.

Some of the most common causes of data silos are:

Lack of organizational oversight. The organization isn't effectively managing data as a


corporate asset. For example, there's no entity that governs data and how it's used at
the enterprise level.

Business growth. During growth periods, organizations often add new operational


units to better focus on key aspects of their business. The units create or acquire data
elements that meet what they think are their own unique business needs. As the number
of units expands, so do the data silos.

Cultural issues. Business units often see data they control as an asset that increases
their importance to the organization. A less dysfunctional reason for one to keep data
siloed is if it feels that sharing the information may lead to a loss of control over data
quality or the need to make unwanted changes to its systems if data definitions are
modified to meet enterprise needs.

Data Governance confidential


Page 23 of 29
Technical reasons. The increasing popularity of third-party and, more specifically, SaaS
applications is accelerating data silo growth. That's because many of these applications
store information in vendor-supplied cloud data silos.

WHY ARE DATA SILOS BAD?

If you have been in IT for any length of time, there's a good chance you understand
the problems data silos cause, including:

 No single source of truth to rely on for high-quality decision-making

 Duplication of data in systems

 Duplicate efforts to create and manage the same information

 The inability to build a 360-degree view of a customer, partner or product

 Not being able to identify enterprise-wide business trends

 Different definitions for the same data elements in separate systems, leading
to inconsistencies and poor data quality

DATA GOVERNANCE AND BREAKING DOWN DATA SILOS

Reducing the number of data silos and their negative impact will require that you
develop both prevention and corrective-action strategies. Here are some steps you
should take to do so.

First, build a culture of viewing all corporate data as an enterprise asset. Promote the
benefits of good data quality, data sharing and a single source of truth across IT and
business units.

Next, assign ownership of enterprise-wide data governance to a person or team,


depending on your organization's size. Give them the responsibility and authority for
governing all data assets and managing data quality, which includes consolidating data
silos and building data-sharing procedures. Their role should include helping developers

Data Governance confidential


Page 24 of 29
determine if the information they need already exists and enforcing where new data
elements will be stored.

Once you identify a data silo, meet with the owner, evaluate the data's usage and
determine if the silo should be consolidated, replaced or treated as a system of record
for the organization.

If you don't already have one, evaluate the need for a data warehouse environment to
consolidate data from multiple operational sources. Data warehouses help you to tightly
govern and easily share the information you are duplicating from operational systems.

Also, evaluate master data management software and data quality tools to facilitate
effective data governance and help you build a single source of truth. The alternatives
range from open-source offerings like Talend to commercial platforms from IBM,
Informatica, Information Builders, Oracle, SAP, SAS and other vendors.

BUILDING A DATA GOVERNANCE POLICY

A data governance policy outlines the roles, rules, processes and best practices that an
organization follows to ensure the quality and proper use of its data; it also can assist in
breaking down data silos. A suggested outline includes a statement of purpose that
defines the governance policy's mission and goals, backed up by executive sponsor
signatures. Additionally, a policy should cover the following aspects of a data
governance program:

Structure. An organizational structure is necessary, including senior management


sponsors, a steering committee that sets data standards and rules, team members
responsible for enterprise-level data quality, and data stewards who assist in the
governance of departmental information. Each role should include their governance
responsibilities, activities and authority.

Data creation. There should be policies that control the ownership, storage and
definition of new data elements. These policies should also include security and

Data Governance confidential


Page 25 of 29
regulatory framework classifications, access methods and data auditing, retention,
backup and archiving measures.

Data access. Data request procedures should cover security and regulatory adherence
reviews, read vs. update, data access procedures and tools, and performance impact
metrics.

Data usage. Write an ethical code-of-conduct for data that includes strictures on


misuse, unauthorized changes, deliberate falsification and intentional destruction.

Data integrity. Instate procedures that protect the quality and lineage of existing data,
including guidelines on modifying data definitions and updating data elements.

Data correction. Define procedures outlining the steps to identify, correct and


determine the root cause of poor-quality data.

Data sharing. Best practices for organizational data sharing include providing controls


for interdepartmental data modifications, provisions for identification and consolidation
of data silos, and shared data store creation and usage guidelines.

HOW TO SELECT THE RIGHT DATA GOVERNANCE TOOL

There are many tools on the market now that can help you with your data governance
initiative. In particular, there are numerous products that hold and manage your data
glossary, data catalogue and data dictionaries.  These have proved very popular and the
number of players in the market has increased over the last few years.

If you are lucky enough to have the budget to purchase such a tool, please make sure
that you're well prepared so that you can choose the right vendor for your
organisation’s needs. If you select the wrong tool, it won’t help your Data Governance
initiative and even worse it could distract from or even derail it!

Data Governance confidential


Page 26 of 29
To help you avoid making such a mistake I want look at some of the common pitfalls in
DG tool selection and the kind of questions you need to ask your vendors, so that you
are really clear on what you're looking for before you embark on a tool selection
process.

Let's look at the most common pitfalls first.  The three main ones that I've seen are:

·      Little or no business user involvement

·      Unclear requirements

·      Overly complex initial implementation

Taking each of these in turn:

Firstly, there's little or no business involvement early in the process. Many people wait
until the tool is purchased and even being implemented before they involve business
users.  In my experience, this is a huge problem and should be avoided at all costs.

I have seen a few implementations go wrong because the eventual business users were
not involved in selection.  Think about it from their point of view.  They have not asked
for such a tool, nor does it help them to do an existing task more quickly or easily.  So,
when you come to implement your shiny new tool, the business users feel they're
having some IT tool foisted upon them. Generally, they do not react well and I can recall
one instance when the whole implementation had to go back to the drawing board. 
Once the business users understood what they needed to use the tool for, their
requirements were vastly different from what had been delivered.

The second pitfall is being unclear on what you require of the tool. Often someone has
latched on to the fact that a tool could help them and dived straight in and bought one
without being really clear what they want the tool to do. Please make sure that you take
time to work out what your objective is from having the tool. Once you've worked that
out, progress to defining some clear detailed requirements (just a requirement to have a
data glossary is not sufficient).

Data Governance confidential


Page 27 of 29
Finally, another common pitfall is trying to make the initial implementation too complex.
Some of the more established tools on the market have been around for a while and
have evolved over time to provide a multitude of functionalities, all of which can
facilitate and enable your data governance and data quality activities. But please, when
you're looking at selecting a vendor initially,  be very clear what you want a tool to do
now. Also, consider what you definitely want it to do in the future.  Finally, you can make
a “nice to have” list. Just make sure you take a thorough approach to determine clear
requirements.

I've seen implementations of tools fail or the wrong tool selected because of vague or
overly complex requirements (just because the tool does it, does not mean that your
business really needs it).

Now we've looked at what the main pitfalls are. I wanted to share with you a few
questions that would be useful to ask the vendors to ensure they're a good fit for you
and your data governance initiative. Since I've highlighted the need for objectives and
clear requirements, the first question to ask them is, how does their tool meet your
requirements.  Notice I say how does it meet… and not does it meet. If you ask, “does
your tool meet our requirements”, most vendors will say yes.

What you want to know is how.  Is it simply out of the box functionality that is ready to
go or will there have to be manual workarounds, or even worse a lot of customisation or
configurations in the tool that may make future upgrades very difficult for you.

Secondly, I'd ask what implementation support will be given to you. You have to
remember these tools are by their very nature, flexible, and you need to set them up in a
way that works for your business. This means that you will need some support from the
vendor. So, make sure that you are very clear upfront about what kind of support they
will be giving you.  Knowing what is and isn't covered will prevent any nasty surprises in
the future.

Thirdly, ask what training they provide for both you and the team implementing it.
Perhaps they may even support training your business users on how to use their tool. 
Definitely work out what training you want and ask what training is available.

Data Governance confidential


Page 28 of 29
Some final thoughts on how to choose the right Data Governance Tool for your
organisation:

I’ve said it already but please remember that to successfully choose the right tool for
your company, it is absolutely vital that you are very clear on what you need the tool to
do before starting a selection process.  Clear requirements should be the start of the
process.

Make sure that you understand not only the support arrangements of the tool (as I
mentioned in the last section) but also the upgrade path of the tool. I've come across
more than one situation where an organisation has customised a tool to such a degree
that is not possible to follow the upgrade path.  On one occasion they needed a project
to redesign and implement a new data glossary to be able to upgrade and take
advantage of the new functionality.

Lastly, I would say that when you're working with vendors, going through workshops or
maybe an RFP process you are going to meet a whole variety of personalities. Bear in
mind that these are not the people that you will be working with if you choose and
select this tool. Whether you like or dislike them, do not be swayed by the personalities.
They will not be around for the implementation, and the ongoing support will be
provided by other people. So don't let yourself be influenced just because you like or
dislike their sales team!

Just remember that such tools can be great enablers to your data governance initiative,
but they need to be put in place once your data governance initiative is already going
so that you are very clear on what you want.

Data Governance confidential


Page 29 of 29

You might also like