Professional Documents
Culture Documents
Designing
must be made to ensure effective man-
agement and use of IT (decision do-
mains) and who makes the decisions
(locus of accountability for decision-
Data
making).
! Management involves making and
implementing decisions.
Governance
For example, governance includes
establishing who in the organization
holds decision rights for determining
standards for data quality. Manage-
ment involves determining the actual
metrics employed for data quality.
Here, we focus on the former.
Corporate governance has been de-
fined as a set of relationships between
ORGANIZATIONS ARE BECOMING INCREASINGLY SERIOUS a company’s management, its board,
about the notion of “data as an asset” as they face its shareholders and other stakehold-
increasing pressure for reporting a “single version of ers that provide a structure for deter-
mining organizational objectives and
the truth.” In a 2006 survey of 359 North American monitoring performance, thereby en-
organizations that had deployed business intelligence suring that corporate objectives are
attained. Considering the synergy be-
and analytic systems, a program for the governance tween macroeconomic and structural
of data was reported to be one of the five success policies, corporate governance is a key
“practices” for deriving business value from data element in not only improving eco-
nomic efficiency and growth, but also
assets.a In light of the opportunities to leverage enhancing corporate confidence.b A
data assets as well ensure legislative compliance to framework for linking corporate and
IT governance (see Figure 1) has been
mandates such as the Sarbanes-Oxley (SOX) Act and proposed by Weill and Ross.10
Basel II, data governance has also recently been given Unlike these authors, however, we
significant prominence in practitioners’ conferences, differentiate between IT assets and in-
formation assets: IT assets refers to tech-
such as TDWI (The Data Warehousing Institute) nologies (computers, communication
World Conference and DAMA (Data Management and databases) that help support the
automation of well-defined tasks, while
Association) International Symposium. information assets (or data) are defined
The objective of this article is to provide an overall as facts having value or potential value
framework for data governance that can be used by that are documented. Note that in the
context of this article, we do not differ-
researchers to focus on important data governance entiate between data and information.
issues, and by practitioners to develop an effective data Next, we use the Weill and Ross
framework for IT governance as a start-
governance approach, strategy and design. Designing ing point for our own framework for
data governance requires stepping back from day-to- data governance. We then propose a set
day decision making and focusing on identifying the a http://mediakit.businessweek.com/pdf/research/
fundamental decisions that need to be made and who KnightsbridgeWhitePaper.pdf (last viewed on August 2,
2007)
should be making them. Based on Weill and Ross,10 b http://www.oecd.org/dataoecd/32/18/31557724.pdf
of five data decision domains, why they Figure 1: Key organizational assets to be governed; adapted from Weill and Ross.10
are important, and guidelines for what
governance is needed for each decision
domain. By operationalizing the locus
of accountability of decision making (the
“who”) for each decision domain, we
create a data governance matrix, which
can be used by practitioners to design Figure 2: Decision domains for data governance.
their data governance. The insights
presented here have been informed by
field research, and address an area that
is of growing interest to the informa-
tion systems (IS) research and practice
community.
each decision domain with examples data stewards wherein they employ IT ing standards with respect to various
of the types of decisions to be made for tools (such as, DataFlux, Informatica dimensions of data quality, defines
each domain. The far righthand col- Data Quality) that help surface quality mechanisms for communicating busi-
umn in Table 1 also provides examples issues for the business owners (or data ness uses of data on an ongoing basis,
of potential organizational roles that owners/trustees). and delineates procedures for evaluat-
could be vested with decision rights for Data Quality. Poor data quality can ing the quality of data. By providing a
the various domains—that is, the “lo- impact an enterprise at both operation- roadmap for interpreting (metadata)
cus of accountability.” A case study that al and strategic levels7; current prob- and assessing data, data quality deci-
we conducted with a large insurance lems in data quality reportedly cost US sions are pivotal in the effective gover-
company revealed several such roles businesses more than $611 billion ev- nance of data assets.
for data governance: for example, the ery year in postage, printing, and staff Metadata. Defined as “data about
governance of data access was vested overhead.c Similar to product quality,3 data,” metadata describes what the data
in an Enterprise Architecture Develop- the quality of data refers to its ability to is about and provides a mechanism for
ment Committee. satisfy its usage requirements.5 While a concise and consistent description
Data Principles. Effective data prin- data quality has multiple dimensions, of the representation of data, thereby
ciples establish the linkage with the such as accuracy, timeliness, complete- helping interpret the meaning or “se-
business. For example, the organiza- ness and credibility, these dimensions mantics” of data. Different types of
tional decision to standardize business are relative and need to be defined in metadata such as physical, domain-in-
processes implies that there should be the context of the end use of data.1,5,9 dependent, domain-specific, and user
a clearly defined business owner of data For example, while 85% accuracy of the metadata8 play a role in the discovery,
assets (data principle). By delineating name, address, and phone number of retrieval, collation and analysis of data.
the business uses of data, data prin- physicians may be acceptable for an At the lowest level, physical metadata
ciples therefore establish the extent to insurance company that is targeting includes information about the physi-
which data is an enterprisewide asset, physicians as potential customers, this cal storage of data. Domain-independent
and thus what specific policies, stan- metric would not be acceptable for or- metadata includes descriptions such
dards and guidelines are appropriate. ganizations that need to notify prescrib- as the creator/modifier of data and
In keeping with the notion of data as ing physicians about a drug recall. authorization/audit/lineage informa-
an asset, data principles also establish/ ! Accuracy refers to correctness of data, tion related to the data. By providing a
foster opportunities for sharing and re- that is, whether the recorded value is in set of mappings from a representation
using data. Each principle is supported conformity with actual value, with re- language to agreed-upon concepts in
by a rationale and a set of implications. spect to its intended use. the real world, domain-specific meta-
Data principles take into account the ! Timeliness indicates that the recorded data connects a database to the “real
usage of external data, such as, custom- value is up-to-date for the task at hand. world.” Domain-specific metadata, for
er data from third-party service provid- ! Completeness suggests that the req- example, can be specified at different
ers. An organization’s data principles uisite values are recorded (not miss- levels—such as division and organiza-
also take into consideration the regula- ing) and that it is of adequate depth/ tion; at the division-level it provides
tory environment that could influence breadth. descriptions of the application data for
the business uses of data. ! Credibility indicates the trustworthi- individual units, while at the organiza-
Data principles therefore define ness of the source as well as its content. tion-level it supports reconciliation of
the desirable behaviors both for IS The data quality decision domain— domain-specific (data) descriptions for
professionals and business users. For which could be vested with roles such the entire organization. Finally, user
example, the notion of business own- as data quality manager, data quality metadata includes annotations that
ers of data implies that business users analyst, data quality trainer and sub- users may associate with data items or
have an important role in managing ject matter expert—provides underly- collections; such annotations can, for
data quality as well as its lifecycle, in- example, capture user preferences and
terpretability and access. On the other c http://www.dw-institute.com/research/display. usage history.
hand, IS professionals play the role of aspx?ID=6626 The metadata that is employed in
an enterprise depends on the intended Table 2: Potential example of data governance matrix.
use of and access to the data, as well
Decision
as the management of its life cycle. To Domain Data Data
Data Quality Metadata Data Access
support retrieval and analysis of data, Locus of Principles Lifecycle
accountability
the metadata decision domain may
be vested in such roles as enterprise Centralized ✓
executives who serve as data trustees. In propriate for different decision do-
contrast, decision rights for data qual- mains in the same organization. Simi-
ity may belong to business managers lar to Weill and Ross,10 we also suggest
who are data owners in many different that a “one page” design matrix (Table
business units, and thus be highly de- 2) may be useful for communicating a
centralized. Decisions related to data given organization’s data governance
access and data life cycle may be vested approach. The proposed framework
with an enterprise data architect and also provides a common terminology
a data security officer, respectively, as that can be used by researchers to share
the hub, but with business unit partici- their findings with other members of
pation but not authority (such as, data the IS community.
beneficiaries) as the spokes. Finally, the
decision rights for the metadata domain
References
may involve both data consumer and 1. Ballou, D. P. and Pazer, H. L. Modeling data and
data modeling engineers, and a more process quality in multi-input, multi-output
information systems. Management Science 31,(1985),
balanced approach to responsibility and 150-162.
2. Brown, C. V. Horizontal mechanisms under differing
accountability; hence, it is modeled here IS organization contexts. MIS Quarterly 23, (1999),
as at the midpoint on the continuum. 421-454.
3. Griffin, A. and Hauser, J. R. The voice of customer.
Both structural and non-structural Marketing Science 12, (1993), 1-27.
mechanisms2 can be employed to imple- 4. Levitin, A. V. and Redman, T. C. Data as resource:
Properties, implications, and prescriptions. Sloan
ment the governance structure shown Management Review,(1998), 89-101.
in Table 2. For example, a committee 5. Olson, J. E. Data Quality: The Accuracy Dimension.
Morgan Kaufmann, San Francisco, CA, 2003.
of business leaders may review and ap- 6. Pfleeger, C. P. and Pfleeger, S. L. Security in computing.
prove IT project requests and/or act as Prentice Hall, Upper Saddle River, NJ, 2003.
7. Redman, T. C. The impact of poor data quality on the
the governing body for developing and typical enterprise. Comm. ACM 41, (1998), 79-82.
enforcing a set of data principles. For 8. Singh, G., Bharathi, S., Chervenak, A., Deelman, E.,
Kesselman, C., Manohar, M., Patil, S., and Pearlman,
other decision domains that require col- L. A metadata catalog service for data intensive
laboration across business unit and IS applications. In Proceedings of the ACM/IEEE
SC2003 Conference on High Performance Networking
professionals, similar standing commit- and Computing. (Phoenix, AZ, 2003)
tee mechanisms can also be employed, 9. Wang, R. Y. and Strong, D. M. Beyond accuracy: What
data quality means to data consumers. Journal of
as well as processes that help ensure Management Information Systems 12, (1996), 5-34.
10. Weill, P. and Ross, J. W. IT governance: How top
consistent behaviors across multiple performers manage IT decision rights for superior
business and IS units. Corporate an- results. Harvard Business School Press, Boston, MA,
2004.
nouncements and other central commu-
nications using Web-based portals could
Vijay Khatri (vkhatri@indiana.edu) is an associate
be the mechanisms employed to dissem- professor at the Kelley School of Business of Indiana
inate policy decisions and procedures, as University, Bloomington, Indiana, USA.
well as to convey the organization’s data Carol V. Brown (Carol.Brown@stevens.edu) is a
governance objectives. Finally, organi- distinguished professor at the Howe School of Technology
Management of Stevens Institute of Technology, Hoboken,
zational incentives and reward systems New Jersey, USA.
could be designed to reinforce the value © 2010 ACM 0001-0782/10/0100 $10.00
that the organization places on manag-
ing data as an organizational asset.
Conclusion
We have presented a data governance
framework that can be used by practi-
tioners to develop a data governance
strategy and approach for managing
data as an organizational asset. We
have identified five decision domains,
presented arguments for why each of
these domains is important, described
some key decisions to be made for each
domain, and provided some examples
of organizational positions that may be
given accountability.
We also have proposed that differ-
ing levels of centralized, decentralized,
and shared decision rights may be ap-