Professional Documents
Culture Documents
Textbook Open Data Exposed Bastiaan Van Loenen Ebook All Chapter PDF
Textbook Open Data Exposed Bastiaan Van Loenen Ebook All Chapter PDF
Loenen
Visit to download the full and correct content document:
https://textbookfull.com/product/open-data-exposed-bastiaan-van-loenen/
More products digital (pdf, epub, mobi) instant
download maybe you interests ...
https://textbookfull.com/product/programming-interviews-exposed-
giguere/
https://textbookfull.com/product/open-heritage-data-an-
introduction-to-research-publishing-and-programming-with-open-
data-in-the-heritage-sector-henriette-roued-cunliffe/
https://textbookfull.com/product/data-simplification-taming-
information-with-open-source-tools-1st-edition-berman/
https://textbookfull.com/product/data-stewardship-for-open-
science-implementing-fair-principles-first-edition-mons/
Open Data for Sustainable Community: Glocalized
Sustainable Development Goals Neha Sharma
https://textbookfull.com/product/open-data-for-sustainable-
community-glocalized-sustainable-development-goals-neha-sharma/
https://textbookfull.com/product/open-data-for-education-linked-
shared-and-reusable-data-for-teaching-and-learning-1st-edition-
dmitry-mouromtsev/
https://textbookfull.com/product/open-data-politics-a-case-study-
on-estonia-and-kazakhstan-maxat-kassen/
https://textbookfull.com/product/gis-in-sustainable-urban-
planning-and-management-open-access-a-global-perspective-1st-
edition-martin-van-maarseveen-editor/
https://textbookfull.com/product/kelly-exposed-innocence-
corrupted-1st-edition-viktor-redreich/
Information Technology and Law Series IT&LAW 30
Volume 30
Editor-in-chief
Simone van der Hof, eLaw (Center for Law and Digital Technologies),
Leiden University, Leiden, The Netherlands
Series editors
Bibi van den Berg, Institute for Security and Global Affairs (ISGA),
Leiden University, The Hague, The Netherlands
Gloria González Fuster, Law, Science, Technology & Society Studies (LSTS),
Vrije Universiteit Brussel (VUB), Brussels, Belgium
Eleni Kosta, Tilburg Institute for Law, Technology and Society (TILT),
Tilburg University, Tilburg, The Netherlands
Eva Lievens, Faculty of Law, Law & Technology, Ghent University,
Ghent, Belgium
Bendert Zevenbergen, Center for Information Technology Policy,
Princeton University, Princeton, USA
More information about this series at http://www.springer.com/series/8857
Bastiaan van Loenen Glenn Vancauwenberghe
•
Joep Crompvoets
Editors
123
Editors
Bastiaan van Loenen Joep Crompvoets
Faculty of Architecture Public Governance Institute
and the Built Environment KU Leuven
Delft University of Technology Leuven, Belgium
Delft, The Netherlands
Glenn Vancauwenberghe
Faculty of Architecture
and the Built Environment
Delft University of Technology
Delft, The Netherlands
This T.M.C. ASSER PRESS imprint is published by the registered company Springer-Verlag GmbH, DE
part of Springer Nature
The registered company address is: Heidelberger Platz 3, 14197 Berlin, Germany
Series Information
The Information Technology & Law Series was an initiative of ITeR, the national
programme for Information Technology and Law, which was a research programme
set up by the Dutch government and The Netherlands Organisation for Scientific
Research (NWO) in The Hague. Since 1995 ITeR has published all of its research
results in its own book series. In 2002 ITeR launched the present internationally
orientated and English language Information Technology & Law Series. This
well-established series deals with the implications of information technology for
legal systems and institutions. Manuscripts and related correspondence can be sent
to the Series’ Editorial Office, which will also gladly provide more information
concerning editorial standards and procedures.
Editorial Office
Open data initiatives came on the scene just a decade ago. In December 2007, in
Sebastopol, California, 30 open government advocates agreed on the—now—
famous eight Open Government Data Principles. Soon after, the US and UK
governments decided to include the topic in their political agendas. In January
2009, on his first day in office, President Barack Obama signed legislation pro-
moting open government, while the first US and UK open data portals were
launched to make public sector information (PSI) available in the form of “open
data”. At the same time, the European Commission started the review process of the
2003 PSI Re-use Directive and published a revised version in 2013.
However, the open data movement did not appear out of the blue just 10 years
ago. Three areas provide a rich context. Freedom of information (FOI) is an area in
which governments all over the world have been active for decades, some for cen-
turies. The Swedish Freedom of the Press Act of 1766 guaranteed public access to
government documents. Secondly, digital technologies gave a completely different
dynamics to the generation, storage and at the same time access and dissemination of
PSI. Lastly, the “openness” movement was quite influential in the early twenty-first
century, promoting “open” in several areas including open source software, open
standards, open knowledge/content and open science. The “open data” addition to
this palette appeared rather natural, framing the FOI discussion for the digital era.
The expectations were high from the beginning. The early open data rhetoric
could be summarised as “publish all your data in whatever way and, here it
comes… transparency, growth, efficiencies and innovation”. Using Gartner’s hype
cycle, the 2008–2014 period could be seen as the period of inflated expectations.
Open data advocates believed that just by making government data available, we
will automatically achieve open government. For this reason, pressure was put on
public organisations to publish data. However, this happened in a supply-driven
way, ignoring to a large extent real users, requirements and demand for open data.
vii
viii Foreword
Annoyingly, demand did not follow the supply. By the mid-2010s, we realised that
although we had several hundreds of thousands of open data sets available via
hundreds of portals maintained by public authorities at all administrative levels, i.e.
on local, regional, national and European level, the demand, actual use and
exploitation of open data were very low. Public agencies were doing—some better
than others—their homework, but the promised benefits were not there. What was
wrong?
Several hidden issues surfaced when the open data rhetoric hits reality:
• Governance. Open data should become part of the corporate information and
data management plan. Although data is a valuable resource and as such it needs
to be appropriately managed, we still lack policies for managing information
inside public organisations. We need much more managing of open data as part
of an overall corporate information management portfolio.
• Funding and costs. The initial enthusiasm underestimated the cost of publishing
open data. There are direct costs related to the publication process and other
important costs related to changing existing business models that are based on
revenues for public agencies. On many occasions, upfront investments and
long-term commitments of resources were not realistically estimated.
• Licensing and privacy. Licensing open data has proved to be complicated, while
ambiguity in this area prevents reuse. Moreover, privacy issues seem to dete-
riorate with the advancement of de-anonymisation techniques.
• Usability. In a rather naïve way, it was thought that open data could be used
directly by everyone, by all citizens and businesses. However, special skills are
required for using and getting value out of open data. This skill set is often quite
advanced as the published data most times suffers from low quality, inconsis-
tencies and need demanding curation, cleansing, integration, etc.
• Ecosystems. Open data alone is not enough. Applications, communities, power
users, standards, platforms play not just a supportive role but are prerequisites
for value creation.
• Just opening data in any form and format is not enough. Specific policies for
promoting publication quality and ensuring interoperability and compliance to
standards are needed to avoid a Tower of Babel of open data. Advanced skills
are also needed inside public organisations to support the entire open data life
cycle from generation to exploitation.
• Global applicability and national specificities. The open data movement was
born in an Anglo-Saxon context. Its application to other countries and conti-
nents revealed specificities and special characteristics based on cultural, insti-
tutional and organisational factors that need to be carefully considered. It is not
always possible to just copy practices from one country to the other.
• Evaluation and assessment. It is very difficult to assess and evaluate in an
objective and quantitative way, using validated and verifiable evidence, the
value and impact of open data, with respect to transparency, efficiencies,
innovation, job creation, etc.
Foreword ix
With this extensive list of challenges, coupled with the (apparently) low use of
open data, in the post-2015 era, agencies started questioning the open data return on
investment. How to persuade budget holders to continue investing in something of
which the benefits cannot be assessed, and usage is low despite considerable
investments already having been made?
It seems that we may be moving towards the “trough of disillusionment”.
However, the first indications of a new level of maturity or the “slope of enlight-
enment” can be seen. Researchers, policy-makers and open data advocates are
realising the limitations and are coming up with plans, proposals and ideas to
overcome these limitations. The publication of this book with the topics it covers is
a manifestation of the maturity of the discussion.
This book aims to expose the hidden issues as well as key aspects that have a
relevance for opening data and to provide illustrative examples of how open data is
implemented worldwide. It includes in-depth information about the historical
background and the key components of open data, presents several interesting case
studies on open data initiatives and infrastructures, and critically discusses the
current and future developments in the open data ecosystem.
Open data alone may not be the Holy Grail for the public sector, but remains a
very powerful tool with which to increase transparency, reduce costs, create new
services and boost reuse/collaboration within the public sector and with the private
sector. We now realise that it cannot come alone, and it needs a broader environ-
ment to thrive, coupled with crucial elements such as policies, planning, manage-
ment, funding, prioritisation, standards, skills, awareness, assessment frameworks
and legal clarity, just to mention a few. If we see open data not as a technology
trend, but as a vital parameter of the Freedom of Information discussion shaped in
the digital era, we can all agree that it is a one-way street. You cannot go back.
xi
xii Contents
xiii
xiv Abbreviations
EUR Euro
FLOSS Free/libre open source software
GAV Gross added value
GBP British pound
GDP Gross domestic product
GDS UK Government Digital Service
GIS Geographic information system
GLA Greater London Authority
GML Geography Markup Language
HRM Human resource management
INSPIRE Directive EU Directive 2007/2/EC of the European Parliament and
of the Council of 14 March 2007 establishing an
Infrastructure for Spatial Information in the European
Community
IRM Independent Reporting Mechanism
JSON JavaScript Object Notation
JSON-LD JSON for Linking Data
KMS Danish mapping agency
LAPOR! A mobile and web application developed to report
inappropriate practices of government in Indonesia
LGA UK Local Government Association
LIDAR LIght Detection And Ranging; an optical remote-sensing
technique producing highly accurate x, y, z measurements
(ESRI 2018)
LOD Linked open data
NapTAN UK National Public Transport Access Nodes
NASG National Administration of Surveying, Mapping and
Geoinformation of China
NLP Natural language processing
NOK Norwegian Krone
NPTG UK National Public Transport Gazetteer
NSDI National Spatial Data Infrastructure
NWB Dutch National Roads Database
NZGOAL New Zealand Government Open Access and Licensing
framework
OD4D Open Data for Development Network
ODbL Open Database Licence
ODI One Data Indonesia
ODRA Open Data Readiness Assessment
ODSC Open Data Strategy Council (Korea)
ODUG Open Data User Group (UK)
Abbreviations xv
Contents
1.1 Introduction........................................................................................................................ 1
1.2 Definition and Principles of Open Data............................................................................ 3
1.3 Open Data Ecosystem and Open Data Infrastructures ..................................................... 4
1.4 Outline of the Book........................................................................................................... 7
References .................................................................................................................................. 9
1.1 Introduction
This book is about open data, i.e. data that does not have any barriers in the (re)use.
Open data aims to optimize access, sharing and using data from a technical, legal,
financial, and intellectual perspective. Data increasingly determines the way people
live their lives today. Nowadays, we cannot imagine a life without real-time traffic
information about our route to work, information of the daily news or information
about the local weather. At the same time, citizens themselves now are constantly
generating and sharing data and information via many different devices and social
media systems. Especially for governments, collection, management, exchange, and
use of data and information have always been key tasks, since data is both the primary
input to and output of government activities. Also for businesses, non-profit orga-
nizations, researchers and various other actors, data and information are essential.
It is estimated that 90% of the world’s data was just generated in the past two
years.1 Where in the past the entire data collection of the globe would fit a single
memory stick of 8 Gb, in 2013 it was estimated that the world creates 2.5 quintillion
bytes of data daily,2 totalling a 2.7 Zettabytes3 of data existing in the digital uni-
verse.4 While in the past government data were regarded as being among the largest
creators and collectors of data in many different domains,5 this is anno 2018 not the
case anymore.6 The majority of the Zettabytes that are collected on a daily basis,
often through citizens,7 are in the private domain.8
Since data is so critical for a well-functioning society,9 access to the data should
be optimised. One way of arriving at such an ideal position is to open data. While the
concept of making data freely available for the common good originally stems from
the academic community10—many years before the creation of the internet—in
recent years especially governments have been active in setting up open data ini-
tiatives and making their data available as open data on the web. Although there are
also movements towards open commercial data, open research data and open sci-
ence,11 and open data provided by citizens, this book’s focus is on open government
data, which still makes up the majority of open data. Therefore, open data mainly
translates into open government data in this book, unless otherwise indicated.
The main objective of this book on open data is to expose key aspects that have a
relevance when dealing with open data and to provide appealing examples of how
open data is implemented worldwide. This chapter first provides a definition of
open data and discusses the values associated with open data. The chapter also
introduces the reader to the central concepts of open data infrastructures and the
open data ecosystem, and ends with the outline of the book.
1
IBM 2016; SINTEF 2013.
2
Jacobson 2013.
3
A Zettabyte equals 1 billion Terabytes. A Terabyte equals 1000 Gigabytes.
4
Karr 2012; see also DOMO 2017.
5
Janssen 2011.
6
Mulcahy 2017.
7
See Montargil and Santos 2017.
8
See Kelly 2012.
9
See also Castells and Himanen 2002.
10
See Chignard 2013: The term open data appeared for the first time in 1995, in a document from
an American scientific agency (see National Research Council 1995).
11
See for the Findable, Accessible, Interoperable and Reusable (FAIR) principles of open science:
Wilkinson et al. 2016.
1 Open Data Exposed 3
Open data is data that can be accessed, shared, used and reused without any barrier
for any type of (re)user. According to the Open Definition,12 open data can be
defined as data that be freely used, modified, and shared by anyone for any purpose
subject, at most, to measures that preserve provenance and openness.13 Open data
requires datasets to be either in the public domain, or distributed through an open
license. The data must be provided as a whole, free of charge, and preferably
downloadable via the Internet, including any additional information that might be
necessary to comply with the open license’s terms. Openness requires the data to be
provided in a readily machine-readable form. The format must be open as well,
meaning that it does not place any restriction upon its use, and that the files in that
format can be processed with open-source software tools.
The Open Definition speaks broadly of open ‘works’, rather than of open data.
Focusing on data tout court, one can move from the Open Government Data14
(OGD) principles. According to the OGD principles, which are arguably founda-
tional in understanding the concept of open data, data must be: Complete;15
Primary;16 Timely;17 Accessible;18 Machine-processable;19 Non-discriminatory;20
12
‘Open Definition 2.1’. http://opendefinition.org/. Accessed May 2018.
13
‘Open Definition 2.1’. http://opendefinition.org/. Accessed May 2018. Please note that the
attribution and sharealike requirements are possible limitations in the use, therefore not strictly
adhering to the ‘without any barrier’ requirement of open data (see further Chap. 6 of this book).
14
As pointed out by the principles’ authors, “(t)here are many definitions of “open” and this is but
one. The 2007 working group’s definition sits at the unique intersection of open government and
open data and has United States sensibilities”—Dietrich et al. 2007.
15
Open data must be the default setting for governmental data releases, provided that said data is
of a public nature, meaning that it must not be subject to valid privacy, security or other legitimate
and legally sanctioned limitations. Moreover, while some resources are by now digital by default,
some other artefacts are not: in the latter case, they should be made digitally available to the
maximum extent possible.
16
I.e. collected at its very source, and as granular as possible; entities obtaining the original
dataset, processing it, and publishing it in a modified (e.g. aggregated) form, should have the
obligation to publish the original data set in its default format, thus contributing in preserving it for
posterity.
17
Each piece of information has its own lifecycle, and the accuracy—and therefore the utility—of
a dataset partly depends on the time lapsed from the dataset’s creation. Timeliness, therefore,
means that government data should be released as early as possible, to preserve the data’s value.
18
Data must be accessible to the widest possible number of users for the widest possible array of
purposes. Accessibility is deemed to be lacking if the data is not accessible through automated
means, due to technological, policy, or other kinds of restrictions.
19
As following from the accessibility principle above, data must be machine-readable and pro-
cessable, and thus in a widely used, normalized and sufficiently documented format.
20
I.e. available to anyone, for any purpose, without access control, ‘walled gardens’, or other
gatekeeping activities that might lead to differences in accessing information.
4 B. van Loenen et al.
In this book we consider open data as part of a bigger system that has been referred to
as the open data ecosystem.26 An ecosystem can be defined as “a system of people,
practices, values, and technologies in a particular local environment”.27 Ecosystems
consist of interacting, relatively tightly connected components with substantial inter-
dependencies. However, the specific components vary from ecosystem to ecosystem.
Recently, the concept was also introduced in open data practice and research, and
several scholars started to explore how the concept can be a valuable heuristic for
understanding and approaching open data practices and initiatives.28 It was Rufus
Pollock who coined the concept of the open data ecosystem in 2011.29 Pollock argued
that we should transform our one way open data streets towards an ecosystem where
data is cycled and recycled among producers and users with a prominent role for
info-mediaries.30 Such an ecosystem should thrive on collaboration, componentization
21
No entity should have exclusive control over the data format employed. As several proprietary
formats are widely used, and conversely some open formats have a narrow user base, the decision
of releasing a dataset in both widely used proprietary formats and in less used but open formats is
compliant with the principle in discussion, as long as the only format used is not a proprietary one.
22
I.e. not subject to limitations deriving from IP rights, thus either in the public domain or
disciplined by an open license.
23
See Carrara et al. 2016.
24
See Kulk and Van Loenen 2012; Van Loenen et al. 2016.
25
See Welle Donker 2016.
26
Pollock 2011; Harrison et al. 2012; Zuiderwijk et al. 2014; Jetzek 2017; Styin et al. 2017.
27
Nardi and Day 1999.
28
See e.g. Heimstädt et al. 2014; Zuiderwijk et al. 2014; Styrin et al. 2017.
29
Pollock 2011.
30
See also Jetzek 2017: Stressing that the open data ecosystem should be circular in nature.
1 Open Data Exposed 5
and open data.31 Since Pollock’s approach, other approaches emerged,32 often within
the context of open government data.33 Some scoped the ecosystem purely technical
by addressing the optimisation of the data provision to promote use.34 Others argued
that the ecosystem would comprise “an active network, facilitating interaction and
communication amongst everybody interested and/or involved in open data and the
re-use of information and data, internal, as well as external to the organisation”.35,36 In
Dawes approach, the open data ecosystem fulfils a communication or networking
platform where stakeholders in the system can meet, greet and interact.37 Further, the
mutual interdependency of the different stakeholders in the ecosystem creates a
common responsibility: only together can they make or break the system.38
Although the concept of the open data ecosystem is not fully developed as of
today, and its applicability to open data may be questioned, what is central “to the
ecosystems metaphor is the recognition that users, technology innovators, and
government leaders, data managers, and policy makers are mutually interdependent
in developing this business efficiently, effectively, and in ways calculated to bring
value to all participants”.39 An open data ecosystem can be considered as a cyclical,
sustainable, demand-driven environment oriented around agents that are mutually
interdependent in the creation and delivery of value from open data.40 However,
such an ecosystem cannot function properly without a foundation, the infrastructure
that fulfils the basic requirements of the ecosystem. Whilst the ecosystem especially
allows niche uses (specialised communities of use) to emerge,41 the infrastructure
seeks to support the widest possible range of uses of the data.
In her work on open data infrastructures, Zuiderwijk defined this infrastructure as “a
shared, (quasi-) public, evolving system, consisting of a collection of interconnected
social elements (e.g. user operations) and technical elements (e.g. open data analysis
tools and technologies, open data services) which jointly allow for OGD use”.42 In this
perspective the infrastructure is regarded “as a central place where researchers can find
31
Pollock 2011.
32
Ubaldi (2013) argues that the open data ecosystem consists of three interacting ecosystems: the
data provider ecosystem, the data user ecosystem and the infomediary ecosystem.
33
See, for example, Ubaldi 2013; Dawes et al. 2016; Harrison et al. 2012.
34
Zuiderwijk et al. 2014.
35
Share PSI 2.0 Best Practice: Establish an Open Data Ecosystem, 25 July 2016 [on line] available
at: https://www.w3.org/2013/share-psi/bp/eode/. Accessed May 2018.
36
See also Dawes et al. 2016; Ubaldi 2013; Harrison et al. 2012.
37
Such ecosystem may exist at different levels of scale: within an organisation, a country, region
or the worldwide open data ecosystem, in a specific domain (see Zuiderwijk 2015). Open data
ecosystems may also involve multiple levels, namely a data producer and a data user level, with
between the infomediaries level connecting the two (Ubaldi 2013; Jetzek 2017).
38
Cf. Harrison et al. 2012.
39
Harrison et al. 2012.
40
Boley and Chang 2007; Harrison et al. 2012.
41
Davies 2010; cf. the concepts of infrastructures and business systems in Chan et al. 2001.
42
Zuiderwijk 2015.
6 B. van Loenen et al.
and use the data published by OGD providers, where they can use integrated tools, and
where they can interact with OGD providers and policy makers to discuss their findings
from open data use.”43 The function of this concept is rather that of an online platform
than of an information infrastructure. A similar view is provided by Janssen et al. stating
that open data publication requires “an infrastructure which is able to handle the data in
an easy-to-use way to lower the user threshold. [..] Such an infrastructure should have
facilities for the discovery, curation, provenance, analyzation, and visualization of
data”.44 Others defined an information infrastructure as “a technical framework of
computing and communications technologies, information content, services, people, all
of which interact in complex and often unpredictable ways”.45 However, infrastructure
encompasses much more than just the technical bit: it is people, it is laws, it is the
education to be able to use systems, it is the policies, governance mechanisms and the
funding structures.46 In the context of physical infrastructures, Robert Pepper of the US
Federal Communications Commission47 explained that: “If you think about the highway
system, we tend to think about bridges and interstates, but the infrastructure also includes
the highway laws, drivers’ licenses, McDonalds along the roadside, gas stations, the
people who cut the grass along the highways, and all of those support systems”.
In this book, we consider an open data infrastructure as the basic physical and
organisational structure and facilities needed for the functioning of an open data
ecosystem. As such we regard it as a social and technical construct.48 In addition to
the pure technical aspects of providing access through ftp, API, or otherwise to the
datasets and their metadata, it also inhibits non-technical aspects such as the open data
policies, and governance and funding structures. Like any infrastructure, the open
data infrastructure has typical dimensions that are paramount to the success of open
data ecosystems: users, providers, technical aspects (standards,49 access networks),50
legal/ policy framework dimensions,51 the funding dimension and the governance52
of the infrastructure connecting the users, user communities (like developers, uni-
versities, private sector) and providers of the open data.53 The ecosystem builds on
the infrastructure and is “made up of a series of interrelated tools and services that rely
on one or more elements of the infrastructure either directly, or through intermediary
tools and services, for their sustained operation”.54 Davies argues that developing an
43
Zuiderwijk 2015.
44
Janssen et al. 2012.
45
Borgman 2000, p. 30.
46
Coleman and McLaughlin 1997.
47
Coleman and McLaughlin 1997.
48
Star and Ruhleder 1996, p. 113.
49
Zuiderwijk et al. 2014.
50
Zuiderwijk et al. 2014; Ubaldi 2013.
51
Ubaldi 2013; Harrison et al. 2012; World Bank 2015.
52
Harrison et al. 2012.
53
See also World Bank 2015; Williamson et al. 2003.
54
Davies 2010.
1 Open Data Exposed 7
ecosystem around an open dataset might be taken up entirely by third parties (other
than public organisations).55 The European Commission sticks with different
wording to the same concept: government should provide the infrastructure that
should support business development.56
The book is divided into four main parts. Together with this general introduction, the
second chapter, in which the historical background of open data is discussed, forms a
first introductory part of this book. In the second part of the book, which consists of
Chaps. 3–8, various key components of open data infrastructures are addressed. The
third part of the book, containing Chaps. 10–13, provides several cases studies on
open data initiatives and infrastructures worldwide. The book ends with a final
chapter in which current and future open data developments are critically discussed.
In this first introductory chapter of the book, we provided a definition of open
data, discussed the main principles and values of open data, and introduced the
reader into the central concepts of open data infrastructures and the open data
ecosystem. The second chapter by Lorenzo Dalla Delta complements this first
chapter, by providing a historical overview on open data. The concept of open data
as we know it today is the result of several different initiatives, promoted by a wide
range of actors, both of a legislative and non-legislative nature. Numerous regu-
latory antecedents meant to foster the concept of open data and to embed it in
national and international policy agendas have been undertaken on both sides of the
Atlantic, and also at a super-national level. He concludes that the striving towards
openness can be characterised by different priorities and undertones, like public
sector information reuse and economic growth in the EU, or transparency and
accountability in the US, but its general principles are widely and internationally
shared. Throughout Chaps. 3–9, different key components of open data initiatives
and open data infrastructures are further discussed. In the first two chapters, the
focus is on the main stakeholders involved in the implementation of these infras-
tructures, i.e. the user (Chap. 3) and the data provider (Chap. 4). Chapter 3 puts the
user of open data central and explores the open data users, mentioned by many but
considered by few. In this chapter, Bastiaan van Loenen argues that there no such
thing as a single typical user making it very difficult to develop user oriented open
data policies. Chapter 4 addresses open data developments from the perspective of
the data providers, with a particular focus on the issue of funding open data. In this
chapter the change process from cost recovery towards open data is explained and
discussed by Frederika Welle Donker. She addresses the tension field between lost
55
See also O’Reilly 2010: Arguing that greater government involvement could increase the
vitality of an infrastructure. “But if the lesson is correctly learned, it should do so not by competing
with the private sector to deliver [..] services, but by investing in infrastructure (and “rules of the
road”) that will lead to a more robust private sector ecosystem.”
56
European Commission DG CONNECT A European strategy on the data value chain.
8 B. van Loenen et al.
revenue due to open data and maintain adequate data service quality in the context
of a sustainable business model for open government data. Chapter 5 introduces and
discusses the governance of open data initiatives as a key component in the
implementation of open data. Building further on the public administration litera-
ture on coordination and governance in the public sector, Glenn Vancauwenberghe
and Joep Crompvoets develop a framework for analysing the governance of open
data initiatives, and use this framework to present examples of the use of various
governance instruments for governing open data initiatives worldwide.
In the early years of open data, users were often confronted with unique complex
restrictive licences in local languages which were not compatible with other licences.
Although there has been significant work accomplished in the domain of harmon-
ising licences, including open data licences, many countries still develop and use
their own national version of an open licence. Users may wonder why countries and
organisations do not use one single licencing framework across the globe. In Chap. 6,
Alexandra Giannopoulou argues that it is, from a legal perspective, not as simple as it
may look like due to internal and external legal interoperability issues. Open data
interests also have to be balanced with other competing interest. Two of these
competing interests are privacy and data protection. In Chap. 7, Lorenzo Dalla Corte
identifies the relevance of distinguishing these two rights. He arrives at a complex
mix of different types of information including private personal data, non-private
personal data and non-private personal data that can be opened. The performance of
open data and open data initiatives is addressed in Chap. 8, in which Glenn
Vancauwenberghe discusses the different dimensions of open data assessments, and
analyses the differences and similarities in existing open data assessments. The
technological components of open data infrastructures are considered in Chap. 9. In
this chapter, Stanislav Ronzhin, Erwin Folmer and Rob Lemmens use the five star
model of Tim Berners Lee to explain the technological challenges of open data. They
argue that linked (open) data provides the needed mechanisms and conventions for
seamless integration of semantically heterogeneous datasets.
Although open data initiatives and infrastructures have been developed at dif-
ferent administrative levels, especially at the national level much effort is done to
implement open data and coordinate the efforts and activities of different stake-
holders. To give the reader better insight in past and ongoing open data initiatives,
and the way the different components of open data discussed in this book are
implemented, the book also highlights and investigates open data implementations
in four different countries. In Chap. 10, Glenn Vancauwenberghe and Jamie Fawcett
present the history of open data in the United Kingdom, starting from the very first
steps towards open data in the 90s to the most recent developments. Chapter 11 by
Bastiaan van Loenen focuses on the Netherlands, one of the frontrunners worldwide
in opening spatial data. In Chap. 12, Xue Mei looks at open data developments in
China, a country where especially at the local level many data have been opened by
government. The fourth country explored in this book is Indonesia. In Chap. 13,
Agung Indrajit explores how this emerging economy is embracing open data.
The final chapter of this book, Chap. 14, demonstrates that current developments
towards open data, which can be referred to as the democratisation of data, is resulting
in a very small number of data companies having access to a significant portion of data
1 Open Data Exposed 9
available globally including open government data, without sharing this data with
society. Hendrik Ploeger and Bastiaan van Loenen argue that this skewed balance in
the information position of government, citizens and the (big) data companies may
result in a data dictatorship dominating our information societies in the coming
decades. Therefore they hold that the current open data system should be revolu-
tionised from the democratisation of data available to the happy few into a data
democracy for all.
References
Boley H, Chang E (2007) Digital Ecosystems: Principles and Semantics. 2007 Inaugural IEEE
International Conference on Digital Ecosystems and Technologies, Cairns, Australia, February
Borgman CL (2000) From Gutenberg to the Global Information Infrastructure; Access to
Information in the Networked World. The MIT Press, Cambridge, Massachusetts
Carrara W, Oudkerk F, van Steenbergen E, Tinholt D (2016) Open Data Goldbook for Data
Managers and Data Holders. https://www.europeandataportal.eu/sites/default/files/goldbook.
pdf. Accessed May 2018
Castells M, Himanen P (2002) The Information Society and the Welfare State; The Finnish Model.
Oxford University Press, Oxford
Chan TO, Feeney M-E, Rajabifard A, Williamson I (2001) The Dynamic Nature of Spatial Data
Infrastructures: A Method of Descriptive Classification. GEOMATICA 55(1):65–72
Chignard S (2013) A brief history of Open Data. Paris Innovation Review. http://paris-
innovationreview.com/articles-en/a-brief-history-of-open-data. Accessed May 2018
Coleman DJ, McLaughlin J (1997) Defining global geospatial data infrastructure (GGDI):
Components, Stakeholders and Interfaces. Global Spatial Data Infrastructure Conference,
Chapel Hill, North Carolina
Davies T (2010) Open Data: Infrastructures and ecosystems. University of Southampton
Dawes SS, Vidiasova L, Parkhimovich O (2016) Planning and designing open government data
programs: An ecosystem approach. Government Information Quarterly 33(1):15–27
Dietrich D, Gray J, McNamara T, Poikola A, Pollock R, Tait J, Zijlstra T (2007) The Open Data
Handbook. http://opendatahandbook.org. Accessed May 2018
DOMO (2017) Data Never Sleeps 5.0. https://www.domo.com/learn/data-never-sleeps-5?aid=
ogsm072517_1&sf100871281=1. Accessed May 2018
European Commission (DG CONNECT) (2013) A European strategy on the data value chain
Harrison TM, Pardo TA, Cook M (2012) Creating open government ecosystems: A research and
development agenda. Future Internet 4(4):900–928
Heimstädt M, Saunderson F, Heath T (2014) From Toddler to Teen: Growth of an Open Data
Ecosystem. A Longitudinal Analysis of Open Data Developments in the UK. JeDEM - Journal
of eDemocracy & Open Government 6(2):123–135
IBM (2016) 10 Key Marketing Trends for 2017 and Ideas for Exceeding Customer Expectations
Jacobson R (2013) 2.5 quintillion bytes of data created every day. How does CPG & Retail manage
it? https://www.ibm.com/blogs/insights-on-business/consumer-products/2-5-quintillion-bytes-of-
data-created-every-day-how-does-cpg-retail-manage-it/. Accessed May 2018
Janssen K (2011) The influence of the PSI directive on open government data: An overview of
recent developments. Government Information Quarterly 28(4):446–456
Janssen M, Charalabidis Y, Zuiderwijk A (2012) Benefits, Adoption Barriers and Myths of Open
Data and Open Government. Information Systems Management 29(4):258–268
Jetzek T (2017) Innovation in the Open Data Ecosystem: Exploring the Role of Real Options Thinking
and Multi-sided Platforms for Sustainable Value Generation through Open Data. Chapter in
10 B. van Loenen et al.
Contents
2.1 Introduction........................................................................................................................ 12
2.2 Towards Openness............................................................................................................. 12
2.3 US: From the Sebastopol Meeting to the Obama Administration ................................... 13
2.4 EU: From PSI Re-use to Open Data?............................................................................... 15
2.5 A Glimpse of the Future? ................................................................................................. 20
2.5.1 Building a European Data Economy ..................................................................... 21
2.5.2 Towards a Common European Data Space ........................................................... 24
2.6 International Developments............................................................................................... 26
2.7 Conclusion ......................................................................................................................... 29
References .................................................................................................................................. 29
Abstract There is a general, international, and multifaceted trend that is shaping the
global narrative towards openness as a default setting. The open data movement has
been both influencing and influenced by the concepts of Open Government, Open
Access, and Open Source; by Freedom of Information laws, and by the regulatory
initiatives aiming at fostering the re-use of Public Sector Information. The general
aim is promoting information availability, as free of restraints as reasonably possible,
to reach a multiplicity of different goals. Transparency, efficiency, accountability,
economic growth, and democratic participation are amongst the core values upheld,
from an instrumental perspective, by the striving towards openness. This chapter
highlights a number of international initiatives that revolve around open data, and
that have been instrumental in framing the concept as understood nowadays.
2.1 Introduction
There is a general, international, and multifaceted trend that is shaping the global
narrative towards openness as a default setting. The open data movement has been
both influencing and influenced by the concepts of Open Government, Open
Access, and Open Source; by Freedom of Information laws, and by the regulatory
initiatives aiming at fostering the re-use of Public Sector Information (PSI).
Open data’s general aim is promoting information availability, as free of restraints
as reasonably possible, to reach a multiplicity of different goals. Transparency, effi-
ciency, accountability, economic growth, and democratic participation are amongst
the core values upheld, from an instrumental perspective, by the striving towards
openness.
This chapter highlights a number of initiatives that have been instrumental for
the concept of open data and for the goals it pursues. It starts by highlighting a few
important landmarks in the US and in the EU. The analysis, due to the topic’s
breadth and to the multiplicity of jurisdictions and organisations involved with open
data, is bound to leave out many important initiatives, mostly at a national level.
This chapter will therefore also account for several international developments
revolving around the concept of open data, but is by no means an exhaustive
account of the emergence of such a rapidly developing movement.
The brief review of what makes the concept of open data undertaken in Chap. 1
takes the Open Definition as a starting point. As this section aims at briefly giving
some contextual historical and genealogical1 notes on the concept of open data, it
appears appropriate to start by mentioning the roots of the Open Definition itself, as
specified by its authors:
The Open Definition was initially derived from the Open Source Definition, which in turn
was derived from the original Debian Free Software Guidelines, and the Debian Social
Contract of which they are a part […] This definition is substantially derivative of those
documents and retains their essential principles.
The idea underlying open data’s openness (partly) derives from the open source
movement,2 which results clearly from the references to FLOSS and to open formats
1
Gray 2014.
2
The parallelism results clearly from the four fundamental freedoms of free software, as framed by
Richard Stallman and the Free Software Foundation: “The freedom to run the program as you
wish, for any purpose (freedom 0); The freedom to study how the program works, and change it so
it does your computing as you wish (freedom 1); The freedom to redistribute copies so you can
help your neighbour (freedom 2); The freedom to distribute copies of your modified versions to
others (freedom 3)”—Free Software Foundation n.a.
2 Towards Open Data Across the Pond 13
and standards characterizing the concept of open data.3 There are, however, several
other factors that contributed in shaping open data.
Indeed, the same degree of merit in introducing and evolving the notion of open
data must be given to the Open Science and Open Access movements, which aim at
rendering scientific output and governmental information—in particular if publicly
funded—as free from limitations as reasonably possible, shaping scientific research
just as the FLOSS movement shaped software. Furthermore, as it has been pointed
out,4 the open data movement has been developing paralleling the Right to
Information (RTI) one,5 which is mainly concerned with enhancing transparency
and accountability within public administrations, and whose connection with the
notion of open data results therefore apparent. Aside from RTI instances, the open
data movement gained additional traction by leaning on the social, economic and
political ratio underlying PSI reuse:6 the potential for increased efficiency and for
added value and growth deriving from the reuse of PSI pushed open data instances
within environments in which transparency and openness would not have had the
same level of traction.
The last decades saw open data growing from relatively narrow academic and
professional circles to a full-fledged movement that is arguably changing how both
the public and the private sectors operate, often in a radical fashion.
3
See Open Knowledge International n.a.; see also Pomerantz and Peek 2016.
4
Kitchin 2014, p. 48.
5
See Gray and Darbishire 2011.
6
PSI is logically connected to, and yet distinct from, the notion of open data: see Carrara et al.
2016. The next subsection will elaborate on the similarities and the differences between the two
concepts.
7
Malamud 2007.
8
Chignard 2013.
14 L. Dalla Corte
The Sebastopol meeting’s call, expression of a broader trend that was increasingly
gaining consensus amongst the general population, was not ignored by the US
administration. In 2009, the White House issues a memorandum on Transparency and
Open Government,9 in which it endorsed the principles of transparency, participation,
and collaboration, directing “the Chief Technology Officer, in coordination with the
Director of the Office of Management and Budget (OMB) and the Administrator of
General Services, to coordinate the development by appropriate executive depart-
ments and agencies […] of recommendations for an Open Government Directive, to
be issued by the Director of OMB, that instructs executive departments and agencies
to take specific actions implementing the principles set forth”.10 The same year, in
May, the US launched its open data portal, Data.gov;11 in December, the White House
issued the Open Government Directive,12 through which it mandated precise dead-
lines and steps to be followed for the completion of the national open data program.13
Open data features prominently in the Digital Government Strategy issued by the
Obama administration in 2012, as well. On April 27, 2011, the US President issued
Executive Order 13571 (Streamlining Service Delivery and Improving Customer
Service),14 in which the Federal CIO (Chief Information Officer) was tasked “with
developing a comprehensive Government-wide strategy to build a 21st century digital
Government that delivers better digital services to the American people”.15 Because
of that order,16 the US Open Data Strategy was released under the title “Digital
Government: Building a 21st Century Platform to Better Serve the American
People”.17 The Strategy provides US agencies with a 12-month roadmap, whose path
is divided into milestones, and that results in the publication of several related
deliverables. The Strategy’s objective is threefold: enabling the access to high-quality
digital government information and services; ensure that the government’s digital
transformation is performed “in smart, secure and affordable ways”, and unlock the
utility of open government data to spur innovation, foster growth, and improve the
quality of US governmental services.18 It builds on four overarching principles, which
can be seen as integrating the ones deriving from the Open data conceptualization and
movement, and that have arguably had a role in shaping the open data milieu as
understood nowadays. The strategy is based on an “Information-Centric” approach
(principle 1), through which to shift “from managing “documents” to managing
discrete pieces of open data and content which can be tagged, shared, secured, mashed
9
Obama 2009.
10
Obama 2009.
11
Data.gov n.a.
12
Orszag 2009.
13
See the following subsection for an overview of the US Open Government Directive.
14
Obama 2011.
15
Obama 2012a.
16
See, however, also Obama 2012b.
17
Obama 2012b.
18
The White House 2012.
Another random document with
no related content on Scribd:
number of soldiers, Kuropatkin’s army would have numbered
approximately one million!
And the people who thus shed their blood more freely than the
Christian Russians would be excusable if they deserted en masse,
because the Jews enjoy none of the privileges accorded to the
Russians, and they could not therefore be blamed if they refused to
look upon Muscovy as their fatherland. But, in spite of the injustice
done them by the Czar’s Government, they generously gave their
lives to the Czar. And the Czar’s agents in return egged on the
hooligans of all Southern and Western Russia to pillage, burn, and
destroy Jewish property, and to beat and kill Jewish men and
203
women.
These experiences and the apprehension of massacres on a
larger scale have impelled the Jews to form a great revolutionary
association for organised resistance to the organised forces of their
enemies. A secret society—already notorious as the Bund—arose in
Lithuania, whence it spread to Poland and other parts of the Russian
Empire. Its aims are to foster Jewish national feeling and to protect
Jewish interests. But the protection which this body could afford the
victims of deliberate persecution was necessarily limited. If it rescued
them from occasional slaughter, it could not defend them against
chronic starvation. Consequently, the exodus, especially from the
Province of Mohileff, continued: The emigrants were, for the most
part, Jewish young people of both sexes, who, not having any
means of existence, left the towns and villages. Some villages even
became quite deserted. In the town of Mohileff itself, where there are
no factories of any kind or industrial or commercial undertakings
except shops which are held by Jews, business was quite
204
suspended. Within the next five months no fewer than 75,160
205
Russian Jews arrived in New York alone.
How this readiness to quit hearth and home, in order to seek a
new life under unknown skies in the furthest corners of the earth,
carries us back across the ages to the flight of Israel from Egypt! To
the Russian Jews groaning in servitude the Czar’s Empire is a
foreign land; his religion a foreign religion. In leaving Russia they
leave a hotbed of idolatry as fierce, as cruel, as Godless as the
idolatry of Egypt, Babylon, Syria, or Rome. To them the Russian god
who can sanction such persecution is a veritable Moloch. He can
claim no kinship with Jehovah. They owe it to themselves to escape
from the house of bondage, and to their God to continue bearing
witness to His unity. They, therefore, like their remote ancestors,
seek freedom of worship by expatriation. Treated as aliens in their
native country, they renounce it with as little regret as if they had not
been born and bred in it. There are, of course, both in Poland and in
Russia proper Jews who would gladly conform in everything except
religion. Such Jews deplore the estrangement of the Jew from the
Gentile, and believe that the lot of the former can be improved only
by the removal of the legal restrictions which perpetuate that
estrangement. According to them, if the Jews were allowed to mingle
freely with the other inhabitants of the Empire, they would in time
lose all those characteristics which mark them off as a people apart,
and become patriotic subjects of the Czar. But the Russian
Government in its persecution of the race makes no invidious
distinctions between these “Assimilators” and their sterner brethren.
The Jew who ventures to advise assimilation alienates his friends
without conciliating his masters. By its indiscriminate severity the
Russian autocracy feeds the old spirit of dogged resistance, sullen
resentment, and inflexible arrogance.
It also feeds, as might have been expected, the old dream of
Redemption and national rehabilitation. The Russian Ghetto at the
present day is the citadel of Hebrew orthodoxy and the recruiting
ground for the Zionist movement of which we shall speak in the
sequel. It is natural that it should be. The Jew in the Empire of the
Czars finds little or no scope for development. As we have seen, he
is debarred from holding real property, from pursuing liberal
professions, from engaging in many trades. He is a stranger in the
land of his birth, an outcast among his fellow-countrymen. Chronic
contempt and oppression are only relieved by periodical massacre.
Forbidden to be a citizen, he cannot be a patriot. He has no life in
the present. He, therefore, lives in the future. He is an
uncompromising idealist. The same conditions which deprive him of
all inducement to national assimilation also encourage his religious
and social separatism. The intolerance of his Christian neighbours
reacts on his own bigotry. If politically he lives on hopes, religiously
he lives on traditions. Amidst all his calamities, the Jew of the
Russian Ghetto is sustained by the expectation that the real history
of his race is still to come. He believes that the ruins of the Temple
will one day prove the foundations of new greatness. While awaiting
the fulfilment of the ancient prophecies, he clings to the tribal
distinctions, to the ceremonial laws, and to all those rules of
omission and performance which tend to perpetuate his self-
isolation. In the West the Jews have, as patriotic citizens of various
states, succeeded, by generous concessions quite compatible with
true loyalty to their traditions, in the effort to reconcile the old Jewish
life with modern political conditions. In Russia the Jews are denied
the opportunity. But they still love the land. Therein lies the irony and
the hope.
Such is the lot of Israel in Russia. It is hardly better on the
western side of the Pruth—in that other European country which
within three days’ journey of London continues the Middle Ages.
CHAPTER XXII
IN ROUMANIA