Professional Documents
Culture Documents
1.1 Introduction: What Are GI pursuits. Keeping track of all this activity is important,
and knowing where it occurs can be the most conve-
Science and Systems, and nient basis for tracking. Knowing where something
happens is of critical importance if we want to go
Why Do They Matter? there ourselves or send someone there, to find more
information about the same place, or to inform peo-
Almost everything that happens, happens somewhere. ple who live nearby. In addition, geography shapes
We humans confine our activities largely to the the range of options that we have to address things
surface and near-surface of the Earth. We travel over that happen, and once they are made, decisions have
it and through the lower levels of its atmosphere, and geographic consequences. For example, deciding the
we go through tunnels dug just below the surface. We route of a new high-speed railroad may be shaped by
dig ditches and bury pipelines and cables, construct topographic and environmental considerations, and
mines to get at mineral deposits, and drill wells the chosen route will create geographic winners
to access oil and gas. We reside on the Earth and and losers in terms of access. Therefore geographic
interact with others through work, leisure, and family
2 Introduction
Figure 1.2 Navigating tourist destinations is a geographic problem.
(B)
(C)
Radiation (μ Sv/h)
Source: PASCO Corporation and courtesy of Keiji
Source: Soma City and courtesy of Keiji Yano
Yano
Figure 1.3 (A) The passage of the tsunami arising out of the Great East Japan (Tōhoku) earthquake of March 11, 2011. It had subsequent
effects on Soma City in terms of (B) radiation (measured in mSv/h (micro Sievert per hour) and (C) tsunami inundation.
▲
4 Introduction
▲
phenomena—including the human population, the built reports of how localities have been affected, and orga-
environment, and transportation infrastructure—in nizing evacuation), the medium term (e.g., managing
preparation for a major earthquake disaster and protec- the disruption to industrial supply chains), and the
tion against many of its foreseeable consequences. long term (e.g., prioritizing repair and replacement of
damaged transport infrastructure). All these actions
Yet the science of predicting the location, timing,
take place in an organizational context. Early warn-
and intensity of earthquakes has made little progress
ing systems are very much an international effort. In
over the past century. A magnitude-9.0 earthquake is a
terms of addressing effects after the event, the Tōhoku
very rare event and so did not fall within any disaster-
earthquake raised issues that were best addressed at
management scenario prior to the event. For example,
the national level, whereas much of the implementation
the Fukushima reactors had been built to withstand a
was best effected at local levels.
magnitude-8.6 earthquake on the basis of historic occur-
rences plus a safety margin: but not an event of magni-
The three Ps of disaster management are
tude 9.0. However, even when major events are unfore-
prevention, preparedness, and protection.
seen, GI science and systems are integral to response and
GI science and systems are integral to each
recovery in the short term (e.g., alerting populations to
of them.
the imminent arrival of a tsunami, coordinating citizen
evidence is found). Finding new locations for retailers, terms from the dynastic (perhaps thousands of years;
with its focus on design, is an example of a normative see Box 1.2) to the diurnal, but very much longer
application of GI systems. But to predict how consum- with respect to understanding geological or geomor-
ers will respond to new locations, it is necessary for phological change. At one end of the human time
retailers to analyze and model the actual patterns of spectrum, some decisions are operational and are
behavior they exhibit. Therefore, the models they use required for the smooth day-to-day functioning of an
will be grounded in observations of messy reality that organization, such as how to control electricity inputs
have been tested in a positive manner. into grids that experience daily surges and troughs in
Design is concerned with improving the world— usage. At slightly longer timescales, tactical decisions
with decisions that when implemented achieve certain might include where to cut trees in next year’s forest
desired objectives, such as constructing new housing harvesting plan. Still other decisions are more infre-
subdivisions, developing conservation plans, or defin- quent and strategic in nature, such as those required
ing sales territories. In recent years the term geodesign to give an organization long-term direction, as when
has become a popular way of referring to design deci- a retailer decides to expand or rationalize its store
sions at geographic scales, supported by GI systems. network (Figure 1.4). At the far end of the human time
All of us would like to design improvements to the spectrum, Box 1.2 describes how the geographic
world, and GI systems are valuable tools for doing so.
Although most work with GI systems is considerably
Figure 1.4 Many store location principles are generic across different
more mundane, it is always good to bear its grander
retail markets, as with Tesco’s investment in Ostrava, Czech Republic.
potential in mind. As we show in Section 14.4,
geodesign combines two important functions of GI
systems—the ability to capture new ideas through
sketching (creating/editing new features) and the
ability to evaluate them and assess their impacts. A
user might sketch a design for a new development,
for example, and ask the GI system to predict its
impacts on transportation, groundwater, and air
pollution.
With a single collection of tools, GI systems
are able to bridge the gap between curiosity-
driven science and practical problem solving
The third way in which problems can be distinguished
is on the basis of their time scale, ranging in human
6 Introduction
▲
Figure 1.5 The Great Britain Geography of the Longleys, Goodchilds, Maguires, and Rhinds. In each case the shorter (blue) line delineates the
smallest possible area within which 95% of name bearers reside, based on 1881 Census of Population figures, and the outer (red) line encloses
the smallest area that accommodates the same proportion of adult name bearers according to a recent address register.
▲
8 Introduction
Technical Box 1.3
Some Technical Reasons Why Geographic Information Is Special and Why GI Science
and Systems Have Developed
● It is multidimensional, because two coordinates must equivalent to a 1:1 million-scale map or a
be specified to define a location, whether they be x 1:24,000-scale one (see Section 3.7).
and y or latitude and longitude; and a third coordi-
● It may be represented in different ways inside a com-
nate is needed when elevation is important.
puter (see Chapter 3), and how this is done can strongly
● It is voluminous because a geographic database can influence the ease of analysis and the end results.
easily reach a terabyte in size (see Table 1.2).
● It must often be projected onto a flat surface, for
● It may be collected by citizens, governments, or other reasons identified in Section 4.8.
organizations, and it may prove useful to pool infor-
● It requires many special methods for its analysis (see
mation from these diverse sources.
Chapters 13 and 14).
● It may be represented at different levels of spatial
● It may be transformed to present different views of
resolution, for example, by using a representation
the world, for example, to aid interpretation.
in this book is of spatial analysis (see Chapters 13 become clear in later chapters and are briefly
and 14), not geographic analysis, to emphasize this summarized in Box 1.3.
versatility.
Another term that has been growing in usage
in recent years is geospatial—implying a subset
of spatial applied specifically to the Earth’s surface 1.2 Data, Information, Evidence,
and near surface. In this book we have tended Knowledge, and Wisdom
to avoid geospatial, preferring geographic, and
we use spatial where we need to emphasize Information systems help us to manage what we
generality. know, by making it easy to organize and store,
access and retrieve, manipulate and synthesize, and
Although there are subtle distinctions
apply to the solution of problems. We use a vari-
between the terms geographic(al), spatial,
ety of terms to describe what we know, including
and geospatial, for many practical purposes
the five that head this section and that are shown
they can be used interchangeably.
in Table 1.1. There are no universally agreed-on
People who encounter GI for the first time definitions of these terms. Nevertheless it is worth
are sometimes driven to ask why geography is so trying to come to grips with their various meanings
important; why, they ask, is spatial special? After because the differences between them can often be
all, there is plenty of information around about significant, and what follows draws on many sources
geriatrics, for example, and in principle one could and thus provides the basis for the use of these
create a geriatric information system. So why has GI terms throughout the book. Data clearly refers to
spawned an entire industry, if geriatric information the most mundane kind of information and wisdom
has not done so to anything like the same extent? to the most substantive. Data consist of numbers,
Why are there unlikely to be courses in universi- text, or symbols, which are in some sense neutral
ties specifically in geriatric information science and and almost context-free. Raw geographic facts, such
systems? Part of the answer should be clear already: as sensor measurements of temperature at a specific
almost all human activities and decisions involve a time and location, are examples of data. When data
location component, and the location component is are transmitted, they are treated as a stream of bits;
important. Another reason will become apparent in a crucial requirement is to preserve the integrity of
Chapter 2, where we will see that working with GI the data set. The internal meaning of the data is
involves complex and difficult choices that are also irrelevant in such considerations. Data (the noun is
largely unique. Other, more technical reasons will the plural of datum) are assembled together in a
database (see Chapter 9), and the volumes of data doing the latter because of the tools they provide for
that are required for some typical applications are combining information from different sources.
shown in Table 1.2.
GI systems do a better job of sharing data
The term information can be used either narrowly
and information than knowledge, which is
or broadly (and we use both in this book). In a narrow
more difficult to detach from the knower.
sense, information can be treated as devoid of mean-
ing and therefore as essentially synonymous with data Knowledge does not arise simply from having
as defined in the previous paragraph. Others define access to large amounts of information. It can be
information as anything that can be digitized, that is, considered as information to which value has been
represented in digital form (see Chapter 3), but also added by interpretation based on a particular con-
argue that information is differentiated from data by text, experience, and purpose. Put simply, the infor-
implying some degree of selection, organization, and mation available in a book or on the Internet or on
preparation for particular purposes—information is a map becomes knowledge only when it has been
data serving some purpose or data that have been read and understood, as when an experienced hiker
given some degree of interpretation. Information is chooses not to set off into unfamiliar terrain having
often costly to produce, but once digitized, it is cheap read about it and taken stock of the weather forecast.
to reproduce and distribute. Geographic data sets, for How the information is interpreted and used will be
example, may be very expensive to collect and assem- different for different readers depending on their pre-
ble, but very cheap to copy and disseminate. One vious experience, expertise, and needs. It is impor-
other characteristic of information is that it is easy to tant to distinguish two types of knowledge: codified
add value to it through processing and through merger and tacit. Knowledge is codifiable if it can be written
with other information. GI systems are very useful for down and transferred relatively easily to others. Tacit
Table 1.2 Potential GI database volumes in bytes for some typical applications (volumes estimated to the nearest order of magnitude). Strictly,
bytes are counted in powers of 2—1 kilobyte is 1024 bytes, not 1000.
1 megabyte 1 000 000 (220) Single data set in a small project database
30
1 gigabyte 1 000 000 000 (2 ) Entire street network of a large city or small country
40
1 terabyte 1 000 000 000 000 (2 ) Elevation of entire Earth surface recorded at 30 m intervals
50
1 petabyte 1 000 000 000 000 000 (2 ) Satellite image of entire Earth surface at 1 m resolution
60
1 exabyte 1 000 000 000 000 000 000 (2 ) A possible 3-D representation of the entire Earth at 10 m
resolution
1 zettabyte 1 000 000 000 000 000 000 000 (270) One-fifth of the capacity (in 2013) of U.S. National Security
Agency Utah Data Center
10 Introduction
knowledge is often slow to acquire and much more bases, conducting analytical operations in a frac-
difficult to transfer. Examples include the knowledge tion of the time they would take to do by hand, and
built up during an apprenticeship, understanding of automating the process of making useful maps. GI
how a particular market works, or familiarity with using systems also process information, but there are limits
a particular technology or language. This difference to the kinds of procedures and practices that can be
in transferability means that codified and tacit knowl- automated when turning data into useful information.
edge need to be managed and rewarded quite differ- The question of whether and how such selectiv-
ently. Because of its nature, tacit knowledge is often a ity and preparation for purpose actually adds value,
source of competitive advantage. or whether the results add insight to interpretation
Some have argued that knowledge and infor- in geographic applications, falls into the realm of GI
mation are fundamentally different in at least three science. This rapidly developing field is concerned
important respects: with the concepts, principles, and methods that are
put into practice using the tools and techniques
● Knowledge entails a knower. Information exists
of GI systems. It provides sound principles for the
independently, but knowledge is intimately related
sample designs used to create data and the ways
to people.
in which data can be turned into information that
● Knowledge is harder to detach from the knower is representative of a study area. GI science also
than information; shipping, receiving, transferring provides a framework within which new evidence,
it between people, or quantifying it are all much knowledge, and ultimately wisdom about the Earth
more difficult than for information. can be created, in ways that are efficient, effective,
● Knowledge requires much more assimilation—we and safe to use.
digest it rather than hold it. We may hold conflict- Like all sciences, an essential requirement of GI
ing information, but we rarely hold conflicting science is a method for discovering new knowledge.
knowledge. The GI scientific method must support:
Evidence is considered a halfway house between ● Transparency of assumptions and methods so that
information and knowledge. It seems best to regard it other GI scientists can determine how previous
as a multiplicity of information from different sources, knowledge has been discovered and how they
related to specific problems, and with a consistency might themselves add to the existing body of
that has been validated. Major attempts have been knowledge
made in medicine to extract evidence from a welter ● Best attempts to attain objectivity through a
of sometimes contradictory sets of information, drawn detached and independent perspective that
from different geographic settings, in what is known avoids or accommodates bias (unintended or
as meta-analysis, or the comparative analysis of the otherwise)
results of many previous studies. ● The ability of any other qualified scientist to repro-
Wisdom is even more elusive to define than the
duce the results of an analysis
other terms. Normally, it is used in the context of deci-
sions made or advice given, which is disinterested, ● Methods of validation using the results of the
based on all the evidence and knowledge available. analysis (internal validation) or other information
It is given with some understanding of the likely conse- sources (external validation)
quences of various actions and assessment of which is ● Generalization from partial representations that
or are most beneficial. Almost invariably, knowledge is are developed for analytical purposes to the wider
highly individualized rather than being easy to create objective reality that they purport to represent
and share within a group. Wisdom is in a sense the top
How, then, are problems solved using a scien-
level of a hierarchy of decision-making infrastructure.
tific method, and are geographic problems solved
in ways different from other kinds of problems? We
humans have accumulated a vast storehouse of
1.3 GI Science and Systems knowledge about the world, including information
both on how it looks—that is, its forms—and how
GI systems are computer-based tools for collect- it works—that is, its dynamic processes. Some of
ing, storing, processing, analyzing, and visualizing those processes are natural and built into the design
geographic information. They are tools that improve of the planet, such as the processes of tectonic
the efficiency and effectiveness of handling informa- movement that lead to earthquakes and the pro-
tion about objects and events located in geographic cesses of atmospheric circulation that lead to hur-
space. They can be used to carry out many useful ricanes. Others are human in origin, reflecting the
tasks, including storing vast amounts of GI in data- increasing influence that we have on ecosystems,
Figure 1.9 The form of the Earth’s surface shows enormous variability, for example, between (A) the deserts of the southwest United States and
(B) the settled landscape of Northern England.
12 Introduction
Courtesy: National Institute for Space Research INPE Brazil
© BernardAllum/iStockphoto
This perspective is consistent with our understanding Sir Isaac Newton established the Laws of Motion,
of places in the world as sites at which unique according to which all matter behaves in ways that
relations develop among people and the locations can be perfectly predicted. From Newton’s Laws we
that they occupy and the accumulated effects of these are able to predict the motions of the planets almost
relations over time. GI systems provide ways of gener- perfectly, although Einstein later showed that cer-
alizing about and between places, albeit in ways that tain observed deviations from the predictions of the
acknowledge differences between them. Place-based Laws could be explained with his Theory of Relativity.
methods in GI systems make it possible to think of Laws of this level of predictive quality are few and
geography as repetitive (where in the world is like this far between in the geographic world of the Earth’s
place?) while at the same time remaining sensitive to surface. The real world is the only laboratory that is
the unique context of unique places. available for understanding the effects of many fac-
General knowledge about unique places comes tors on unique places in the social and environmental
in many forms. Classification is perhaps the simplest sciences, and considerable uncertainty is generated
and most rudimentary and is widely used in prob- when we are unable to control for all conditions.
lem solving. In many parts of the United States and These problems are compounded in the social realm,
other countries, efforts have been made to limit the where the role of human agency makes it almost
development of wetlands in the interest of preserv- inevitable that any attempt to develop rigid laws will
ing them as natural habitats and avoiding excessive be frustrated by isolated exceptions. Thus, whereas
impact on water resources. To support these efforts, market researchers use spatial interaction models,
resources have been invested in mapping wetlands, in conjunction with GI systems, to predict how many
largely from aerial photography and satellite imagery. people will shop at each shopping center in a city,
These maps simply classify land, using established substantial errors will occur in the predictions—
rules that define what is and what is not a wetland because people are in significant part autonomous
(Figure 1.11). agents. Nevertheless, the results are of great value
More sophisticated forms of knowledge include in developing location strategies for retailing. The
rule sets—for example, rules that determine what use Universal Soil Loss Equation, used by soil scientists in
can be made of wetlands, or what areas in a forest conjunction with GI systems to predict soil erosion,
can be legally logged. The U.S. Forest Service has is similar in its rather low predictive power, but again
rules to define wilderness and to impose associated the results are sufficiently accurate to be very useful
regulations regarding the use of wilderness, including in the right circumstances. “Good” usually means
prohibition on logging and road construction. Such “good enough for this specific application” in GI
rules can be captured in the data model of a GI data- systems applications.
base (see Chapter 7). Solving problems involves several distinct
Much of the knowledge gathered by the activi- components and stages. First, there must be an
ties of scientists suggests the term law. The work of objective, or a goal that the problem solver wishes
14 Introduction
Next Page
Table 1.3 World Internet usage and penetration statistics as of June 30 2012. (Source: www.internetworldstats.com)
example, the radio, the telephone, and the televi- through advertising, with Google perhaps the most
sion). RealTimeStatistics.org estimated that in 2013 obvious exponent of understanding the importance
some 2.4 billion of the world’s 7 billion population of location in delivering targeted advertising.
were Internet users, although stark variations in Inter- We now turn to consider the other components
net availability and usage remain—see Table 1.3. of a GI system. First, the user’s hardware is the device
Many of the early Internet applications of GI that the user interacts with directly in carrying out GI
systems remain in use, in updated form, today. They system operations, by typing, pointing, clicking, or
range from using GI systems on the Internet to dis- speaking, and that returns information by displaying
seminate information on the location of businesses it on the device’s screen or generating meaning-
(e.g., www.yell.com), to consolidated lists of available ful sounds. Traditionally, this device sat on an office
goods and services, to direct revenue generation desktop, but today’s user has more options and much
through subscription services, to helping members of more freedom because GI system functions can also
the public to participate in important local, regional, be delivered through smartphones, notebooks, and
and national debates. The Internet has become very in-vehicle devices.
popular as a vehicle for delivering business GI system The second component is the software programs
applications for several reasons. It provides an estab- that represent the world by running locally in the
lished, widely used platform and accepted standards user’s machine or remotely in the Cloud. Increasing
for interacting with information of many types. It also numbers of users manipulate geographic information
offers a cost-effective way of linking distributed users using executable open-source software code that is
(for example, telecommuters and office workers, cus- often freely available for download across the Web.
tomers and suppliers, students and teachers). From Users can execute this code and also modify it if they
the early days onward, the interactive and exploratory wish. Other open software is also available for use as
nature of navigating linked information became a linked executable files, although the computer code
great hit with users. that was used to generate it is not made available
Internet-enabled devices became portable in the by its authors and so cannot be modified by other
early 2000s (see Section 10.3) with the wide diffusion users. Both of these types of software may be made
of location-aware smartphones and other handheld available by their authors in the interests of solving
devices and the availability of wireless networks in particular problems, or they may be made available
public places such as airports and railway stations. as part of larger linked software libraries, such as
The subsequent innovation of 3G and 4G mobile the R project for statistical computing and graphics
broadband now routinely allows portable and in- (www.r-project.org/). Some open software libraries
vehicle devices to deliver location-based services (see have a focus on geographic problem solving and as
Section 10.3.2) to users on the move. Users receive such are described as GI systems—with the Quantum
real-time geographic services such as mapping, routing, GIS Project (www.qgis.org/) providing perhaps the
traffic congestion, and geographic yellow pages. best contemporary example. The international
These services are usually funded directly or indirectly “Geo for All” initiative (www.geoforall.org/) seeks
16 Introduction
Biographical Box 1.4
Sir Walter Bodmer, Human Geneticist of long-established families who can trace their recent
ancestry to particular locations and to relate this to
Sir Walter Bodmer (Figure 1.13A) is a German-born British historical and archaeological evidence of invasion and
human geneticist. He studied mathematics and statistics at settlement. The DNA samples of thousands of volunteers
Cambridge University, doing his PhD under the renowned have been analyzed in ways that reveal the biological
statistician and geneticist Sir Ronald A. Fisher before join- traces of successive waves of colonizers of Britain—such
ing Nobel-Prize-winning microbiologist Joshua Lederberg’s as the original ancient British settlers, the Anglo-Saxons,
laboratory in the Genetics Department of Stanford Univer- and the Vikings. The resulting genetic map (Figure 1.13B)
sity in 1961. He was an early pioneer of the use of comput- shows, for example, that the Viking invasion of Britain
ing to study population genetics, and after a period as a was predominately by Danish Vikings, whereas the
faculty member at Stanford, he left to become the first Orkney Islands were settled by Norwegian Vikings.
Professor of Genetics at Oxford University in 1970.
Walter was a pioneering advocate of public en-
Population genetics is the study of the changing gagement with science and technology and remains
distribution of gene variants (or alleles) under the influ- very active in this field. He was elected a Fellow of the
ence of four important evolutionary processes: natural Royal Society in 1974 and was awarded the Society’s
selection, genetic drift, mutation, and gene flow. Walter Royal Medal in 2013 for seminal contributions to popu-
was one of the first to suggest the idea of identify- lation genetics, gene mapping, and our understanding
ing the physical and functional characteristics of the of familial genetic disease.
20,000–25,000 genes of the human genome. This idea
was subsequently pursued in the Human Genome Check out the People of the British Isles project at
Project, which in important respects remains the www.peopleofthebritishisles.org.
ultimate investigative analysis using Big Data.
Geography is central to study of the human genome,
for the very good reason that most families remain
settled in one part of the world for many generations.
In 2005 Walter was appointed to lead a major (£2.3 million/
$3.8 million) project to examine geographic variations
in the genetic makeup of the people of the British Isles.
The aim of this project is to measure the genetic profiles
Courtesy: Wellcome Trust POBI Project
UK Map Boundaries
UK Map Boundaries Source:Source:
2001 2001 Census,
Census, Output
Output AreaBoundaries.
Area Boundaries. Crown
Crowncopyright 2003.
copyright 2003.
Republic of Ireland and Isle of Man Maps: Copyright EuroGeographics for the administrative boundaries.
Republic of Ireland and Isle of Man Maps: Copyright EuroGeographics for the administrative boundaries.
Figure 1.13 (A) Walter Bodmer, human geneticist; and (B) a genetic map of the United Kingdom, with different colors denoting the genetic
groups of long-settled residents.
18 Introduction
Figure 1.14 Section of the 1:63,360 scale
geological map of Abingdon, UK—the first
known example of a map produced by
automated means and published in a stan-
dard map series to established cartographic
standards.
At the time a modest computing system—far less on the collection of geographically referenced data
powerful than today’s personal computer—could or activities that build on this, such as value-added
be obtained for about $250,000 (about $750,000 in reselling of data, consulting, app development, train-
2015 prices) and the associated software for about ing, system integration, software design, and so forth.
$100,000 ($300,000 today). Even at these prices, the Many new opportunities (particularly for start-up
benefits of consistent management using GI systems companies) are arising from use of open-source
and the decisions that could be made with these new software or Open Data (see Section 1.5.3), where the
tools substantially exceeded the costs. The market for underpinning costs of data or software production are
GI software continued to grow, computers continued met through volunteer activity or are underwritten by
to fall in price and increase in power, and the GI soft- research institutions such as universities.
ware industry has been growing ever since. Such activity has mushroomed with the ever-
wider use of the Internet to disseminate software
The modern history of GI systems dates
and data, as well as the innovation and use of search
from the early 1980s, when the price of
engines, blogs, and social networking sites to spread
sufficiently powerful computers fell below
news about what is available. This section looks at
a critical threshold.
the diverse roles that people play in the business of
As indicated earlier, the history of GI systems is a GI systems, and is organized by the major areas of
complex story, much more complex than can be human activity associated with it.
described in this brief history, but Table 1.4 summa-
rizes developments from this early history to more 1.5.2.1 The Software Industry
recent commercial developments and the wide Many of the roots of the development of GI systems
advent of Open Data and open-source software. can be traced to the commercial development of off-
the-shelf software packages that began in the 1980s.
1.5.2 The Business Perspective Today’s commercial solutions are manifest in a wide
GI systems provide the underpinning technology for range of forms. Thus, for example, the major software
GI science, and many people play many roles in their vendor Esri, Inc. (Redlands, California) today sells a
development. Important activities range from soft- family of GI system products under the ArcGIS
ware development to championing the importance brand name to service the disparate needs of its
of the spatial dimension in the activities of an orga- diverse user base. At its core are three niche desk-
nization through spatial data analysis. The business top systems: Basic, for viewing spatial data, creating
activities of many established companies are based layered maps, and performing rudimentary spatial
20 Introduction
Table 1.4 (continued )
analysis; Standard, which offers more advanced tools 1.5.2.2 The Data Industry
for manipulating spatial databases; and Advanced, The acquisition, creation, maintenance, dissemina-
which has further additional facilities for data manipu- tion, and sale of GI data also account for a huge
lation, editing, and analysis. Other vendors specialize volume of economic activity. Traditionally, a large
in certain niche markets, such as the utility industry, proportion of GI data have been produced centrally
or military and intelligence applications. The software by national mapping agencies, such as the U.S.
industry employs several thousand programmers, Geological Survey (USGS) or Great Britain’s Ordnance
software designers, systems analysts, application spe- Survey. In response to U.S. federal government policy,
cialists, and sales staff, with backgrounds that include USGS supplies Open Data, defined as being priced
computer science, geography, and many other disci- at no more than the cost of reproduction of data.
plines. Given the all-pervasive nature of GI systems Elsewhere, some or all of available national map-
and their linkage to other forms of software, it no ping agency data are charged for to recoup the costs
longer makes much sense to guesstimate the global of their collection, although this is changing under
value of the GI system industry, but where such esti- various government Open Data initiatives. In Great
mates are attempted, it seems clear that the industry Britain, for example, many Ordnance Survey mapping
continues to grow rapidly—for example, estimates series have been made available free of charge since
of GI system sales in China, where GeoStar (www. 2010; other, commercially valuable, data sets are not
geostar.com.cn/Eng/) is widely used, suggest that the made available, however, because the government
value of the industry has increased 50-fold over the is unwilling to bear the projected $45 million cost
last decade. GI systems are a dynamic and evolving of losing future revenue streams for highly detailed
field, and their future continues to offer many exciting mapping products used by corporations such as util-
developments, some of which we will discuss in the ity companies. The innovation of free-to-view map-
final chapter. ping services such as Google Maps (maps.google.
Today a single GI system vendor offers com) and Microsoft Bing Maps and Virtual Earth, with
many different products for distinct their business model based on advertising revenues,
applications. is having profound implications for the provision of
Figure 1.15 The Nestoria Website (www.nestoria.com) presents a consolidated view of available property for sale or rent, alongside Open Data
about neighborhoods. It is funded by the realtors who supply the linked listings and who “pay by click” for users searching for property.
map data (see Chapter 18). Specialist mapping sites, organizations in pursuit of their day-to-day opera-
such as those used in the real-estate industry, present tional activities. Subject to legal data protection
a consolidated map view of all available properties requirements, such data may be reused in pursuit of
for sale or rent, along with Open Data pertaining to the tactical or strategic objectives of the organization,
neighborhood quality (Figure 1.15). Volunteer-driven, and many retailers, energy suppliers, and financial-
open-source approaches to online cartography such service providers employ teams of analysts to devise
as OpenStreetMap (www.openstreetmap.org) are also better ways of servicing customers and increasing
revolutionizing online cartography with their novel market share. Still other companies (including cell
approach to map production and have stimulated phone companies) fulfill the role of value-added
the growth of commercial firms such as CloudMade data resellers of commercial and public-sector data.
(cloudmade.com). Important applications for such services are opening
up, including improving public-service delivery. The
Open Data are revolutionizing the business
GeoWeb (see Chapter 10) is creating fertile envi-
model for GI applications.
ronments in which a very wide range of public- and
Geographically referenced attribute data relating private-sector data sources can be combined,
to customer transactions are collected by many analyzed, and displayed.
22 Introduction
Private companies are now also licensed to col- One of the most profound changes concerning
lect fine-resolution data using satellites and to sell government and GI in recent years is in the ways in
them to customers—as, for example, with GeoEye’s which government information is shared with citizens
(geofuse.geoeye.com) IKONOS satellite data (see and external organizations in many parts of the world
Table 1.4). Other companies collect similar data from (see Chapter 17). Public-sector information (PSI) is
aircraft. Still other companies specialize in the produc- defined by the wide range of information that govern-
tion of good-quality data on street networks, a basic ment bodies collect, produce, reproduce, and dis-
requirement of many delivery companies. TomTom seminate across various areas of activity to fulfill their
(www.tomtom.com) is an example of this industry, public-task functions. In terms of the nomenclature
employing over 1800 staff in producing, maintain- of Table 1.1, PSI is most conveniently thought of as
ing, and marketing good-quality street-network data information that has not been subjected to processing
worldwide. or any other manipulation beyond that necessary for
its first use by government. Given the multiplicity of
1.5.2.3 GI Services government functions, information that has been pro-
As developments in the information economy cessed for one function (such as estimating local levels
gather still further momentum, many organizations of homelessness or overcrowding) may be further
are becoming focused on delivering integrated refined and reused in the process of devising different
business solutions rather than raw or value-added measures for other functions (such as the composite
data. The Cloud makes possible easy user access to measures of barriers to housing and public services
data from sites that may be remote from locations used in various deprivation, or hardship, indices).
where more specialized analysis and interpretation It is usual to think of PSI as having high fixed costs
functions are performed. In these circumstances, it of assembly or collection and much smaller variable
is no longer incumbent on an organization to man- costs of copying and dissemination to users across
age either its own data or those that it buys from the Web or through other media. It is also increasingly
value-added resellers. For example, Esri Inc. offers a the case that the effort of rendering PSI into machine-
geographic data management service, in which data readable or linked form can be absorbed into the fixed
are managed and maintained for a range of clients costs of information provision (see the star ratings
that are at liberty to analyze them in quite separate schema discussed in Section 17.4.1). The small variable
locations. This may in time lead to greater vertical costs of disseminating some PSI have usually been
integration of the software and data industry— absorbed in the United States, but this practice has
for example, Esri Inc. has developed a business been by no means universal, or even common, in other
information solutions division and acquired its own parts of the world. It is only in recent years that many
geodemographic system (called Tapestry) to ser- governments worldwide have committed to greater
vice a range of business needs. This merging of availability of Open Data—which can be defined as
the software and data industries comes together in data that can be used, reused, and redistributed freely
the business of providing GI systems as a service, by anyone, subject only at most to the requirement to
in which the Cloud is used as a remote platform attribute and share alike. Open Data are not necessar-
on which software and data reside and GI system ily free at the point of delivery, but any charge is usually
operations are performed. no more than the cost of reproduction.
24 Introduction
specialized domain than in generic data science (see known as “Paris” (France) through mapping the extent
also Chapters 17 and 19). of attribution of this label to photographs uploaded
to Flickr, along with the more limited extents of other
more localized places identified by users.
1.5.5. The Geography Perspective Geography also tells us that widely scattered
Geographers have long agonized over the content, places (for example, in Washington, California,
coherence, and relevance of their discipline, and set Florida, or Vermont) share important social or physical
against this background, it is perhaps surprising that similarities that are manifest in different ways and to
they have nonetheless made time to investigate and differing degrees. GI systems provide a way of repre-
innovate in the use of new research methods in their senting these similarities, using standardized quan-
discipline. Throughout the period documented in titative measures and, hence, the repetition of place
Table 1.4, GI science and systems have developed effects across space. Geodemographic classifications
and maintained a special relationship to the academic provide one prominent example of the assignment
discipline of geography and other disciplines that deal of dispersed locations to place-relevant typologies,
with the Earth’s surface, including geodesy, landscape independent of their locational proximity to one
architecture, planning, and surveying. Yet any special another (see Figure 1.17).
relationship is multifaceted, nuanced, and sometimes In both of these examples, absolute location is
tense. This is even more so, given that the roots to not the principal focus, but it is the relative locations
much of geography lie in mapping for warfare, that of places and the connectivity and relations between
the military remain heavy users of GI systems, and that them that are important.
commercial software applications are rarely open to full Recent years have seen the popularization of the
academic scrutiny in deference to business priorities. term neogeography to describe developments in Web
The idiographic and nomothetic concerns of mapping technology and spatial data infrastructures
geography (Section 1.3) have recently been the focus that have greatly enhanced our abilities to assemble,
of thinking about the representation of place and share, and interact with geographic information online.
place effects. Place is a social construct of space by Allied to this is the increased crowd sourcing by online
humans and is key to the way that they understand communities of volunteered geographic information
their surroundings. Thus, although space exists (VGI), discussed in Section 1.5.6 below. Neogeography
independent of the existence of people, it is human is founded on the two-way, many-to-many interac-
interactions and experiences that crystallize space tions between users and Web sites that have emerged
into place through shared perceptions and recogni- under Web 2.0, as embodied in projects such as
tion. Figure 1.16 shows one delineation of the place Wikimapia (www.wikimapia.org) and OpenStreetMap
Image courtesy: Linna Li; base map courtesy Openstreetmap
Figure 1.16 Delineation of the extent of Paris, France, and locations associated with it, based on the density of geotagged Flickr photos for each
place name. Contours of the place surfaces are depicted by choosing a threshold value of point density visually (e.g., the threshold value for the
contour of Louvre is 500 points per square kilometer).
(B)
Courtesy: Chris Gale; contains National Statistics and Ordnance Survey data
© Crown Copyright and database right 2014
Figure 1.17 Geodemographic neighborhood classifications make it possible to quantify how similar one place is to another, subject to the
availability of consistent data across a jurisdiction. (A) A classification based on 2010 U.S. Census data, showing part of San Francisco; and (B) the
UK 2011 Output Area Classification, showing part of Southampton.
26 Introduction
some success in illustrating how better technology,
data, and methods make it possible to go beyond sim-
plistic depiction of raw data. Yet for others, the deploy-
ment of computer systems inherently implies greater
conviction than is warranted about the attributes and
behaviors of the subjects of research. If this is the case,
the ”critical” prefix can never be more than a sop that
implies, but does not deliver, acceptable levels of
humility when engaging with the subjects of research.
Unit AM7 Spatial statistics (13) Unit DM2 Database management systems (6)
Unit AM8 Geostatistics (13) Unit DM3 Tessellation data models (7)
Unit AM9 Spatial regression and econometrics (13 and 14) Unit DM4 Vector and object data models (7)
Unit AM10 Data mining (13 and 14) Unit DM5 Modeling 3-D, temporal, and uncertain
phenomena (7)
Unit AM11 Network analysis (14)
Unit AM12 Optimization and location-allocation Knowledge Area: Data Manipulation (DN)
modeling (14) Unit DN1 Representation transformation (7 and 8)
Unit DN2 Generalization and aggregation (11 and 12)
Knowledge Area: Conceptual Foundations (CF) Unit DN3 Transaction management of geospatial data (8)
Unit CF1 Philosophical foundations (1)
Unit CF2 Cognitive and social foundations (1) Knowledge Area: Geocomputation (GC)
Unit GC1 Emergence of geocomputation (15)
Unit CF3 Domains of geographic information (1)
Unit GC2 Computational aspects and
Unit CF4 Elements of geographic information (2 and 3) neurocomputing (15)
Unit CF5 Relationships (2 and 3) Unit GC3 Cellular Automata (CA) models (15)
Unit CF6 Imperfections in geographic information (5) Unit GC4 Heuristics (14)
Knowledge Area: Cartography and Unit GC5 Genetic algorithms (GA) (15)
Visualization (CV) Unit GC6 Agent-based models (15)
Unit CV1 History and trends (11 and 12)
Unit GC7 Simulation modeling (15)
Unit CV2 Data considerations (11 and 12)
Unit GC8 Uncertainty (5)
Unit CV3 Principles of map design (11)
Unit GC9 Fuzzy sets (5)
Unit CV4 Graphic representation techniques
(11 and 12) Knowledge Area: Geospatial Data (GD)
Unit CV5 Map production (11) Unit GD1 Earth geometry (4)
Unit CV6 Map use and evaluation (11) Unit GD2 Land partitioning systems (4)
▲
28 Introduction
▲
Unit GD3 Georeferencing systems (4) Unit GS4 Geospatial information as property (17)
Unit GD4 Datums (4) Unit GS5 Dissemination of geospatial
information (17)
Unit GD5 Map projections (4)
Unit GS6 Ethical aspects of geospatial information and
Unit GD6 Data quality (5)
technology (18)
Unit GD7 Land surveying and GPS (4)
Unit GS7 Critical GI systems (1)
Unit GD8 Digitizing (8)
Unit GD9 Field data collection (2, 8, 17)
Knowledge Area: Organizational and Institutional
Aspects (OI)
Unit GD10 Aerial imaging and photogrammetry (8) Unit OI1 Origins of GIS&T (1)
Unit GD11 Satellite and shipboard remote sensing (8) Unit OI2 Managing GIS operations and infrastructure (16)
Unit GD12 Metadata, standards, and Unit OI3 Organizational structures and procedures (16)
infrastructures (10)
Unit OI4 GIS&T workforce themes (1)
Knowledge Area: GIS&T and Society (GS) Unit OI5 Institutional and interinstitutional
Unit GS1 Legal aspects (18) aspects (18)
Unit GS2 Economic aspects (17) Unit OI6 Coordinating organizations (national and
international) (18)
Unit GS3 Use of geospatial information in the public
sector (17)
either in ignorance or in implicit recognition of the organizations or funding by universities using student-
convenience of the search engine and other services fee income (justified as enhancing the public-facing
that such companies provide. Disclosure of online profile of the organization). We discuss these different
identity (of which location is an integral part) helps us funding models in greater detail in Chapter 18.
to deal with the tyranny of small decisions in everyday The diversity of motivations underpinning the cre-
life, even if GI service providers then try to use this to ation, maintenance, and dissemination of GI and the
shape aspects of our identities as consumers in the software to analyze it leads to the following societal
interests of their advertising clients. concerns (see also the contribution of Sarah Elwood,
Choosing whether or not to use a particular discussed in Box 10.4):
search engine is an individual decision, but other
GI applications are fundamentally underpinned by ● The links between knowledge and power. The
collective attitudes to creation and use. VGI is the ways in which GI systems represent the Earth’s
collective harnessing of tools to create, assemble, surface, and particularly human society, have
and disseminate geographic data provided volun- been observed to privilege certain people,
tarily by individuals—such as street locations (e.g., phenomena, and perspectives, at the expense
www.openstreetmap.org), geotagged photographs of others. Minority views, and the views of indi-
(e.g., www.flickr.com), or locationally referenced viduals, can be submerged in this process, as can
restaurant reviews. The motivations for supply of information that differs from the official or con-
such information are multifaceted and may ebb and sensus view. GI systems often force knowledge
flow over time—recent years have seen much talk of into forms that are more likely to reflect the view
“Wiki fatigue”—but all such projects are dependent of the majority, or the official view of govern-
on funding models for creating and maintaining GI ment, and as a result marginalize the opinions of
systems—for example through individual donations or minorities or the less powerful. Countering this
corporate sponsorship. outlook is the view that greater access to PSI
The funding imperative holds in all other societal (see Section 17.4), especially over the Web,
deployments of GI. Governments pay the cost of has enlivened debate and has gone some
ensuring the availability of Open Data and public- considerable way toward leveling the playing
sector information (PSI). The costs of creating open field in terms of data access. Moreover, the near-
software are often borne by research-grant-awarding ubiquitous availability of mapping services and
30 Introduction
Individual understanding of how people think about their geo-
graphic surroundings. If GI systems are to be easy to
use, they must fit with human ideas about such topics
as driving directions or how to construct useful and
understandable maps.
GI
Science Many roots to GI systems can be traced
to the spatial analysis tradition in the
discipline of geography.
Computer Society
Yet if there are enduring qualities to the body of
Figure 1.19 The remit of GI science, according to Project Varenius
knowledge that constitutes GI science, so there is
(www.ncgia.ucsb.edu/varenius/varenius.html).
also profound change in the setting in which it is
applied. Astonishing improvements in our technical
capacity to create and manipulate digital informa-
One disarmingly simple way of viewing this tion have profoundly changed the ways in which GI
taxonomy is in the framework of the Varenius proj- systems are understood and used, both in business
ect (www.ncgia.ucsb.edu/varenius/varenius.html; and in government. But perhaps the most profound
Figure 1.19). Here, GI science is viewed as anchored changes of all have taken place in society at large.
by three concepts: the individual, the computer, and Just as a GI “system” is no longer a readily identifi-
society. These form the vertices of a triangle, and able and clearly bounded “thing,” so the notion of
GI science lies at its core. The various terms that are a “GI system user” has become a similarly anoma-
used to describe GI-science activity can be used lous conception—for the simple reason that we are
to populate this triangle. Thus research about the now all consumers of GI and applications that use it.
individual is dominated by cognitive science, with Other changes characterize the environment in which
its concern for understanding of spatial concepts, GI science is applied, given our vastly improved
learning and reasoning about geographic data, abilities to identify and monitor the movements of
and interaction with the computer. Central to such objects and (consenting) individuals, the rise of citi-
research is the desire to engineer more readily intel- zen science, and the availability of vast arrays of data
ligible user interfaces, in the interest of improved that make it possible to devise rich yet comparable
human–computer interaction. Allied to this is the depictions of unique places.
desire to improve geovisualization and render intelli- We wonder where this will all end. Resolving the
gible the results of spatiotemporal analysis. Research “Where?” question places unending demands on
about the computer is dominated by issues of society’s evolving information infrastructures and,
representation, the adaptation of new technologies, in turn, begs many more questions concerning the
computation, and visualization. And finally, research sourcing, assembly, maintenance, and deployment
about society addresses issues of impact and societal of GI. How might the quality and reliability of volun-
context. teered geographic information be ascertained? How
can users best understand the standards by which the
information that they rely on has been created? How
1.7 GI Systems and Science might user interfaces be improved to render GI intel-
ligible to all relevant users? What are the obligations
in Society of governments to supply Open Data that are timely,
comprehensive, and safe to use? What issues arise
GI science is often about using the software environ- when commercial spatial data infrastructures (such
ment of GI systems to redefine, reshape, and resolve as Google Earth) are reused for scientific or policy
problems that are even older than the first GI systems. purposes? And what rights does the citizen have to
The need for methods of spatial analysis, for example, privacy, or to be forgotten?
dates from the first maps, and many methods were These are some of the key questions of GI
developed long before the first GI system appeared science that we address in this book. They have
on the scene in the mid-1960s. Another way to look profound implications for the ways in which we
at GI science is to see it as the body of knowledge interact with one another and the ways in which we
that GI systems implement and exploit. Map projec- interact with places. For billions of casual users, GI
tions (Chapter 4), for example, are part of GI science systems have become an integral part of the infor-
and are used and transformed through use of GI mation technology mainstream, yet this begs
systems. A further area of great importance to GI important questions about the roles, functioning,
systems is cognitive science, particularly the scientific and status of pervasive GI technologies in society.
1. Examine the geographic data available for the 4. Locate a subset of the issues identified in Box 1.5
area within 80 km (50 miles) of either where you in two triangular “GI science” diagrams like
live or where you study. Use it to produce a short that shown in Figure 1.17—one for themes that
(2500-word) illustrated profile of either the socio- predominantly relate to developments in science
economic or the physical environment. (See, for and one for themes that predominantly relate to
example, www.data.gov/ or data.gov.uk/data) developments in technology. Further details on
each of the themes is available in the online BoK.
2. What are the distinguishing characteristics of the
If time is available, give short written reasons
scientific method? Discuss the relevance of each
for your assignments. Compare the distribution
to the history of GI systems.
of issues within each of your triangles to assess
3. Has society-wide use of GI systems transformed the relative importance of the individual, the
the research agenda for GI science? Give reasons computer, and society in the development of GI
for your answer. science and of GI technologies.
Further Reading
De Smith, M., Goodchild, M. F., and Longley, P. A. Foresman, T. W. (ed.). 1998. The History of
2009. Geospatial Analysis: A Comprehensive Geographic Information Systems: Perspectives
Guide to Principles, Techniques and Software from the Pioneers. Upper Saddle River, NJ:
Tools (3rd ed.). Leicester: Troubador and Prentice Hall.
www.spatialanalysisonline.com.
Goodchild, M. F. 1992. Geographical information sci-
DiBiase, D., deMers, M., Johnson, A., Kemp, K., ence. International Journal of Geographical Informa-
Taylor Luck, A., Plewe, B. and Wentz, E. (eds.). 2006. tion Systems 6: 31–45.
Geographic Information Science & Technology Body
Wainwright, J. 2013. Geopiracy: Oaxaca, Militant
of Knowledge. Washington, DC: Association of
Empiricism, and Geographical Thought. New York:
American Geographers. www.aag.org/galleries/
Palgrave Macmillan.
publications-files/GIST_Body_of_Knowledge.pdf
32 Introduction