Professional Documents
Culture Documents
7th Edition
Eider Press
\Vhite Bear Lake,
M innesota
Et'rata.1.l.1.Stn1c1or resourm. aud othet helpful l.ttfonnation nbo\1111-t.is
book may be found ai
b1ip:/Jwww.paulbolstad.net/gisbook.b1ml. or at ,vww.paulbols1ad,org
Copyright (c) 1022 by Paul Bolstad, Steven Manson. all rig.his reserved.
We nms.1 thank many people for their contr.ibutioµs to this book. Our aim is
the one that Paul bas held uuwaveringly to for years, namely to provide a
readable. thorough. and affordable intr0duc1ory text on GIS. Although
gratified by the adoption of the book ar more titan 440 coUeges and
universities. there is room for improvemen1 with each edition. and the
theory and practice of GIS evolves. Reade.rs have offered corrections.
helpful suggestions, and enthusiastic encouragement, and for these we give
lbanks. They have led to improvements in this edition.
Finally. this project would not have been possible save for the
encouragement and fori>earance of Rolly. Sam, and Sheryl. and the support
of Margaret. Steve would like to thank Paul for kindly inviting him on
board this great publishing adventure and, additionally. thank his family for
their suppon and his sn1dents for continuing 10 teach him things.
While many have helped in ~us edition. we've surely left many
oppornmities for improve.men1. If you have comments to snare or
improvements to suggest. please send them 10 Eider Press. '.!303 4lh Street.
White Bear lake. MN 55110, pbolstad@gmail.com. Or
smmanson@g.maiLcom.
Perhaps !lie most useful are !lie book figt1tes. made available in
presematioo-friendly fom1a1s. MoSI oftlie figures used in the
book are organized by chap1er. and may be doMtloaded and
easily incorporated into common slide presenta1io11 packages. A
few grap,hics are not presem because they were used by
pennisston. we do not hold copyriglu. and could uot obtain
pem1ission 10 distribute them.
1 An Introduction to GIS
lntrodutction
Geography llas always been important (Figure 1-1). Spa1ial informa1ion bas a gre,11
to humans. Stone.age hunters anticipated impac.t on our lives by helpLng us produce
1he loca1iou of lbeir quarry. early explorers 1be food we eat lbe energy we bum. 1he
lived or died by 1heir knowledge of geogra- clothes we wear. and 1he diversious we
phy, and currenl societies work and play eJIJOy.
based on lheor unders1aoding of who Because spa1ial infonna1ion is so
belongs where. Applied geography. in 1be important, we have developed tools cal.led
fonn of maps and spatial infom1ation, has geographic infom1a1ion systems (GIS) to
served disco,,e,y, pla.CU1ing. cooperation. help us organize spatial da1a. We will use
and conOic1 for a1 leas11he past 3.000 years 1he abbrevia1ion GIS 10 refer 10 bo1b singu-
2 GIS Fundamentals
lar. system, and p.lural, systems. Some GIS A G!S is n compurer·based sys rem 10 aid
components are purely technological: these ;,, 1/,e rolfection, moimenouce. sromge.
include space-age data coUectors. adv~nced aualysi.s, ourp111, n11d distribmion ofspa-
communications networks. and soph1sucated tial iuformn,;011.
computing. Other GlS componeo1s are very
Wbcn used wisely. GIS can help us live
simple. for example. a pencil to acid-verify heald1ier, wealthier, and safer lives.
a paper map.
Each GlS user may decide what fearures
Key 10 all definitions of a GlS are and attributes are important. For example,
"where" and "wha1.•· GlS record the abso- forests are good for us. They may prole<:I
lute and relative location of features. as well water supplies. yield wood, harbor wildlife.
as the propenies and anribmes of those fea: and provide space to recreate (Figure 1-2).
hires. Mount Everest is in Asia. Timbu\...1U 1s We are concerned about the level of harves.1,
in Mali. and the cruise ship Titanic is at the the adjacent land use, pollution from nearby
bonom of the A1lru11ic Ocean. A GlS quanti- inch1s1ries. or where forests burn. Infonned
fies these loca1ions by recording their coor- mana~ement requires knowledge of all these
dinates. numbers that de.scribe position on related factors and. perlrnps above all. the
Ear1b. The GlS ,nay also note the height of spatial arrangement of these fac1ors. Buffer
Mount Everes1. the population of Timbuktu. strips near rivers may protecl water supphes.
or the depth of the Titanic. and many other clearings may prevent 1he spread of fire. and
defming characteristics of spatial fearures. polluters upwind may hanu our forests. A
GlS helps us analyze tl1ese spa1ial internc-
What Is a GIS? tions. and is also particularly useful at dis-
playing spatial da1a and analysis. A GlS is
A GlS is a 1001for making and using often the only way to solve spatially-related
spatial infomuuion. Among the many defini- p1oblems.
tions of GlS, we choose:
Figur(' 1· 2: OlS ~llow us to aoalyzc imponaot geognivbic features. The ~tcUitc ~ge at 1be tentc:T show&
a forec,tcd lll'Ca in \V«:tcm 01'<l,Oll. with a patcb\\-otk o(lakes. for«';S'. c:1~1~. alptn~ zoocs. and dcsnu. A
GIS in:t>· aid in a1Juring 1n.1st1un11.blc rccre2tion, timber h,:n•ett, awito1unc11t11.l pro1cc11c11t, 11od other bellefits
(co1.1t1ct.) · NASA).
Chapter 1: Introduction lo GIS 3
Figure 1·3: The cn,·i.romneotal impac~ "T0U£bt by b\Uurutt ba,·c accelmttcd in man~· p,atts 0(1bc world
d · lhc .st ccnnU'\l. Tltcse ~ tellitc lnui;es f'roin 2000 (up~ le~) 10 2013 (lower ngl.,;.1t) show.,
,~'ln1J.!1Sc., du~ 10 die o,-cn~ or "':1.tc:r. Oi\'cnion (or urig.,ttoo has cld1royed II f!C:~t fi~h(b'Y• 1bc
«onomic ~~ for many 6Cas-idc co1rnutm1tacs.. (HS n13)' be used 40 docw1K"nl cba.ngc. nubg~tc mage.
aud cffectinl)•maoagc out natural l'C.$0Ute" (courtesy NASA).
4 GIS Fundamentals
f igurt- 1-4: GIS 1111)' aid ia:i dis:utC':t' 11uclitnm1 :.nd rcco,•ay. This s:ucllilc i1uaic Jhows the C11mp (ll'e,
wb,cb con, wucd 1uOS1 of Par.idisc, Califomi1', in No,·anbcr 201S. Emcr,a1cy response o.l)d 1,c:m,gcr•tcnn
rcl.>ui.lding efforts may be impro,·cd by "}Mti11l cbtii collectioo and ru~1ly11s (courtesy NASA),
an.,lyzing species range and migration pat- and access these stores? GIS are also used in
terns. relative 10 human land use. GIS use is hundreds of other business applications. 10
manda1ed in other endeavors. including route vehicles, guide advertising, design
emergency senices. Oood protection. disas- buildings. plan construction. and sell real
ter assessment and management (Figure 1- estate.
4), and infrastruceure development. The societal push 10 adopt GIS bas been
Public organizations have also adopted complemented by a 1ecbnological pull in the
GIS because it aids in governmental func- development and application of GIS. Thou-
lions. Forexample. emergency service ve.lti· sands of lives an<! untold wealth have been
cles are regtllarly dispatched and ro111ed lost because ship cap<ains could 1101answer
using (ii$. E9 1I GIS matches the caller's the simple question. "Where am r?"
address to the nearest emergency service Sh\· Remarkable positioning technologies. gener-
tion. a route is generated based on the street ically known as Global Navigation Satellite
network and traffic, and emergency crews Systems (GNSS). are now indispensable
dispatched in a fraction of the pre-GIS times. tools in commerce, planning, and safety.
Many businesses adopt GIS for The tecluiological pull has d:eveloped on
inc-reased efficiency in the delivery of goods several fronts. Spa1iaJanalysis in particular
aod services. Retail businesses locate stores bas been helped by fasier computers with
based Oil a numbero(spatially related fac- more storage. an<i by the increased intercon-
tors. \Vhere are the polentiaJ customers'? nectedness via mobile.networks. Mosl real-
What is the spatia.l distribution of competing world spatial problems were beyond the
businesses? Wher.e are potential new store scope of all but the largest organizations
local ions? \Vhat are traffic flows near cur- until the 1990s. GIS computing expenses are
relll stores. and how easy is ii to park near becoming an afterthought as costs decrease
Chapter 1: Introduction to GIS 5
Figm·e l ·6: I.mag" hlk~ fi-o1n aircrn.Aand s.atdlitcs (left) pro,idc a rkh~c of dat,i. which 1nay be
in1crprc:1cd :tiJd co11,·cncd 10 i.u(onu111ion t1bot1t 1hc Ear'lh's wrl'11cc: (ri~1t).
6 GIS Fundamentals
the scenic and recreationaJamenities avail· lake quality. and 10 ensure compliance witb·
able in Oneida Coun1y. The seasonal influx out undue burden on tando,.,ners. The
ahnos1 doubles the total co1u11y population county uses GIS teclu1otog)' in the mainte-
e.ach summer. nance of propeny records. Propeny records
include infonnation on the owner. tax value,
Population growth has caused a boom
in constmction and threatened 1he lakes that and any special zoning considerations. The
draw people 10 !he county. There is a bigb county uses these digilal records when creat-
number of nearshore houses. hotels. or busi- ing parcel maps: processing sale. subdivi-
nesses. $e,epage from septic syste,ms, nmotf sion. or 01her parcel trn..nsac1ious: and
from fertilized lawns. or erosion and s.edi· integraling ne.w cL1ta such as aerial or boat·
ment from cousiruc,tion all decrease lake based images to help detect property
water quality. J_ncreases in lake nutrients or changes and zoning violations.
sediment le-ads to turbid waters, reducing the GIS may also be used to administer
beauty and value of1be takes and nearby shoreline zoning ordinances. or 10 no1ify
propenies. landowners of routine tasks. such as septic
In response 10 this problem. Oneida system maintenance. Northern lakes are par·
County implemented a Shoretand Manage- tiet1larly susceptible to nutrient pollution
ment GIS Project. Tltis project helps protect from oearshore septic systems (Figure 1-9).
valuable nearshore and lake resources. and Timely maintenance of each septic system
provides rut example of how GIS tools are must be verified. The GIS c.10 automatically
used forwater resource management (Figure identify owners out of compliance and gen-
1-8). erate an appropriate notification.
Oneida County has revised zoning and GIS has helped the U.S. Fish and Wild-
other ordinances to protect shoreline and life Sen•ice manage the recovery of 1he Gray
• •• :, • ••
-
? •'
•
" ..........
,ti , ...........
• •
•..
•
.. ,··
• ........
•
••
•
•
• •' ••
.•
•••
"" ..... •
CJ
' •
••
••
•
•• ••
• •. . •
'
••••
•
••
•
• •• •• •
.... - .
.~ 't . .,r;,,;
.
' .... .
~,
• •••
• •• •
..
~,
• •
•
..••. " '
Figure 1· 8:: Pated llUoru,:uion cntcl'<"d iu a OJS may subs-1autUIJly improve go, -ai1mtt1t kf'lices. Hi:tt,
imn~cs of dte $borclitic t11kcu fron\ bike \'Jlt)J11gc points MC" combined wilb digi111l mnps of 1bc shOTt"linc,
buildings. aud ~r«I bou.nOO.ries, The imngc ind~ lower lcfl WM obtained from tbc 1oc.a.tiou s.bown H-'
light dot nc:l!f the ccnic:r of the 6,gtlR: (courtesy Wiscoru..in Sc" Grant institute and UCOF).
8 GIS Fundamentals
.
,( ,.... *........ .
<( ... ..
•·-·
./} Cl
,J, .....
Cl
Jli-, ....
;,,,,..=;;-----'-.;..,J: ,_.._,. ~ I
....
• ·-'•"" I
-.~ '
Flgua-e 1·9: G(S may be used co ,tr-:amline 1;0,·enuncn1 function. Herc. septic systems not compliaot
wit~ pollmioi:i Pl'C''«uiou ordilllUlCcs att idc:ruific:d by 1.iaJit ei~lc:s (courtesy \Vl$C<>m-iu Se:a Gr\tut
l11.s11tu1c :ind UCOF).
,;;;._-------~'-
Figure 1- 11: Wolf recovery im'Oh'<• tn:u)quil·
iz:ing aod fi11in,: wok« with tracking c:C)ll.llf'S.
sulfur dioxides. carbon mouoxide. ozone.
and small panicles from oil. gas. coal. and
wo<Xl combustion. Primary sources are
These PfO\idC" detailed locatiou and u>0,·em<n1 power generati01l factories. and iransporta·
dM.a. a.id a bcttct undmt11ndi.nJ ofwolf habitat tion (Figure 1-13). Small particles Lodge in
rcquircmaitt (courtc$)' NPS).
the lungs. causing inflauuuation and reduc-
ing Lwtg func1ion (Figure I· 14). Alveolar
using lhe older. radio-based technologies macrophages attempt to isolate Ibis material.
they replaced. but air pollution levelscommonly exceed the
Lung's capacity for self-cleaning. Damaging
Scientists at lhe Voyagetus Wolf Projec1 particle concentrations are typically higher
have been tracking wolves 10 bener under- in urban areas, or near traffi~, power plants,
s1and !heir behavior (Figure 1-12). Part of and other pollution sources. GJS helps map
wolf recovery and de-Lis1ing may include concentrations. idemify sources. and plan
bu ming and ttapping seasons iu some areas.
- -
. .., .;..•• ..,. • "", !~>.. _. .~~
Legend
..
. .~· ., ..· ·,..,...
-1. .. , ... ~· ..,i:.
·. ......
... .~\.it.
. .... . ... .
,•,,
, ...;,;':-. 'lo:,• ,'':' • .,,,_.
"·...........
WOif ~iol~ and >locne Ral'lge:S
o Pac~ t • Pack2
Koodu>ching
St.ate
.
.. .
. ' .,.,...,,
:t;·"
~ • Pack 3 o Pacl\4
Forest Voyageurs
-· Nalio~I Park
- ......
.,
••• i.,·••• •
. . w'•i ' ..·· ~.. ... ,-.• .
~
-. . . . . -.
.. c. .•
I
..._ AA • • •
.,, _ -
~
,•'i,: ..... , , \ ••
•
•
• ... -.. ..... ·.·t '!: ••t- . ... -·
·\·. , .. ..... . .. \ .'
• • I ~
_. •
'"' • I I ,; • • • •
~ ':
• ..•
...~. •"<'. .·,
·~·! ,.
... •
I • • • •
.:- ' .
.. .. ....,.. . \
Figul'(' 1· 12: Spati.al d.atn. ~uch a$ 1he l'C'Cordcd p;u:k loc:ationi. and borne rang« (cirdc$ aod polygon,.
tup«tivcly}. ~ay be combined to hcl"p uudmtand bow bc-$110 UWtage the Gray Wolf(couricsy \Oya-
icun \\'t>lf Pro,cct}.
10 GIS Fundamentals
0 s 10 15 20
12
~
...v,ro
-~ 10
1)'
•• •
• •
•
.,•
-c
ro 8
(.)
Q) • •
••
•• •
(•
•
•
•
•
•
•
,,' •
•
..., t•r•
c. • lo
x
UJ 6
•
I
••
•
t··
•
(
• I,• •
$
.:. •• \
~ • •
•• •
•
• •••
....J
c
Q)
4
•
• I
•
$. ••
.....
•
••
•
•
i•¥••~• I • •
•
v, • •
ro
Q)
•
~ 2 • I
(.)
c •• •
• •
0 2 4 6 8 10 12 14
Decrease in fine particle air pollution (11g/m3)
Fig\111' 1-15: Scientists al NASA h,a,-c dc..,.doped rucll.ods to map ~irpollution .iaoss .:ontiocnts on
a daily basis~ which may b¢ anragcd 10 cstllnatc chronic cxpos:un: (lop), Tiic« spatial da1a may be
combined \\~i1h snidic.s on hl1.n:ui rcspoitk to air pollution {bo(tom) :md lite loc:mon o( ,.,l1i1et11blc
populn1ioi1s 110 itnpro\'c public bultb and reduce medicill coses (c01111c,)' EPA. USA).
12 GIS Fundamentals
sources. Some areas are particularly prone to Additional work has focused on air pol-
high concentrations due to surrounding high· lution a1 greater geog;apbic detail. in pan 10
lands. e.g.. the Ce111ral Valley of California better quantify aod manage individual e:<po·
or Salt Lake City, Utah. sure and risk. Dr. Julian Marshall and collab-
orators a1 the University of Minuesota have
Work by heal,lh scientists bai identified
the impacts of afr pollution on target popula- developed systems 10 sample p<>llutant con-
tions. Increased rotes or asthma, lung dam· cen1rations at very fine spatial intervals.
age. and death observed in smaller sn,dies or towing an air sampling system behind a
individual cities can be expanded to broader bicycle through a rnnge of trnffic densities.
areas through the combination or data in road types. and neighborhoods (Figure 1-
16).
GIS. For example. combining health and
population data with satellite exposure Satel.lite positiooiog was syncb.roniied
records has helped e.s1ima1e the increase in with video and air samples, and these com.
life expectancy with a decrease in air pollu· billed with spatial data on road networks.
tion. Legislation passed in the 1970s resulted popu•ition density. land use. and other fac-
in a measurable improvement in air quality tors. Sta1istical models were then developed.
across the United States. Progress bas been These allow detailed estimaies of pollutant
var~1ble across the countcy. witb some popu- conce111rations. even down 10 the individual
lations seeing laiger reduc,tions. Scientists street (Figure 1-17). Such estimates may in
measured the decrease in death mies in con>· tt1m help reduce air pollution. plan bicycle
par-able populations. and es1ima1ed an a\ er- 1
or pe<lestr•,n corridors. separate the pollut-
age 2-year increase in life span for each ant loadings due to cars vs. trucks. buses or
10 µg m3 reduction in exposure (Figure I· other large vehicles, and manage llllffic or
__
15. bottom). infrastn1cnire to rech1ce b\unan exposure.
.,
__
_
--·
--·
,_,
,.._
-
- Ot--
F1gure l · 16: Tow·~blc 1;.amplen help measure air poUution (or i.11dividual W'Ccts. al \'arious tmtlie dcnsi·
1ic:s and types (co11:nc:sy J. M.,rsbll1J).
Chapter 1: Introduction to GIS 13
'
A
Flgurt t .. l 7; fine-<ktailcd &~rial e,:tinJ.11te$-
of partic:ulru:c air poUu1auts (cow1«)' J. ).far·
shall).
14 GIS Fundamentals
GIS Components
A GIS is composed of hardware. soft- often repealed over tens of millions of1imes.
ware. data. humans. and a set of organiza- corresponding 10 each space we analyze in
tional protocols. The developmenc and our area of interest Even simple operations
integration of these componeuts is an itera- may lake subs1an1ial lime on geoeral-pur-
tive. ongoing process. The selection and pur- pose computers when applied to large areas.
chase of hardware and software are often the and complex operations can be unbearably
easiest and quickes1 steps in creating a GIS. long-runniug. While advances io compu,e
Data collecrion and organization. persorn1el speeds during lhe pas! decades have subs1an-
development. and 1he establishment of pro- 1ially reduced the time required for most spa-
tocols for GIS use are often more difficult lial analyses. computaiion times are still
and time-constuufog endeavors. long for a few applications.
Disuioo1ed computing is increasingly
Hardware for GIS possible wid1 faS1er and broader i111erne1.
cellular. and wifi networks. Large data sets
A fast computing. large data storage may now be stored remotely and! served over
capaci1ies, and a high-quali1y. large display cellular or wifi networks. and computations
fom, 1he l~1rdware fom1da1ion of nios1 GIS. may be migrated 10 local or cloud.based
although the computing and storage are sen•ers (Figure 1·18). This ne,,vork i11tegra-
increasingly distributed across network and 1ion often brings added comptexiiy and new
0010 the data processing cloud (Figure 1-18). layers of software. and organizations must
Fast compu1iog is required because spatial weight 1rade-offs between local and distrib-
analyses are often applied over large areas uted GIS sys1ems.
at higl1 spatial resolmions. Calculations are
stand-alone
GNSS/GPS
CIO<Jd
cellular computing,
data storage
network
_ .i \
-
. i ((<s
~
...·· \~ 8'
local
network ' Bluetoo1h Q
phone or field
wiri
server
=-
oomputer with
GPS/GNSS
......
'·,\
\
\
portabl& oomputOf
desktop computer
............
Figul't' 1- 18: ors :ire u-"Cd i11 "nu1gc of co11lpuli11J cm·i.r(11uneut1 and nctworl: configunitioni, (f'Otll s1ia:nd·
.alone $)·i.tcrns to li:illy clood-bn.scd compt~iog t.nd &tn .s:torngc.
Chapter 1: Introduction to GIS 15
While muich hardware used iu GlS is and extrnct the information we require. and
general-purpose and adaJllable for a "ide produce tlte materials 10 communicate the
range of tasks. !here are also specialized infom,arion we have developed. GIS soft-
hardware components designed specifically ware pro\lides the specific tools for some or
for spatial data collectio1L GIS require coor- au of these tasks.
dinate data 10 define geographic features. There are many public domain and com-
such as roads.. rivers, and parcels. GPS/ mercially available GJS software packages,
GNSS. field cablets, and other speciali2ed and many oftbese packages originated at
equipmem. described in Chapters 4 an 5. aid academic or government-funded research
in data entry. Recent increases in wireless laboratories. The Environmental Systems
communications substantially improve data Research lnstirute (ESRI) li11e of products.
input. allowing us 10 connect these mobile including ArcGIS. is a good! example. Much
devices to existing GJS while in the field, of the foundation for early ESRI software
improving data entry via computers, tablets, was de,·eloped during lhe 1960s and 1970s
phones. and GNSS/GPS units. at Harvard Unjversity in the Laboratory of
Computer Graphics and Spatial Anal}~is.
GIS Software Alumni from Harvard included these in
commercial products. and ltave developed
GIS software provides the tools 10 man- additional methods and integrated new aca-
age, analyze. and effectively display and dis- demic research in the five decades since.
semina,e spa1ial information (Figure 1·19).
GIS by necessiiy involves the collec,ioo and
manipulation of coordinates. We also must Open Geospatlal Consortium
collect qualitative or quantitative infonna- We wiU briefly cover lite most common
tiou on the uouspa1ial attributes of geo-
GIS software. bm first wish 10 introduce the
graphic features. We need tools to \~ew and Open Geospatial Consortium (OGC). Their
edit these data, manipulate them lo generate effons have eased sharing across various
GIS softwares and computer operating sys- source software, the original offering had
tems. Standards tor data formats. documen- limited capabilities. \Vi1b an average of
tation. program il!lteractions. and approxima1ely 1wo updates a year sinee
transmission have been developed and pub- 2002. QGIS provides a large number of
lished (""'w.opeogoospatialorg). and lists basic GIS display and analysis functions. An
of s1andards-complia111 software compiled. interface has been developed whh GRASS.
These common standards ease community anotheropen.source GJS wilh complemen-
adoption and reduce bllrrie~ 10 switching 1ary an.,ly,ical fimctions, bur lhar tacks as
among softwares. Compliance with the stan- straightforward a graphical user imerface.
dards is a plus from a user's perspecti\'e.
GeoMedia
ArcGIS
GeoMedia and related produc1s are 1be
ArcGlS is by some measures the most popuL,r GIS sui1e from Hexagon Geospatial.
broadly deployed GIS. and provides a near- GeoMedia offe~ a complete set of data
comprehensive set of gooprocessing func- entry. analysis. and ou1pur 1ools. A compre-
tions, from data entry Lhrough analysis to he.nsive set of editing tools may be pur-
mos, forms of data output They provide chased. including rbose for auto1na1ed da1a
desk1op. emerprise. and cloud-based ver- emry and error de1ec1ion. da1a developmem.
sions. ArcGIS supports multiple da1a 1ypes data fusion. complex analyses. and sophisti-
and stn1c1ures. and literally uiousands of cated da1a display and map composition.
possible operations 1ba1 may be applied 10 GeoMedia also provides a comprehen-
spatial data. Subs-tantial training is required sive set of tools for GIS analyses. Complex
10 mas1er the full capabilities of ArCGIS. spa1ial analyses ,nay be perfonned. includ-
ArcGIS provides wide fle~ibility in how ing queries. for example. to fiud features in
we conceptualize and model geographic fea- 1he database that match a SN of conditions.
tures. For example. elevation data may be and spatial analyses such as proximity or
stored in at least four major fom1a1s. each overlap between feanires. World Wide Web
with attendant advantages and limitations. and mobile phone applications are well sup·
There is equal Oec<ibili1y in the methods for ported.
spa1ial dara processing. This broad array of
choices. while responsible for the large
investment in time required for mastery of ldrlsi
ArcGIS. provides subs1amial aoalylical ldrisi is a GIS system developed by the
power. Graduale School of Geography of Clark
University, in Massachusetts. Idxisi differs
QGIS from the previously discussed GIS software
pac.kages in that it provides botll image pro·
QGIS is an open.source software proj· cessing and GlS functions. Image daln are
ect. an ini1iati\le under the Open Source useful as a source of informa1ion in GIS.
Goospa1ial Foundation. TI1e software is a There are many specialized software pack-
collabora1[\le effort by a co1111nmt.ity of ages designed spe.cifically 10 focus on image
developers and users. QGIS is free. stable. dato collection. manipulation. aod output.
changes smootl1ly 1hrougb time. witl11he Idrisi offers much of tl1is fimctionaliry while
source code available so that it can be also providing a large suite of ipatial data
ex1ended as needed for specific tasks. 11 pro- analysis and display functions.
vides a graphical user interface. suppo11S a A suite of tools for Earth system model-
wide variety of data types ru1d fomt.ats. ru1d ing has been developed on u,e Idrisi plat-
nins on Unix. Ma.cOSX, and Microooft Win· fonu. and combined in 1be Ten-eset software
dows opera1ing s~'Slems. As wi1ll most open- system. Ftmclions include land change mod-
Chapter 1: Introduction lo GIS 17
eling, habirat and biodiversity modeling, and development Bentley includes a general set
climate change adaptation. of 1ools, including field data collection. pbo-
tognunmetry. sophisticated map co111posi-
1ion, database management, analysis, and
AUTOCAD MAP 30 reporting. Bentley products are partit11larly
AUTOCAD is the world's largest-sell- focused on the built en,•irornment. including
ing computer drafting and design package. road, building, utility. and o'lher large con-
Produced by Autodesk. Inc. of San Rafael. stniction design. plaDlling, a11d management.
California. AUTOCAD began as an engi- Bemley also suppons industry-specific
neering dral\i ng and pru11ing tool. A brood tools, including mining and power genera-
range of engineering disc.iplines are sup. tion systems and networks.
ported. including su"•eyu,g and civil engi-
neering. Surveyors have traditionally Spatial R, Python, an.d GOAL
developed aod mainlained the coordinates
for property boundaries. and these are Geueric programing. pr«essing. and
among the most important and often-used statistical analysis tools may be combined 10
spatial data. AUTOCAD MAP 30 adds sub- provide most GIS functions~ and inc.lude
stantial analytical capability 10 the already newer analytical methods not available in
complete set of data inp111, coordinate common commertial packages. R is an
manipulation., and data output tools provided open-source software project with many spa-
byAUTOCAD. tial packages. These support a rich set of
spatial operatioos. particularly for spatial
estimation. Python is a general-purpose pro-
GRASS graming language with several available
GRASS. the Geographic Resource spatial libraries. Notable among them are
Analysis Support System. is a free. open- Shapely, Geopaudas. and pySAL. containing
source G IS that runs on many platfonns. The a large set of spatial functions. GDAL is a
system was originally developed by the U.S. standard set of spatial input/output and data
Anny Construction Engineering Research processing functions. which! may interface
Laboratory. Dul was discontinued by the mil· \\1th both Rand Python. Together, these
itary. and taken up by an open-source project tools support sophisticated OIS anal)~is.
to maintain and enhance GRASS. The soft- This re\~ew of spalial data software is
ware pro\~des a broad array of raster and incomplete. There are many other software
vector operations, and is used in both tools available wh.icb provide tmique. novel.
research and ~ppLicatioos worldwide. or particularly cle\'er combi:nations or geo-
Detailed infonna1ion and the downloadable processing functions. Maptixude GIS. Wbi1e-
software are available al box GAT. Micro1J.111ges. Sn1'111world.
hnp:/lgrass.osgeo.org. Manifold GIS. ILWIS, Map'Window, PCI.
and gvSIG are just a rew additional software
Bentley Map packages with spatial data capabilities. In
addition, there are thousands of add-ons.
Bentley Systems has developed spatial special-purpose roots. or specific modules
an.1lysis software for mobile device through 1ha1 CO!llplemem tltese products.
enterprise levels, with a strong focus°'' flex-
ible. integrated infrastnicrure design and
18 GIS Fundamentals
GIS in Organizations
Allhough new uws often focus on GIS GIS are often employed as decision sup-
hardware and sofnvare components. we pon loots (Figure 1-20). Oa1a ar~ collected.
must recognize !hat GIS exist in au iosti1u. entered. and organized into a spatial data.
1iona1contexi. Efl'ecth•e use ofGIS requires base. and analyses perfonned to help make
an organization10 support various GJS ac.tiv- specific decisions. The resuhs of spatial
ities. Mool GIS also require trained people 10 analyses in a GIS often uncover 1he need for
use them. and a set of protocols guiding bow more data. and there are often several itera-
the GIS will be used. The institutional con- tions through 1he collection. organization.
text detem1ines what spatial data are impon- analysis, outpul. and assessment steps before
ant. how these da1a will be collected and a final decision is reached. It is imponant to
used. and ensures that the results of GIS recognize the organizational strucrure with.in
analyses are properly interpreted and wltic.t11he G!S will operate. and how GlS
applied. GIS sbar~ a common cba.racteristic will be imegrated into the decisi.on.mak.ing
of many powerful technologies. If not prop- processes of the organizatio,~
erly used. GIS may lead to a significant
waste of resources. and may do more harm
than good. The pr,oper instimtional resources
are required for GlS to provide all its poten-
tial benefits.
Define
prot ocols
-
Decide and act
Collect and
\ edit spatial data
Flgurt 1· 20: OlS e,i;.is:t in an Ulititu1ional co11tex1. Effective use of GlS depends ou a set o(
protocols and aa1 Ulle;ration into tllC d:ua eollc<tioo. anaJysis. ckeision. and actioo loop o(.au
orpni2.'lliou..
Chapter 1: Introduction lo GIS 19
One first question is. "What problem(s) decision support tools. Lhe effe.ctive use of
are we to sol,,. with the GIS?" GIS add sig- GIS requires more than the purchase of hard-
nificant analytical power dirough the ability ware and software. Trained personnel and
to measure di.stances and areas. identify prolocols for use are required if GIS are to
vicinity. analyze networks. and tlirougb the be properly applied. GIS may then be incor-
overlay and combination o( different infor- porated in the quesrion-<ollect-analyze-
mation. Unfortunately. spatial data develop- dec.ide loop when solving problems.
ment is often expensive. and effective GIS
use requires specialized knowledge or train-
ing. so there is often considerable expense in The Structure of This Book
constructing and operating a GIS. Before This book is designed 10 serve a semes-
spending this time and money. tltere must be ter-long. 15,week course in GIS at the urti-
a clear idemification of the new questions versity level. We seek to provide the relevant
that may be a.nswered. or the process, prod- information to create a stro11g basic founda-
uct. or service thai will be improved. made tion on which 10 build an undemanding of
more efficient. or less expensive through the GIS. Because of the breadtb and number of
use of GIS. Once the ends are identified, an topics covered. students may be helped by
organization may detem,ine the !e,•el of knowledge of how this book is organized.
investment in GIS that is warranted. Chapter I (this chapter) sets tlie stage. pro-
viding some motivation and! a background
Summary for GIS. Chapter 2 describes basic data rep·
resentations. It ueats the main ways we use
GIS are computer-based systems that aid computer.; to represent perceptions of geog-
in the develolJlment and use of spatial data. raphy. common data structu,es, and bow
There are ma:ny reasons we use GIS. but these structures are organized. Chapter 3
most are based on a societal push. our need provides a basic description, of coordioa1es
to more effectively and efficienlly use our and coordinate systems. hov•1 coordinates are
resources. It also responds to a technological defmed and measured on the surface of the
pull. our inre,est in applying new roots to Ear1h. and conventions for converting 1hese
previously insoluble problems. GIS as a measu.remenls to coordinates we use in a
technology i, based on geographic infonua- GIS.
tiou science. and is supponed by the disci- Chapter.. 4 through 7 treat spatial data
plines of geography. surveying, engineering. collection and en,ry. Dara collection is ofieu
space science. comprner science. canogra. a substantial task and comprises one of the
phy. statistics. and a number of other.;. main activities of most GlS orgru1iza1ions.
GIS are composed of both hardware and General data collection metliods and equip·
software components. Because of the large ment are described in Chapter 4. Chapter 5
volumes of spalial data and the need to input describes Global Navigation Satellite Sys-
coordinate values. G!S hardware often have tems (GNSS). a conunon teclmology for
large storage cap<tcities. fast computing coordinate data collection. Chapter 6
speed, and ability to capture coordinates. describes aerial and space-b.ased images as a
Softwue for GIS are unique in their ability source of spatial data. Most historical and
to manipulate coordinates and associated comemporary m•ps depend in some way on
at1:ribute data_ A number of software tools image data, and this chap1er provides a back-
and packages. are available to help us ground on how these data are collec1ed and
develop GIS. used to create spatial data. Clulprer 7 pro-
While G!S are defined as tools for use vides a brief descrip1ion of common digital
with spatial data. we must siress the impor- data sources available in the United Stales,
tance of the iustinnional context io which their fomiats. and uses.
GIS fit. Bee.ause GIS are most often used as
20 GIS Fundamentals
Chapters 8 through 13 treat the anatr-,is how we assess and document spatial data
of spatial data. Chapter 8 focuses on ann· qualiiy, while Chapter 15 provides some
bute data. anribute tables. database design. musings on current coodi1ions a.ud funire
and analyses using attribute data. Attributes trends.
are half our spatio! data. and a clear 1mder-. We give preference to the ~1tema1ioa.11
standing of how we stn,cture and use them is System of Units (Sf) throughout this book.
key to effective spatial reasoning. Chapters The SI system is adopted by most of the
9, 10, II. and 12 describe basic spatial anal- world. and is used to specify distances and
yses. including adjacency. inclusion._over· locations in the most common global coordi-
lay, and data combination for the ma~n data nate S}~tems and by 1110st spatial d.ata co!lec-
models used in GlS. They also describe 1ion devices. However. some English uuus
more complex spatio-temporal models. are culturally embedded, for example. the
Chapter 13 describes various m:ihods for . survey foot. or 640 acres to a Public land
spatial prediction and interpola11011. We typi- Survey Section. and so these are 1101 con-
cally find it impractical or inefficient 10 col· vened. Because a large portion of 1he target
lect "wall-to-wall," spatial and a1tribme da1a. audience for this book is in the United
Spatial prediction ~lo~vs us to ~xtend our States. English units of measure often sup·
sampling and provide mforo-,uon for plement SI units.
unsampled locations. Chapter 14 describes
Chapter 1: Introduction to GIS 21
Suggested Reading
Ballas. 0., Clarke. G .. Frn.nldin. R.S .. Newing. A. (20J 8). GJS a11d the $.oc1af Scit11c,s: Tlu1•
0,)1and Appfrcoho,1s. Londoo: Routledge.
Couvis, C..L.. Jr. (Ed.). (2001). Co11stnn11011 Geogr(lphy: Case Studit$ m GIS.. Computer
Mappmg, tmdAcrl,·,sm. Red land$: ESRJ Press.
Day. 8 .. B:runer, J.. Moser. A. (20 17). Goospatral Data and A11olys1s. S<-bastOP'ol CA:
O'Reilly.
Dent. 8 .. Torguson., J.S., Hodler, T.W. (2009). Ca,.tography, Tl,emnUc Map Druig,r. 61h Edi-
tion. N C\V Yol'k: McGraw HilJ.
Dodge, M ..• McDerby. M.. Turner. M. (2008). Geogrnpluc l'fsuoli201to11: Co,icepts. Tools, 011d
App!,u mo111. Hoboken: Wiley.
Fotheringham. S.• Rogerson. P.A. (2009). T1te SAGE Handbook of Spana! A11a{rs1s. London:
SAGE.
Ortcnc, R ..P.• Pick. J.B. (2012). Expformg tire f.lrbm, Commumry: A GJS Approad,. Upper
Saddle River: Prentice HaU.
Hankey. S •• Marshn.11. J,D . (2016). L'llld use regression models of On-road Panicula1e Air
Pollution (Paniculate Number, Blac.k Carbon. PM2.5, Particle Size) Using Mobile Moni-
toring. Em•1rom1umral Science DJtd Tedmo/og;: 45:9194-9202.
Jotutson. S. (2006). Tlte Ghost Map: the Story ofLondon~ .Uost Te.rrify1l1g Epfdemic, aud
How It· Cl,a11g«I Scie11ce. C,t,es, and rite ITorld. New York: Riverhead Books.
Kouyoumjian, V.(20 11). (;IS;,, the Cloud: n1e New Age of Cloud Comp111mg and Geo--
graphic Juformatio11 Sys tems. Redlands: ESRJ P~ss.
Kre$$e, W., Danko. D.M. (Ed.). (2012). Handbook of Gtogrnpluc ln/om,ntio,J. Oordreclu:
Spring(',:
Lawrence, P.L. (Ed.). (2013). Geospnonl Tools/or {k ban U'Oter Resources. New York: Oor-
drecht/Springer.
Mueller. T.. Sassenrnth. G.F. (20 IS). GJS Appl1cl1110,is m Agncul1111·e, f'Olume 4: Co,i.rerwm'on
Plmmittg. Boe-a R.iton: CRC Press.
National Research Cowieil ohhe National Academies (2006). Be;·ond MtJpp111g: MiJ.em,g
Nanonal Needs 1hroug/J E,1Jin11ced Oeogrnplrrc il,formntio11 Sde11ce. Was.hington O.C.:
National Academics Press.
Pe terson. G.N. (2014). GIS Cartognipl,y: A Gmde to E/fecm·e Unp D<!$ig11. Boca R.n'lon: CRC
Press.
Petra$0va, A., Hrmuon, 6 .• Petras, V•• Mit.1sova. H. (2015). Tm1g1ble Mode/mg u•11J, Ojxui
Somres GJS. New York: Springer.
Shelito, B.A. (20 12). hmr,ducuo,, 10 Geosporiol Teclmologies. NewY<irk: W.H. Freeman.
Singl«on. A.O.• Spie lnuu. S.E •• Fokh. O.C. (20 1S). Urbo11 A110(,•11r:s. Los Angeles: Sag~.
T heobald, 0 . M_ (2003). OIS Co11cepls and An:GIS Me.lfuxls. fort Collins: Conservation
Pla.uuing Technologies.
Tillman Lyle, J. (1999). Des1g11/or H11mn11 ECOl)'Stems: Lm1d.fcnpe. Land Use. and Natuml
Resoun:es. Washington: lslaod Press.
Wegmaru:i.. M .• Leutner. 8 ., Dech, $. (20l6). Remote S,msi11g and GIS/or Ecologis1:s: Using
Open Source Software. E."(eler: Pelagic Publishing.
Wise. S .• Cragli:a. M. (Eds.). (2008). GIS a11d Evidc11ce-bn.red Poirer Mnl.·illg. Boe.a Raton:
CRC Press.
Chapter 1: Introduction to GIS 23
Study Questions
1.1 • Why are we more imcrcs1ed in spatial data 1oday than 100 years ago?
1.2 - You have probably collected. analyzed, or communicated spatial data in one
way or anolber during tbe pasi montl1. Describe each of these sieps for a specific
applica1iou you have used or observed.
1.3 - Ho,v are GIS hardware different from most other hardware?
1.4 • Describe the ways in which GIS software are differem from otl1er computer
software.
1.S . What are the limitations of using a GIS? Under what conditions miglu the 1ecb-
nology hinder problem solving. rather than help?
1.6 . Are paper maps and paper data sheets a GIS? Why or why not?
24 GIS Fundamentals
25
2 Data Models
Introduction
Oa1a in a G!S represen1 a simplified ,iew ersbip. and land use. Smaller polygons and
of physical emiries - the roads. mouolains. unrecorded characteristics such as value a.re
accident locations, orotJ1er features we care not included in this representation.
about Our d~1a record spa1ial loca1ion and Adata set's spa1ial de1ail and essential
nonsparial properties. characteristics are subjecti\·ely chosen by the
Each enlity is represented by a spatio/ da1a developer. The de1ail r<quired by a sur-
object in a G!S. defining an emi1y-objec1 cor- vei•or will be differem 1bau 1ha1 for a land
respondence. Because every computer system pla(U)er. The essential characteris1ics of a for-
has limits. we can 1t save the exact geography es! would be differenl in the eyes of a logger
or all cbarac1eris1ics offearures. As illumaied 1han those of a hunter or biker. No representa-
in Fi_g,ire 2-1 , we may s1ore land cover as a tion is tmiversally better than any 01her. and
set of polygons, wi1h polygons of a1 leasl 30 tl1e G!S developer seeks 10 defioe objects dial
square me1ers.. We may record da1a lha1 support the imended da1a uses. a1 Ille desired
define each land cover. vegetation l)pe. own- level of deiail and acc,~acy.
'
. .,
•••
t
0
~ .
Flg,u~ 2-1: A p.'hytit2I cntit>' it; 1tpn«ntcd by l\ s.p:ui:i.l <>bjctl inn. GlS. Herc, fakc-s (d:u-k IIIJ'C'M in dtc pho.t()o
grnph) and otbc:r land eo,-cr types are rcprcacoted by polygon, in 1:be data layen on 1bc rig!Jt.
26 GIS Fundamentals
Real world
Data model
J••·, . / Data
strucwre
"• x
12
y
• .,
t
- . •
•. ' 5.8
8.9
3.o
7.l
ID Meo Type
Macihine
l 16.3 PUB code
2 7.9 PEM
3 12 1 8 u 1001 1101
00110110
4 10 l PVS 10110100
Flautt 2-2: Lends ,o( abs:IJ':lctiou in 1hc r¢p~sciuation o( spatjaJ C11ti1i«. TI1c l'Cal world is rcp~tcd in
rucccui\•cly 111<nc m:.c:11inc<o1rq,:uib1c bu1bwna..nlr obicutt (on:nt.
A sparia/ data model (Figure 2-2) may tive data layers. Through analyses we may
be defined as 11te objects in a spatial da1aoose combine da1a 10 create a new data layer: for
plus the relationships among 1hem. The 1erm example. we may identify areas that have
''model.. is fraught with ambiguity because it high elevation and join this information with
is used in many disciplines to describe many 1be soils daca.
things. Here. a spacial daca model provides a
formal means of repre.senting and manipulat-
ing spatially referenced infonnation. In Fig·
ure 2· l. our data model cousiscs of a se1 of
polygons recordimg 1he edges of distinct land
uses, and a (not shown) set of variables asso-
ciaced wi1b each polygon. The da1a model is
the most recognizable level in our computer
abstraction of the real world. Data structures
(bow we organize the infon11a1ion in the
computer) and binary machine code (how
we record it) are successively less recogniz-
able bm more co1npu1er-compatible fonus or
the spatial d.11a (Figure 2-2).
Most GlS store our dam as a set of lay-
ers (Figure 2-3). Each layer organizes 1be
spatial and attribute data for a kind of carto-
graphic ol>jec1. and are often referred to as
thematic layers. An example GIS dacabase
might inc.Jude a soils data L1yer. a population
data layer. an elevation data la)>er. and a fla:utt 2..J: Spatiaidllta 11tt often stOttd llU q>Mlte
roads daca layer. The roads layer contains tltc:matie: layc:n, with objec,s grouped b:u.c:d on :.
only roads data. including the location and 5Ct of prope11i«, e.g.. w~tcr, ronds. o:r faod cover,
properties of roads in the analysis area. or some otbc:ragrced-upon set.
lnfonnation on soils, poLi1ical boundaries.
and elevation are contained in their respec-
Chapter 2: Data Models 27
Coordinate Systems
Coorrli11mes are used 10 define 1be spa- (3.2,15)
tial location and extent ofgeographic objecis 15 (8.4.10)
(Figure 2-4). A coordinate most often con-
SiSIS or a pair or triple, or nwnbers 1lul1 spec-
A""l'rbulU
ify locotion iu rela1ion to an origin. The y 10 , , , • 13'7
coordinates quantify the dislance from the ,:; J~ftt'. v.~
(13.7 1)
coordinate p.1irs, Xand Y, in a Canesian 5
coordinate sys1em, named after Rene Des-
cartes. the system's origiuator. These pairs
define data on a planar. two-dimensional (5 8.2.l)
surface. The two-dimensional surface is usu- 0.0\ 5 10 15
ally based up,oo standardized metliods of origin x
mapping 1be ground surface lo a flal map
surface (discussed in Chapier 3). Typically. Fi&ot•t 2-4: Coordinate and attn"butc data arc
a11ribu1e daia complemeo111te coordina1e used 10 rcptescm c:iuitics.
da1a for carto:gr.iphic objecis. These auribme
data record the non-spa rial components of rut
object such as a name. color. pH. or cash axes (right angle. or 90°), fomting a plane
value. Keys. Labels. or 01her indexes are used (figure 2-5). We specify a Y-axis. usually
so tl1at lhe coordinate and attribute data may
aligned at or dose 10 a nortlt-south direction.
be viewed. re la1ed. and manipula1ed and an X·axis. usually align,ed at or near an
east-west directiOtL The Y-axis is often
1oge1her.
referred to as a ,,or1M11g ,1.ris and values
Planar. 1wo-dimensional(2-D) Cartesian increase upwards in a grid north direction.
coordinate systems define two 011hogo110/ The x-axis is often referred 10 as an eosii11g
a:ris with values increasing 10 the right
0
v
0
M
0
N
-0
0
0 10 20 30 40 50 x
Flan~ l -5: ~ 2-D coordinate system dcfio" X and Y a.,es (left paocl i.u figw-c abo,-c). and ~i(y coordi·
1uuc loc:uioi11s by tbcsc X-Y pdlN. Cootdiiutc nah.1cs incn:-asc ll) ria)uwntd (X) :ud upw.t1td (Y) du'C'CliOlU,
:md lines of consll\UI X Or Y ,·:tlucs mny be 11,kd 10 ~id in loc:uion Oii innpt (tight. abo,"C:).
28 GIS Fundamentals
We must be careful when making mea- frnm -90 at 1he South Pole to 90 at 1he North
surements on our Oat. 2·0 surface. These Pole. Lines of constant la1itude are called
measuremen1s un:avoidably diston relative parallels (Figure 2·6).
locations, because the Earth's true surfac-e Geographic coordimues do 001 fonn a
approxima1es a sphere and hence is curvoo. Cartesian sys1em becaiise the meridians
We can keep errors small by limi1ing 1he c,onverge. A Cartesian system defines lines
area over which we use our flat map. As the on a righi-:1ngJe, planar grid. Geographic
mapped area ge1s larger. !he error increases coordina1es ocC11r on a curved surface. and
to amoun1s we usually can'1 ignore. Specific the longitudinal lines cross at tbe poles.
conversions from 3-0 to 2-0 coordinate sys- This convergence means the distance
tems and methods for managing dis ton ion in spaimed by a degree of loogimde varies
Oat map systems ru-e discussed in Chap1er 3. from approximately l 11 .3 kilorroeters at the
Equator. to Okilometers at the poles. In
Coordinates on a Sphere conltllSL 1he ground distance for a degree
ofla1itude varies only slig)itly. from 110.6
When we map over larger areas or when kilomelers at the Equa1or to 111.7 kilome-
we need lhe highest precision and accuracy, 1ers a, the poles. The slight difference wilh
we often use a three-dimensional. sphel'ical la1in1de is due to a non-spherical Earth.
coordinate sysrem. H_jpparcbus. a Greek something we'll describe a bit later.
mathematician of the 2nd century 8.C .. was
among lhe first IC) specify ioca1ions on lhe Convergenceca11ses distorti<>n because a
Earth using angular measurements on a degree of latirude spans a grealer distance
sphere. Our familfar Gecgn,pltic C(J()rt/imue near 1he poles than a degree of i<>ngimde.
System uses two angles of rotation. the lon- For example. "circles" with a fixed radius in
gin1de 0,). and !he la1imde ($). and a radius. geographjc units, such as 5°, are not circles
R. to specify locauions on Earth (Figure 2-6). 011 tlie st1rfate of the globe, witli di~tortion
The longitude measures east.west distances grea1es1 at the poles (Figure 2·7. left). They
around the polar Eanb axis. Zero is se1 for a may appear as circles when the Earth's sur.
line th.11 passes through England, and 1he face is ..unrolloo .. and ploned 011 a Oat map
distance angle is positive eastward and nega· (Figure 2-7. right). b1111rea1ing spnerical
tive wes1ward (Figure 2·6). Lines of equal coordinates (la1imdesllongitudes) as Carte-
lougirude are call~d meddlans. and are ori- sian coordfoates creales an inherently dis-
ented north-south_The zero longitude. aJso 1orted map. NOie lhe distorted sbape of
known as the Pn·me Men·d;m, or the G,.ee,,.. A.LHarctica in Figure 2-7. right.
wich A1eridio11. was first specifioo 1brough Because the spherical sys1cm for geo-
the Royal Greenwich Obse!\'lltory because grapltic coordina1es is noo-Canesian. pla-
they had amassed lhe best early measure· nar fommlas for area, distance, angles, and
me.uts. but the Prime Meridian bas now olher geometric properties used in a Cane-
shifted about I02 meters (335 feet) east of sian coordinate system shou.ld not be used
the Greeuwicb Oll>servatory as measure- with geographic coordinates. Are.as are
meuts improved. ·1ec1onic pla1es shiftoo. and usual!y calcula1ed after convening 10 a pro-
convention changed. jec1ed sys1em. described in chap1er 3.
Asecoud angle of rotation. measured There are two primary conventions
along north-south planes that intersect the used for specifying latitude and loogimde
poles. is used to define a la1i1ude. Lalitudes (Figure 2-8). The first uses a leading letter.
are specified as zero at the Equator. u,e Line N. S, E, or IV, 10 indica1e direction, fof.
encircling 1he Ea11h that is equidistan1 from lowed by a number 10 indicate location.
the North and Sou1h Poles. By convention, Northern latitudes are preceded by an N
latitudes increase 10 maximum values of 90 and southern latitudes by an S, for exam-
degrees in the 11011h and south. or, if signed. ple. N90°. SI0°. Lougimde values are pre-
Chapter 2: Data Models 29
U-
1
fi&"rt- l-6: Cc:m,·eu1iom wben n:fc1Ting to geographic Ultitudes Md lo11gitudC$. Mcridian!i arc lines nllUting
nortb-soutb tba1 ba,·c coustant looajrud«. ParalMs arc Imes: n.wning cast·WC61 lb:u ban: ~.Stant latitudes.
Latirudc i:s ZC':f"O Olt 1bc Equator. Loniirude is zero on tbc Ottcnwicb Mcridi.,11111,d undci1.ncd at 1.hc poles.
b,cc.,usc all longitlld-iual mcridi!l1'5 iJ1tc111a:1 1licn: (i. • ? in tbc figure).
F1;urt l -7: Gcoat9phic coordi.na.t« o" a spbcrie11I (left) aild Car1csian (ri,ht) rcprCklltati.on. Notice that ..cir-
cles'' dcfotcd by ll 5 d~c nridiwi do 1.ot forut circles on the E:uth 's SW'f:ncc nc11t 1hc polc-s, 11.t shown on tht-
sphcric.il representation (left figwc), but :ippca.r a, ci.rdc$ iu the dislorted Cn.ttcs:ian pie.I of gcogr,ip!i..ic coordi,.
11atcs (riaht), This figure illu!itratcs both that a) tbc sur!aoce disua:)Ce for a tmit of 1011.gitude changes depending
on your location on Earth, a.iKi b) 11 Cat1c.s-Uln plot o( $C03N1phic coordinates is biahly dist01tcd.
30 GIS Fundamentals
ceded by an E or W, for example WitO". Northern la1in1des are positive and southern
Longin1des range from Oto 180 degrees latin1des are negative. and eastern longitudes
east or west. Note 1ba1 the east and west positive and western longimdes negative.
longitudes meet at 180 degrees. so that Latitudes vary from -90 degrees to 90
E 180° equals W 180°. degrees. and longitudes vary from - ISO
Signed coordinates are 1he second com- degrees to 180 degrees. By this com•ention.
mon way to specify latitude and longitude. the longitudes "meel'' at the maximum and
minimum values, so -180° equals t 80".
Spheric.al geographic coordinates are
North Pole
often recorded in a degrees.minutes.seconds
Nil0 , £84<> (OMS) notation: N43° 35' 20" for 43
--"P-~·11· OC:s.,. degrees. 35 minutes. and 20 seconds of lati-
tude. In OMS, each degree is made up of 60
minutes of arc. and e.acb minute is in mm
divided into 60 seconds of arc (Figure 2-9).
This yields 60 times 60. or 3600 seconds for
each degree oflatitude or longirude. Spheri-
cal coordinates may also be expressed as
decimal degrees (DD). When usi ng DD. the
degrees take the usual - ISO to ISO (longi-
n,de) aud -90 to 90 (latinide) rru1ges. but
minutes and seconds are reported as a deci-
S62", E35 mal portion of a degree (from O to
·62' or.35.., 0.99999 ...).
SOIJ1h Pote
Con1·ersion be!IV~il DMS aud DD is
Fleurt 2-8: SpheriC;J1J coord.i1utc~ of Luimdc :lnd shown in Figure 2-10.
longitude 11.-re IUOtil often C:<prc$.SCd "' directiom11
(NJS. .E/W). or as si,p~ numbers. Latinidcs arc
positi\'c uorth, ncl*tl\•c. soutk lontitudcs arc posi·
li\'c call, m::pti,·c \'l.'cs.1.
,.•
3600
•
Sooth Pole
FiJ.Ul'f' 2-9: Tiicrc tire 360 dcgrccs io a C01n,Jilc1c cin:lc. with c.icb dcgru composed of 60 mit1ll lc1, :,,ud c.-c:!1
minute e,,owpos,:d of 60 s«onds..
Chapter 2: Data Models 31
e.Quotor equator
flatten ng
factor f
·-
O- b
0
pole pole
Fi111rt 1-11: Sf.hcricial (left) \'S. c:llipsoitbl (right) appro:c.im111io11.1. of the Earth's $b11pc.A sphere bas 3 sin·
gk 1'3dius. whi can ellipse ha.s ditTcm11 radil along the semi-ruajoraod «mi-minor axcf. The splcroid wd
cllipso1d C80 be tbouatu of u rouuioa: these two basi.c: sbapts around the polara.,:is to ttcatc solid fia:urcs.
32 GIS Fundamentals
jection. and then lists a set of ellipsoids, with f igure 2-12 shows an example calcula-
differing polar and equa1orial radii. tion of arc Jeuglh, using 1be average radius
There have been several estimates of tJ1e for Eanb. No1e 1ba1 equation (2. I) applies 10
"best" Eartl, radi~ bes1 in !he sense 1ba1 tlie a genenc arc angle. measured ill! thedirec-
1iou oftbe spanned arc. witliotn regard 10 lhe
average height of the solid earth surface is
zero when 1aken across the globe. Eratosthe- lalinide/longin,de sysiem. Subs1i11ning la1i-
nes ofCyreue provided a remarkably accu· tude values will result in a reasonably accu-
ra1e es11maic of1he Eanh's radius more 1han rate ans\\rer. but subs1itutiug longitude
2,000 ye.ars ago, but improvements in tech- values. anywhere bul along 1he Equaior will
result m an error. This error will be small
nologies allowed repealed. increasingly
accura1e measureane,us 1brough 1he Jaie- near 1he Equa1or and lal)(esl near the poles
l 900s. when the current values were due 10 longinldal conveience. The forum!~
adop1ed. 1s best used as a first approximation of dis-
tance spanning generic arcs. and not using
Today. 1be bes1estima1e for a is Jongimdinal conrdina1es unless they're near
6,378, 137.0 me1ers (m), and for b 1he Equa1or.
6.356.752.3 m. A mean value of 6,367.444.7
m is often used for spheroids. bu1 so111e1imes . N?1e 1ha1 1be angle should be specified
lhe value for a is adop1ed, 6,37$, 137 m, or m radian measure. defined as 2m radians per
jus1 6.378 km.
Convertfng degrees to radians: Also note that the arc angle between two
locations is ofte11 no, accurately de1ennined
30.1487 degrees is by subtracting their geographic coordinates.
due to the convergence of longitudes. This
30.1487 I 57.2957795 fommla is generally accurare for la1i1udinal
differences. bin only closely approximates
= 0.52619 radians longitudinal distances near the Equator.
because the lo11gi1ude lines converge ac the
poles. As !hey converge. the difference in
Converting radians ta degrees: longitude becomes less than the arc angle 9
l.284 radians is i111be fom1ula.
Thegrenr circle dfsumce fommla should
1.284 x s1.29sn95 be used for the most accurate estimates of
cbe surface distance oonvee111wo points
• 73.5678 degrees (Figure 2-14). Agrenr circle is defined by
FJaurt 2.. 13: Co1wcnion between rad.Utn 11nd
any plane that i111ersec1s a globe and passes
degree :angle unite. through its center. All meridians are great
circles. while t.be Equator is the only line of
constant la1itude that is also a gre.at circle. A
the 360 degrees. or approximately great circle distance is cbe sborcest path on
51.2957795 degrees per radian. Radian mea- the E.inh 's surface between two poims. and
sures are ru.1 altemative to degrees, and scale long-distance airline routes .approximate
the rotation by the radius of the circle. You great circles. AS wicb all trigonometric for-
may easily cooven bc(ween radjao and mulas. yon should know if your calculations
degree uni1s (Figure 2-13). Many spread- expect degree or radian measures as input,
sheet. online, and app programs by default and couven accordingly.
use radian measure. and subs1iru1iug degrees
will lead 10 errors.
The greot circle distoru::e between points on o sphere is given by the formula
d • r 2 sin 1J[(sin'<¥» • cos($, ) cos($0) s,n
2(~)]
'M'lere d ,s the shortest distance on the surface of the Earth from A to 8.
r Is the Earth's radius. opproxfmalely 6378 km, ond ~·¥
ore the cfrfference-s
betvl<?en point lotrtudes and longitudes. dNided by two.
As on example, the d1stonce between Paris. Fronce, and Seottte, USA. is:
Lolotude. longitude of Por,s. Fronce, 48 864716°. 2.349014°
Latitude. longitude of Seattle. USA • 47.655548°. - 122 JaJ20•
2
d , 6378·2 s1n 14!(s,n (0 604584))-cos(<8 8647J6)·cos(47 6555•8) sin'(62 36107)]
, 8.'043 6558km
Fla:utt 2· 14: ,Calculatiou of the _vcat circle distance bc-twcco poiuu:.
34 GIS Fundamentals
Three-Dimensional, Earth-Cen·
tered Coordinates
There is an al1ema1e. 1hree-di111ensional
(3·D) Canesian representation of coordi-
nates for locations. on or abo\'e Earth (Fig-
ure 2-15). This is commonly used in
geodesy. the scieace of1be Eanh's shape.
size, and ph)-sical dynamics d1at underpins
all coordinate measures. Geodesy is at !he
heart of map projections (Chapter 3) and sat-
ellite positioning (Chapter 5). fundamental y
building blocks of G!S.
The 3-D Canesian system typically
x
places the origin mearor at the utaS'S center of
the Earth. This Canesian system is aligned Soum Pole
with the Z axis through the geographic Nonh
IOngrtOOe, kltitudt from known 3-!> Cort~slen
Pole and !he X and Y axes forming a plane
on the Equator (F ig11re 2-l 6). The positive i~. 9;i • F ( Xc,,.Yp,Zo)
X·axis iu1ersec1s the ellipsoid where latitude 3 ·0 CortcllM from known IOll'l)(lt:. longttl.ld<
and longitude values are both zero, and !he ••. v•. z.. o o.,. e. >
positive Y-axis intersects the ellipsoid at a Fla:urt 2-16; Fonnulas ex.isl 10 eon,·e:11 bct\,-ecu
longitude of 90 aod latinide of O. known sehcrie:LI ;co , bic cootdu.l.a1cs (111.litudc
t1nd longitude on 11 $ 1eroid) :tml COTft:i])Olldiuj
Mathematical formulas allow us 10 cal- 3·0 CMc~an <: 10.,1·cs ($ce 11ppendl.'C. C).
culate any x. Y. and Zgiveu any latimde.
longintde. and Earth radii (Figure 2-16).
Each latitude/longinKlelradius coordinate in
1he geographic system corresponds to an
3-D Cartesian Coordinate System X-Y-2 triplet in 1he 3-D Canesiao coordinate
z system. T11ese fommlas are commonly used
by geodesists in the most precise surveys.
but are also embedded in many softwares
that conven between different versions of
y our coordinate data.
There are 1wo dilferelll se1s ,of equations
forconver1ing from geographic 'to J.D Car.
x 1esian coordinates. one assuming a spherical
~ origin.
X•Y•Z•O
Eanh. and a more accurate one assuming an
ellipsoidal Eanh. Ade1ailed discussion of
1hese is best leti for an ad,oancoo co,irse, so
fonnulas are uicluded in Appendix C for ref-
erence.
Chapter 2: Data Models 35
Geograph le and Magnetic North places on Earth. The compass will usually
poinc ease or wesc of Geographic North.
There is often confusion be1ween Mng- defining an angular difference called the
neiic North and Geographic North. Mag- magnetic de.clinariou. Oecli11ation varies
netic North and Geographic North do not across ct,e globe. and also bas varied lhrough
coincide (Figure 2· 17). Magnetic Nonl1is cime as Maguecic Noni, wanders.
the location towards which a compass
points. The Geographic North Pole is 1he Our geographic daca are alruoscne,•er
average nonhem location ofche Earth's axis referenced wilh respect co Magnetic North.
of rota1fon. If you were standhtg on 1he geo- Because Magnetic North wanders. is in a
graphic North Pole with a compass. it \\,.ould different direc.tionacross the globe relative
poinc approxi:macely in lhe direclion of1he co constan, lines of latimde 3nd longirude.
Bering S1rai1s. In addi1ion. Magne1ic North aod can vary substantially due 10 local
"wanders" lhrough lime. by JOOs of kilome- anomalies. it has been used historically for
ters over che historical record. and has in.field navigatfon mlher 1han as a reference
recemly increased ics nue of shifc (Figure 2- against which coordi.nares are defined. Geo·
17). graphic North is used because ic is stable
cbrougb cime. predictable. can be reproduc-
Because Magnecic North and cbe geo- ibly measured. and cied to crcller Eanh-
graphic North Pole are noc in die same place. defiued conslants.
a compass does not point towards Geo-
graphic Nonb when observed from mosc
201z
Magnetic
North
Greenland
36 GIS Fundamentals
Nominal attributes may also be images. in the second column of Fig,.ire 2.18. These
fdm clips. audio recordings. or other data are often recorded as real numbers on a
descriptive infom1a1ion. for example. GIS linear scale. Area. length. w,eigltt. beiglll. or
for real estate often have images of the depth are a few examples of attributes that
buildiugs as pan of the database. Image. are represented by intervaVl'.8tio variables.
,,ideo. or sound recordings store<! as anri- Items have a domain, a range of values
butes are sometimes referred to as "'8L08s'· they may take. Colors might be restricted to
for binary large objeCfs. red. yellow. and green: cardinal direction to
01di1111/ amibures imply a ranking by north, south. east. or west: and size to aU
tlteir values. An ordinal attribute may be positive real muubers.
descriptive. s11clt as high. mid. or low. or it The anribute type and domain define the
may be numeric; fore.'<ample. an erosion kind of operations that may be appropriately
class with values from I to 10. The order applied 10 1ba1 anribute. 11 ust,ally makes Lit-
refleclS only rnnk. and not S(ll!e. An ordinal 1le sense 10 calculate the average value of a
value of four has a higher rauk than two. but nominal attribute. e.g .. the average name for
we can't infer that the attribute value is twice a set of counties oraverage colorof an
as large. image. Mis-applying fimctioos merun for
fmermllrario 11trrib111es are used for intervaVrntio attributes to nominal orordinaJ
numeric items where both rank orderand anributes is among the most couunon errors
absolute difference in magnitudes are repre· in spatial dma analysis.
sented, for ex:ample. the number of flanges
f la1.11~ 2-1.9: An eic.runple c:onlinuous wrfocc, a depiction of light inlcns-ity from .i,n elc,;.a1io11 ,~irfocc.
CkM-sl-y solarCUCfgy~0«1cd from the ,urface is «ti.mated across the landscape as II re;a.l (intcr\'al/nitio)
munbcr; and ttndcrs an impression of , urf:tc:c shape.
38 GIS Fundamentals
We may perfonn inappropriate calcula- peratures may occur one or two times in the
1ions on intervaVratio data. sometimes quite da1a. e.g. 22.00000000 and 22.00000001
sub1ly so. Consid<r a hig)lly accurn1e 1em- degrees are considered different. 11 may be
perature measurement g.rid, with data save.d more appropriate to bin the data into 0.10-
10 1he neru:es1 0.0000000 I degrees. The degree imervals and 1hen calcula1e 1he mode.
ma1bemarlcal mode. 1he mos1 freq11en1 value One mus, bear in mind anribu1e da,a l)'J)e.
in 1he data se-1. may not be informative domain, and characteristics when manipulat-
because high precision means 1ha1 al01os1 all ing spa1ial da1a.
values have very low frequency. Most rem-
Chapter 2: Data Models 39
Vector Raster
points
y •
•
• ••
line
area
x
40 GIS Fundamentals
Points
•
4
•
I
•
ID Tower Nome
1
"""' 1010 ...
Heinh1 Formo1
•2
J
'3
•;
VNOU
m
WO. .
BBC
.., .......
55.S
,eo.o
O'A"'
l.12.J
"""" 1V
CJ0,.$<41
Lines
4 7 10 Name
6 1
2 '""'"'l>n>nch"""'
8 J
• """~""'
Stq>t F'O"tt
•'
Md1 8t'ool,-
2 M(tdMCtl-Sromt' Crttk
1 LOl"lf.R A.IT'1
• ......,. """""
Polygons
10
' _... ld1M Nome
HcdSOl'I Holl
Floors
M
R00f Type
r1cn. u:cieo 10,
2
3
•
•'
-
G.,lforc Tc,l'flOtoqy BIUcJ
"""'Sldo'
h\fnd SC.<tnc.n SdQ.
Aomn,.rn;'hOt'I
o~·c s~,, ~"''r
''•••
''
10
1, .0
•••
tw!"'<d 9(Jl, ·~,
llo'I. ~ , r l
flo1w $Nlltd IQ!'
pr!(lled 12/12, Tilt
i;t!Ct\td 6/JZ, meic
IIO'I. Relcd 10'
•'
•
Grc,,tl!clrn Towtr
61(11....,.•.,, S<.•nc.u &:-
10
••
prlched 9/J?, "lC
pr!c.llc!S i2/12. 111c
Fl.;,urt- 2~21: An u21i1p!c of tbc m<W! c01ruuon ,·ccror d.,tn model t1mct\lJ'Cs. Cicogr.,piUc fe:itutts cowist of
pom1$. Lines. orpolygo1n. "ilh ucb fe11.t\lf'C corre,]>Ooding to 3 row in" table with OJI idcnlificr (ID) and II set
of anribi.11« amycd w colulUllS.
made up of Line segments that nu, between These polygons are fonned by a set of con·
adjacenl coordinates in the ordered set. nected lines. either one line with an ending
Attribu1es in a table correspond to line seg- point that connects back to tlte starting point.
ment Cul'\'Cd linear emities are most often or as a sci of lines corutected stan-10-end.
represented a:s a collection of short, straight. Polygons have an interior region and may
line segments, although cun·ed lines are at entirely enclose other poly~ns in this
times represemed by a nia1hema1ical equa- regjotL Polygons may be adjacem to other
tion de.scribing a geometric shape. The line polygons and thus share "bordering.. or
starting and ending points are often called "edge.. ltnes with otlter polygons. Allribute
nodes. and i,nenuediate points used 10 repre- data such as area. perimeter. land cover 1ype.
sent the li_ne shape are called vertices. or coumy name may be linked to each poly-
Area entities are most often represented gon.
by dosed polygons (Figure 2·21. bonom).
42 GIS Fundamentals
Nole that there is no uniformly superior spatially distinct features are matched with a
way to represent feacures-. and we may repre· row. and the row attributes apply to all the
sent tbe same feattures as points. lines. or distinct fearures. This is common when rep-
polygons (Figure 2-22). Some feature types resenting islands, groups of buildings. or
may appear to be more ''naturally.. repre- other clusters of features that make up a per·
sented one way: manhole covers as points. ceived whole thing. These muhi -1>0rt fea-
roads as lines, and parks as polygons. How- tures beuer represent our percep,lions of
ever, in a very decailed data set. the manhole entities in some contexts.
covers may be rt'Presented as circles. and Multi-part feanrres may also be used for
both edges of the roads may be drawn and large data sets. for example. when millions
the roads represented as polygons. The best orpoim obsel'\'ations are collected a\ltomati-
representation depends on the detail. accu- cally with laser scanners. Tables: are often
racy. and intended use of 1he data set. slower to process than point geographies.
Vector layers sometimes have a many- and so reducing the table size by grouping
to-one relationship between geographic fea- points into muhi·part featwes may shorten
tures and table rows (figure 2-23). defining many operations.
,mtlti·partfeauo-es. l.n these instances. many
oreos
Fia11N' 2. 22: Tilc snmc o\,jcc.l• sully be rcprcs.c11, tcd by poir1ts, lioc$, or polygo113. depending ou,our ,icw of
and intended use fe>r tbe da.1.1.
Chapter 2: Data Models 43
Multi-port Features
2
ID Count y Area
,-0 Bristol 73.5
I Newport 286.5
2 Providence 1,121.9
3 Kent 455.6
4 Washington 912.3
ID Couniy Area
O Bristol 73.5
1 Newport 243.5
2 ' Newporl 14.8
1 3 Newport 24.8
4 Providence 1, 121.9
-
5 Kent 912.3
6 Washingt on 884.7
-
7 I Washington 27.6
-
f la:w·" 2-23: Eluunplc of uiulti-pMt and sil1Jlc-pat1 fcarutrt. H«c. cow11ics for Rhode ls.l:tud. a suite lU
dte E11$1cnt U .S.A., 11rc shown vtitli one table cuJry f0r acb c()W1l>' (top), with n1uhi p.1r1 fe111urcs in a
0
fayer, aud witll oac t2blc Clltl}' for c.1cb distllict polygon {bottom), with only s:inglc~pnrt fcati.ares in the
layer. Nore that c.alcuU11ioos. aoal.)"sis. and intcrprccation inay dilfcr for multi-part v.s. single-part fca -
"""-
44 GIS Fundamentals
- ~ ID State populot.
" lw I 231 Hawaii 11.419.461 r-~ ID State popv!at
(, 1 Hawaii 1.419,461
LJ d 6 Howou 1,419.461
7 Hawaii 1.419.461
fi&m~ l-24: Multi-g>nr1 to siogle·p:u1 c<:11\'crtiou m~ lc~d 10 cn'Qrs in 1tsbs,cquent 3:nal)'$,li bcct1Alk :ittributcs
may be copied from tbc origin:1.L multi.part chas-1,:r (left). to each siui.le·~ compooe111 (ri~1)~Or:mity.
swm. or- other dr:11\~ec:1 ,·nri.t1blcs should lrcqUC'ody be rt-cakuUl1r:d (or smgk-pan fr:a11ns. bu1-0ftcu arc ooc.
l'Ciuhing i.n m'OT$.
Chapter 2: Data Models 45
fle,u rt 2..15: Ex:amplcs of p<!lygon inchtti<nli ($idct\•t1.lk inclu.sio11 it! Bowel' bed sltO\\'tl in O and b), :and
lxn• id.,~· g1mcnliz,tic:m (C) in .i vcclor cbtn model. TI,c,e appro,ximatio1u typicnllf oc:cur 95 .1 conicqucute
ofadophng ~ \""'°' r<:presentatioo. aod their irup~ts mmi be co1uidcted wbc:n u.smg v«t<lf data.
One solu1tioo creates a polygon for each separaling different area features. for curved
inclusion. This often is 001 done because ii lines. these s1raigh1 line seg,.uenis may be
may take 100 much effon 10 ideniify and col- viewed as a sampling of1be 1me curve. and
lec,t the boundary location of each inclusion, there is typically some deviation of the line
and 1here rypically is some lower limit or seg,uen1 from ,he ·~n,e·•curved boundary.
m inimum mapping uuit. on the size of The amount of generalization depends on
objects we care to record in our data. lnclu· many factors, and should be so small as to be
sions are preseni in some fonn in many poly- u11imponao1 for any inteude-d use oflhe spa-
gon d.11a !aye!$. tial cL1ta. However, since many data sets may
Botmdal), geuemli:atfrm is the incom· have unforeseen uses or may be obtained
ple1e represeutation of boundary locations. from a 1hird pany. 1he boundary generaliza-
This problem stems from the typic.al way we tion should be recognized and evalua1ed rel-
represem linear and area fearu.res in vector ative to the specific requirements of any
data sets. As sllow11 in Figure 2-25c. poly- gil'eu spatial analysis. There are additional
gon boundaries are represented as a set of fomis of generalizaiioo in spatial data. and
connected s1raigh1-line segmenls. The seg- these are described more thoroughly in
menrs are a means to uace the boundaries Chapter 4.
46 GIS Fundamentals
L4 1
Pl
L3
P2 1'3
2
Chapter 2: Data Models 47
Nonplanar Planar
node
Line
11ode
crossed _
tines
I
Polygon
overlop 2 3
area
fiaul'(' 2-27: Nonpliwar aod plallllT topology i.n linu "od polygons.
There are. additional top-01ogic.a1 con. consistem. '"cleaner'' data. 01her systems
stmc1s besides planari1y tlm1 may be speci- maintain little topologkaJ informn1ion in the
fied. For example. polygons may be da1a strucrures. but compute and ac, upon
exhaustive, in that 1here are no gaps. boles. topology as needed during specific process-
or ..islands" allowed. Line direclion may be mg.
recorded. so 1hat a "from" and 1'10" node are Topology may also be specified be1ween
identified in each line. Directionality aids layers. because we may wish to enfore-e spa·
the repre.sentation of river or street networks. tial rela1ionsbips between eo1i1ies that are
where there may be a namrnl Oow direction. s1ored separately. As an example. consider a
There is no w1iform set of lopological data layer !lull stores property Lines (cadas-
relationships lhat are included in all 1opolog- iral data). and a housing data layer that
ical data mod.els. Different vendors have s1ores building footprints (Figure 2-28).
incoiporated different topological inforrna- Rules may be specified that prevent poly-
1ion in their data stmctures. Planar topology gons in the housing data laycer from crossing
is often included. as are representations of property lines in the cadastral da1a layer.
tu/joce11cy (which polygons are next to This would indicate a building that crosses a
which) and com1ecffriry (which lines con· property line. Most such ins1ances occur as a
nect to which). re.sull of small errors indata entry or mis,
Some GLS software create and maintain alignment among data layer$. Topological
detailed topological relationships in lheir restrictions between two data layers avoid
data. This results in more complex and per- these inconsistencies. E.xceptions may be
haps larger data slmcn1res. bul access is granted in those few cases when a building
often faster. and topology provides more truly does cross property lines.
48 GIS Fundamentals
•
obscure way. and tlterefore bidden from
users.
dlsollowed
D Point topology is often quite simple.
Points are typically independenr of each
OYC::r1op
other, so they may be recorded as individual
identifiers. perhaps with coorrunates
included. and in 110 JX1nicular order (Figure
2-29. top).
Line topology typically includes sub-
stantial strucrure and identifies 211 a mioi-
mwn the beginning and ending points of
Figurr- 2-28: Topo.logic:ll ni.let 111..,y be cuforocd each line (Figure 2·29. middle). Topology
tteroH <bl.A lnycl"l. Mete, rules Ul~)' be , pecifo:d to may be organized in tables, in<l\lding line
a,·oid ow:rbp be:n,;-cc:o objects in di.ffcrc1u laycrs,
identifiers, starting nodes. and e:nding nodes
for lines. Lines may be assignedl a direction.
and the polygons to the left and right of the
There ore mo:ny other types of topologi· lines recorded.
cal constraints that may be enfotted. bot11
wi1hin and between layers. Dangles, lines Polygon topology may also be defined
that do not connect to other lines. may be by tables (Figure 2-29, bottom). The tables
proscri~ed, or limited 10 b¢ gm1cr or less may record the polygon identifiers and the
than some 1hresho ld length. Lines and points ordered list of connected lines th.at define the
may be required to coincide. for example, polygon.The lines for a polygon form a
water pumps as p,oims in one data layer and closed loop. so the staning node oftbe first
water pipes as lines in another. or lines in line in the list also serves as the ending node
separate layers may be required 10 intersect for the last line in the list.
or be coincident. While these topological Topological models greatly enhance
rules add complexity to vector data sets. they many vector operations. Adjacency analy-
may also improve the logical consistency ses are reduced to a "table look·up'', a quick
and value of these data. and easy operation in mos, software sys-
Topological vector models often use tems. Assm11e the city is represente<I as a
codes and tables ro record topology. As single polygon, and we seek all :neighboring
described above. 11odes are the staning and polygons. Adjacency analysis reduces to I)
ending points oflines. Each node and line is scanning the polygon 1opology table 10 find
given a unique identifier. Sequences of the city polygon and reading tl1e list of lines
nodes and Lines are recorde<I as a list of idea· that bound the polygon. and 2) scan:ning this
tifiers. and point. line, and polygon topology list of lines. accumulating a list ofall left and
recorded in a set of tables. The vector fea- right polygons. Polygons adjarent to the city
tures and tables in Figure 2·29 illusrnue one may be identified from this list. List searches
form of this topological coding. on 1opological tables are typically much
Many GIS software systems are written
faster llt.1n searches involving coordinate
data.
such tlmt the topological coding is not visi-
ble to users, nor directly accessible by them. Topological data models olten have an
Tools are provide,d to ensure the topology is advantage of smaller file sizes, targely
create<I and maintaine<I. that is, there may be because coordinate data are recorded once.
directives that require that polygons in two For example, a nontopological approoch
Chapter 2: Data Models 49
'
Line.s " '
.. Line Begin End Left Rlghf
•• • .m Jl9ll ll2la. ~ =
• ,,
• II l
• •2 ...
...
A
" 19
12
52
...
2 3 8
A
A
13
Polygons Polygon
14
ll! Llnll
!3 • A
8
11.12.52.53.5-'
52.53.19.15.1.A.13
19
f i;,u.rt- 2· 29: An example of ,,:,:tor feo.lUR$ and cotTCsponding topolo,&y l$blc$. lnfo11J1J1tion ou 1hc
adJaccucy. coonccth·icy. and otbc=r$fN!tial rclatiousbip, may be stored 111 topology tables. and joined
10 fc:111.l«s by Uldiecs, bC1't' rcprct.en1cd by ,-alues i.u the ID columns..
often stores polygon boundaries twice. Lines polygons must be closed. Unconnected lines
52 and 53 a11he bouom of Figure 2-29 will or unclosed polygons will cause errors
be recorded for both polygon A and polygon during analyses. Significant human effort
B. Long. complex boundaries in polygon may be required 10 ensure clean vector data
data sets may double their size. Tllis because each line and polygon must be
incre.ases both storage requirements and pro- checked. Software may help by nagging or
cessing. fixing ''dangling" nodes that do not connect
There are limitations and disadvantages 10 other nodes. and by a111ous11ically idemi-
to topological vector models. First. there are fying all polygons. Each dangling node and
computational costs in defining the topologi- polygon may then be checked, and edi1ed as
cal stmcrure of a vector data layer. Software needed to correct errors.
must detemline the connectivity and adja· Limitations and 1he extra editing are far
cency inforn'8tion. assign codes. and build outweighed by the gains in efficiency and
the topological tables. Computational costs analytical capabilities provided by topologi-
are typically quile modest with current com· cal vector models. Mauy cumnl vector GIS
puwr technologies. packages use topological vector models in
Second, che data must be very "clean; • some fonu.
in that all lines must begin and end with a
node. all lines must connect correctly. and all
50 GIS Fundamentals
Vector Features, Tables, and manipulate table values, usually- with the
Structures exception oflhe ID. and the underlying
topology and coordinate data a.re accessed
As described-earlier. geographic fearures via requests to display. change, or analyze
are associated wirb oonspa1ial attributes in the spatial data components. Dau layers
vector models: tables are used to organize may also indude additional infonnation (not
the anribu1es. tn most GIS software, we can shown) on the origin, region CO\'·ered. date of
most easily view 1be tables and a graphic creation. edit history. coordinate st1te111, or
represe.nration ofthe spatial data as a linked other charac1eristics of a data se·1.
table and digital map (Figure 2-30. top).
Note that 001 all GJS store coordinate
Most GIS employ underlying file stmc- and topological data i11 no1Habnlar file
tures 10 organi2e components of the spatial struc.ture-s. Coordinates. points. lines~poly-
data. An e.xample organization is shown in gons. and other composite features may be
the bonom half of Figure 2-30, whe.re the siore<I in tables similar 10 anribntc tables. It
topological elements are recorded in a linked is premarure 10 discuss the detaiIs of these
set of tables. in tltis example one for each of spatially euabled databases. because they
the polygons. lines. and nodes and vertices. are based on something called a relatio11al
Most G!S maintain the spatial and topologi- data model, described in detail i11 Chapter S.
cal data as a single or cluster of linked files. Faster computers suppon th.is generally
This internal file stmcmre is often insula1ed more flexible approach. allowing simpler
from direct manipulation by the GJS user, and more transparent access across different
but underlies nearly all 5Pa1ial data manipu- types of GJS software.
lations. A user may directly edit or otherwise
Geogrophic Deprction
User's View
Attribute Tobie
&\ I It>
t .
.....
16.8
I 2 ~ 2 little 22.2
3 m,d tM_
,~, 103 4
''"' 20.7
2
4 3
107
"--- -·
108
on Ur.cs
l 101.J0.1_--._
\'-· 2
3
t02, t09, ..
103, l04, ...
104, t07,...
,-~.
4
Topology &
Coordinate
Structures
Hidden from -
N&V
.,,.
21
..
T,P<
'
•
•
"
x
12'4.7
2022
16.3
u,.,
..
y
~~
1009
J5.5
ll.2
/r~;
'
l . ::
J02:
Nodes & Yer-Tied
!IS, 47, 48, 49, 50, ·-
7J, 72, 73, T~ . 1,, ·-
35. 21. 2Z. 23, 24, _
9<. 79.60,8l,8Z,_
- ·----· - ·-..--..•. __
User's View \ - - _ , .-
//
·-
Fi.gun 2.JO: Fc.;rnuu in :i tOPol~c::11 tb111 l:iyc~ 1ypic:ally b.:i,•c ll 011c,to--o1!.c: rcltl1i~nsh!J! Wi)h ctllricll iu 1111
associ.ilc:d :ittribl.tc: tnble, TI>c: nttnbute t3blc typ1caUy ~nlams n coluuw ",th 11 1,1mquc: ,dc:1111.fic~. Of 10. for
each fea~ . Topology :tnd coordinate da~ are ofi~o luddc:n &-o.ru th~ user. bu1 lu1ked.10 dt< ottnbu1e lf!d
j'c:Of.tl'lpbic: fc:irut:'C's tbr<)t1aJ1 poimcn al'ld mdc:x \'ftn.11blc:s. ducnbed m l11c: Da!11 and file: Slrtlc:IUl'C"s srt1100.
fatcr in 1bl$ ch11pi:c:r.
Chapter 2: Data Models 51
a) b} c)
Fiaun.• 2.J2: Tbc number ofcclb in a 11l$'.lcr dn.1a set dc:peod3 on 1hc cell ~zc. F«ll' a gi\'Cll lll'Cll. a lioear
d«rease in cell size: caus« an cxpoocuti.al incrcas<c i:o cell 11uruber. e.g .. ba.hing tbe «U size: c:.auses a
four fold increase-in cell number.
may be considered 10 apply to the entire cell. A raster data model may also be used to
If the variable is unifom, across the raster represent discrete data (Figure 2.-33). for
cell, the value will be correct O\'er the cell. example. to repre.sent land cover in an area.
However. under most conditions there is Raster cells typically hold numeric or single-
within·cell varicniou. and 1lle raster cell letter alphabetic characters. A coding
value represe.nts the average. central. or most scheme defines what land cover type the dis-
common value found in the cell. Consider a crete values signify. Each code may be
raster data set representing average weekly found at many raster cells.
income with a eel.I dimension tlmt is 300 Raster cell values may be assigned and
meters (980 feel) on a side. Further suppose interpreted in a1 least seven ditferem ,vays
that there is a raster cell witl, a value of 71 O (Table 2-1). We have described three: a ras-
pesos. The entire 300 by 300 meters area is ter cell as a point physical value (elevation).
considered 10 have this value of710 pesos as a statistical value (average income). and
per week. There may be many households as discrete data (L1nd cover). Raster values
within the raster cell that do not earn exactly may also be used to represent points and
7 10 pesos per week. Howe\'er. the 7 10 pesos
may be the average. the highest point. or
some-other representative \'alue for 1he area •• • o r
'
,, • • ., • •
covered by the ce·ll. While raster cells often • • • •, ' ' , • • ., • •
represent the average or the value measured • • • ' ' , • • .,• • •
• • •t r '' '' ,, • • ., • •
at the center of the cell. tltey may also repre- •• •' •• • •
sent tl1e median, maximum. oranothe.r statis-
tic for the cell area. • • -• • .
•h h' ht ''h h' •'h ' • h0 .,h •h •h
f
f
lines. as the l:Os of lines or points that occur butes must be assigned (Fig1lte 2-34 C).
closest to the cell center. Since attributes are assigned by cells. some
Point and! line assigmnMt to raster cells pr~edctice 1m1s1 be es1ablis.bed. wi!h 011e
may be complicated when there are muJtiple line assigned a higher priority.
features within a single cell. For example. Rasier cell assignment also may be com-
cell vah.1e ass ig:mneot is straightforward plicated when representing what we rypi-
when there is. only one light pole in a cell cally think of as discrete, unifonn areas.
(Figt~e 2-34, near A). When there are multi- Consider the area in Figure 2-35. We wish to
ple poles in a single cell 1here is some ambi- represent a,is area \\i lh a raster data layer.
guity. or generalization in lhe assignment with cells assigned to one of two class codes.
(Figure 2-34. near B). One common solution
represents one feature from the group. and
retains infom,ation on the auributes and .A • /
characteristics of that feature. This entails
some data loss. Another sol11tion is 10 reduce . .
lhe raster cell size so that there are no multi· , B \ l
pie fean~s in a cell. This may result in . '
impractically large data se1s. More complex (
schemes may record muhiple instances of .
features in a cell, but tbese 1hen may slow
access or otherwise decrease the mility ll1a1
comes from t'.he simple rasrer stmcture.
Similar problems may occtir when there
are multiple line segments within a raster
.
J
I
- \
'
)~
searching. or subseuing records become pro· in Figure 2-36b are both of class 10. Both
hibitively tune consuming. Display and are recognized as distinct fe.atures in the , 1ec·
redraw rates become low. reducing the tnil· 1or aud one·tO·one raster representation., but
ity of these da1a. and decreasing the Hkeli· are represented by the same .attribute entry in
hood that GIS will be effectively applied. the many-to-one raster representation. This
to avoid these probtems. a many.to-one reduces the size of the anrib111e table. but at
relationship is usually allowed between the rhe cost of reducing its flexibility. Many.to-
raster cells amd tlte anrib111e table (Figure 2- one relationships effectively create multi·
36b). Many raster cells may refer to a single
pan areas. The data for the represented vari-
row in lhe a1tribu1e cohunn. This subSlrut· able may be summarized by class: however.
tially reduces the size of the anribute table these classes may or may not be spatially
for most data sets. although it does so at the contiguous.
cost of some spatial ambiguity. There may An alreruative is to maintain the one·tO·
be multiple. ooncomiguous patches for a one relationship. b11t to index all the raster
specific type. For ex.1mple, the upper left cells in a contiguous group. thereby reducing
and lower right portion of the raster data set rhe number of rows in the attribute 1able.
Li.oo I e 10 0.8
This requires software to develop and main- nant type or dala (discrete or continuous),
tain the indices. and to create them a,td 1be expec1ed 1ypes of analyses. available
reconstinue the indexing after spatial opera- s1ora,ge, tbe main sources of input data. and
tions. These indexing schemes add overhead the expertise of the human operators.
and increase data model complexity. thereby Ras1cr aa1a mo<lels exbibi1 :;everal
removing one of ~,e advantages of raster advantages relative to vector data models.
data sets over vector data sets.
Fir.:1, rasier data models are particularly suil·
able for representing 1bemes or phenomena
A Comparison of Raster and Vec- that c.t1:mge frequently in space. Each raster
tor Data Models cell may contain a value different than its
neighbors. Thus, trends as well as more
The question often arises, "\Vhich are rapid variabilily may be represented.
better. raster or vector data models?" The
answer is neither and both. Neither data Raster data s11Ucrures are generally sim-
model is better in all conditions or for all pler than vector data models. panicularly
data. Bolh have advantages and disadvan- when a fixed cell size is used. Most raster
tages relative to each other and 10 additional. models store cells as sets of rows. with cells
more complex data models (Table 2-2). In organized from left 10 right. and rows s1ored
some instances it is preferable 10 main1ain from top to bouom. This organization is
data in a raster model. aJ1d in ochers in a vec- quite easy to code in an array s1rt1cture in
tor model. Most data may be represented in most computer lruiguages.
both. and may be conver1ed among data Raster data models are often faster in
models. As an example. land co\'er may l.>e layer overlays. A raster cell corresponds 10 a
represented as a s.et of polygons in a vector given localion. Oata in different layers align
data model or as a set of identifiers for cells cell-10-cell over this location. Overlay
in a raster grid. The choice often depends on involves locating a grid cell and comparing
a number of factors. including 1he predomi- the values found in other layers for the s.1me
location. This cell look-up is quite rapid in Vector data models e.ase the calculation
most roster daia s1n1crures. and storage of topological infom1a1ion.
Finally. raster da1a struc.tures are the Topological infonnation ai<ls in perfonuing
mos1 practica.l method for storing. display. adjacency, connec1ivity, and otheranalyses
ing. and manipula1ing digiial image da1a. in an efficient manner. Topological inforl.Jl3·
such as aerial photographs and satellite tiou also allows some fom1s of automated
imagery. Digital image data are an important error and ambiguity detection. leading to
source ofinfonnation when building. view- improved da1a quality.
ing. and analy?ing spatial databases. Image
display and a.nalysis are based on raster Conversion Between Raster and
operations 10 sharpen details on u,e image. Vector Models
specify the brightness. contras!. and colors
for display. and 10 aid in 1he extraction of Spatial daia may be convened between
infonnation. raster and vector data models. Vector-to-ras-
ter con\•ersion involves assigning a ceU
Vec1ordata models provide some advan- value for each position occupied by vector
tages relative 10 raster da1a models. First. feamres. Vector point feamr,es are typically
vector models often le.id 10 more compact assumed to have no dimension. Points in a
data storage. porticularly for discrete objects. raster da1a se1 must be represented by a value
Large homogeueous regions are recorded by in a rasler cell. so poims have at least the
1he coordinate botmdaries in a vector data dimension of the rasterc.ell .after conversion
model. These- regions are re.corded as a set of from ,iec1or-10.ras1er models. Points are usu·
cells in a raster data model. The perimeter ally assigned 10 the cell cootainiug the point
grows more slowly than the area for most coordinate. The cell in whic·h the point
fean1re shapes. so the amounl of data resides is given a number or other code iden·
required 10 represent an 4reA iuaeases umtli 1ifyiog the poim feature occurring at the cell
more rapidly wid1 a raster data model. Vec- location. If 1he cell size is too large, two or
tor data are much more compact lhan ras1er more v•ctor points may foll in the same cell.
data for most !hemes and levels of spatial and either an ambiguous celJJ identifier
detail. assigned.. or a more complex numbering and
Vec1or data are a more natural means for assignment scheme implemented. lypically.
represeming 11e1w01ks 31\d 01het connected a cell si.ze is chosen such 1ha1 the diagonal
linear features. Vector data by their namre cell dimension is smaller 1han the distance
store information on intersec.tions (nodes) between 1he two closest point fearures.
aod 1he linkages be1ween 1hem (lines). Traf. Vector line feamres in a data layer may
fie voh1me. speed. tinting. aod other factors also be converted to a raster dala model.
may be associated with lines and intersec· Raster cells may be coded us ing different
1ions 10 model many kinds of networks. criteria. One simple method assigns a value
Veclor data models are easily presented to a cell if a vec.tor line intersects with any
in a preferred: map format Humans are pan of the cell (Figure 2-370. left). This
familiar with continuous line and rounded ensures the m.aimenance of ,connected lines
curve represe-nlations in hand- or machine- in the raster fonn of the data. This assign-
dnl\',;n maps. Ras1er da1a often show a ·•stair· ment rule often leads 10 wider than appropri·
step" edge for curved bo1mdaries. panie11- ate lines because several adjacent cells may
larly when 1he cell resolution is large relalive be assigned as parl of lhe line. panicularly
10 the resolution at which the ras1er is dis· when 1he li.ne meanders near cell edges.
played. Vecior data may be ploned wiu, Olher assigomem rules may be applied, for
more vis.uaUy appealing continuous lines example. assigning a cell as occupied by a
an<l rounded edges. line only \\11e11 1hecell cenl'er is ne-ar a vector
Line segmem (Figure 2·3 7o. right). ''Neat"
58 GIS Fundamentals
may be defined as some sub-cell distance. producing errors as well as more efficient at
for instance. J/3 a1e cell wid1h. Lines pass- providing correc1 resulls. Until sufficien1
ing tbrough die comer of a cell will 001 be experience is obtained with a suite of algo-
recorded as in the cell. TI1is may lead to thin- rithms. in this c.ase vector.to-rnsterconver-
ner Linear fea1ures in 1be ras1er da1a se1. but sioo. small. con1tolled 1es1s should be
often a, 1he cos1 of line discon1inuities. performed to verify 1he accuracy of a given
The output from vector•tC>-rasler conver- method or set of constraining parameters.
sion depends on the algorithm used. c,,en Up 10 this poin1 we have co"ere<I vcc1or-
though you use 1he same input. This brings 10-raster data conversion. Oata m,'ly also be
up an important poilll to remember when convened in the opposite direction. from
applying any spatial operotion. The 0111pu1 raster to vector data. Poim. liue. or area fea-
often depends in subtle ways on the spatial 1ures represented by raster cells are con-
opera1ion. Whal appear 10 be qui1e small dif- verted to corresponding vector data
ferences in lhc algorithm or key defming coordinates and srmcmrcs. Point fearures are
parame1ers may lead to quite different represen1e<I as single raster cells.. Each vec-
results. The ease of spatial manipulation in a tor point feature is usually assigned the coor-
GIS provides a powerfol and often easy-10- dina1e of the corresponding cell cemer.
use sel of 1ools. The GIS user should bear in Linear features represented .in a rasler
mind that these tools can be more efficient at enviromneo1 may be converted 10 vector
b)
Roster Cell cent er 01nts Smoothed line
....
...
...
Fi&11J? 2..J7: \'ceto1~to-rnJter co1,,·cnion (o) nnd rastcr-to-vcctorcom"ffSl011 (b). In 4, cells ~re anigncd
in a raster if lh~ intersect with a con,·ertcd ,·cctor. llte left-and ri~t panel, show bow two assignment
rules rcsuh in difrcrciu ra,tcr coding !)Cat lioes. Pancb in b s.bow how raster data may be converted to
, ·cctor fonnn1s. atid n'l~' i,wolvc lin"e s.moo1.bing or ot11cr opcmti<>'lll to remove the-- sll'lit•t tcp," cff'ett.
Chapter 2: Data Models 59
Jines. Conversion to vector Jines typically thal the cell edges align with the coordinate
involves ideu1ifying the cominuous con- sysiem a.,es. Hence. the new cells often do
nec1ed set of _grid cells tha1 fonu 1be line. not correspond to the same locations or
Cell centers are typically taken as the loca- extents as the old cells.
1ions ofvenices along the line (Figure 2· Common resampling approaches
37b). Lines may then be "smoothed" using a include lhe ueau.s, neighbor (laking the out-
ma1hem,1tical algorithm to remove the "stair- put layer value from the nearest input layer
step" effect. cell center). biUuear ;me,polaHon (distance-
based averaging of !he four nearest cells).
Raster Geometry and Resam- and cubic com·o/mion (a weighted average
pllng of the sixteen neares1 cells. Figure 2·38).
Da1a often m1is1 he resampled when An example of a bilinear int·erpolation is
convening between raster and wctor data, or shown in Figure 2·39. This algorithm uses a
changing the cell size of a raster data sel distance.weighted average of the four ne.1r-
(Figure 2-38). Resampling in,•olves reas- est cells in the input to calc.u late the value for
signing the cell values when changing ras1er the ou1put. The ,tew ou1pu1 location is repre-
coordinates orgeometry. Resampling is semed by 1he black post. Initially. the height.
required when changing cell sizes because or ZOU? value, oftbe oulput loc.atioo is
1he new cell centers will 001 align exactly unknown. Zout is calculated based on the dis-
with old cell centers. Changing coordinate tances between the outpm locations and the
inpul locations. The djstance in the Xdirec-
sys1ems may change~ direction of the X
and Yaxes. and GIS systems often require tion is denoted in Figure 2-39 by d1• and the
dis1ance in 1he y direction by dz. The values
Output
raster
. ..
\:···-' ..······'
,
...
.......,····
60 GIS Fundamentals
z 2 _•6
,,
/ z,.4.6
"---z..,
7. __,_
/ • ..,
i.;
,_ -
z, ff
~!
"--------===-==-==-- .:..__:=-::::-=:__ .:::-=:_::.__:~8.s
What Is the value of Z a;1 ? x
in the inplll are shown as gray pos1s and are inpu1 and output locations 10 weigb1 values
labeled as 21 1hrough z,. lmennedia1e at each inpm loca1ion. The cubic convolu-
heights Zb and~ are shO\\-n. These repre- tion resmnpling calculation is similar. e.xcept
sent the average of the input values when that more cells are used. and the weighting is
taken in pairs in the x direction. These pairs not an average based on linear dlis1ance.
are z, and Z2• 10 yield Z,.. and Z3 and z,. to
yield z 0. z. and Z0 are then averaged 10 cal-
culate z..,.using 1he distance dz between the
Chapter 2: Data Models 61
occeptoble.
convergent ctrcles
...---.....' \
--
\unocceptoble circle ·
conto1ns o somple point
62 GIS Fundamentals
data structures. Vector topology could be sented by lines and the valves by points, but
included. incorpnrated within the single these vec1or elements are enhanced in the
object Object model approaches have been object model because the specific pipe and
adopted by at least one major vendor of GIS pipe propettie.s may be linked 10 the specific
software and are applied u1 a number of valve anacbed 10 a given location. Object
fields. models can have i11heritn11ce. automatically
Object models for spatial data often fol- transfering properties within classes of
low a loglcol model. a user's view of the rcaJ objec1s. We may crea1e a generic val\le
objects we portray with a GtS (Figure 2-4 t ). object witlt a maximum pressure rating.
This model includes all the ·'thu1gs•· of inter- cost, and material type. We may create
est. and the relationships among 11te,u. val\le subclasses within this class. e.g.. emer.
Things. or objec1s. might include power gency cut·off valves. primary control valves.
poles. traasfonners, powerlines, me1ers. and or shunt valves. These subclasses will inherit
customer buildings in a city. and relation· all the propeny variables from a generic
ships among 1hern would include a crnns- valve in that each bas a cost. ma:.ximum pres-
former on a pole, lines be1ween poles, and sure. and material. but each subclass may
meters at points along the luies. The logical also have additional unique propenies.
model is often represented as a box·and-line Figure 2--42 shows an example of an
diagram. object data model for hydrologic basins and
Most object 01odels define the proper· related stream foarures. Tbe top fiame shows
ties of each objec·t. and the. relationships feanires: in this example basins. sul>-basins,
among objects. Pipe objects may have a a stream net,vork. and features on the stream
diameter. materia I type. and be connected 10 network such as sampling statiOllS. The bot-
valves and tanks. The pipes may be repre- tom panel shows die feature types. anributes.
and propenies in 1heobjec1 model. Nole 1ha1
there are both object propenies and topologi·
cal relationships represented. and that multi·
COUNTY pie feature types may be represented in the
name object model. The object data model bas
6
OWNER
nome
stote
watershed dlstrtet
b<llh advantages and disadvantages when
compared to tradition..11data models. Some
geographic entities may be naturally and
easily identified as discrete units for partict•·
lar problems. and so may be narurally
address amenable to an object·orienled approach.
p one number
PARCEL
) Some proponents claim object models are
more easily implemented across. a wider
range of da1abase software, partiicularly for
complex models. However. object data mod-
els are less usefi>I for representiug cootinu·
tox ID number ously varying features, such as elevation. In
location addition. for many problems. object defini·
size tion and indexing may be quite complex.
value Software developers h.we had difficulty
developing generic 1ools 1ba1 may quickly
fiJu1't 2-41: Objects io a GIS database may Ix
implement object models. so there is an
conecprualizcd in ii. diap;rruu. or toai,cal mockIo( added level of specialized training required.
how llicy lll"e rd.Aled. Hctt, lhnc- l))>"Of objects Finally, we note that d1ere is no widely
arc- rc-prcscot«l. with 0 \\1lffl ~101ed wi1h p.i,r· accepted. formal def01i1ion or ,vbat consti·
,eels. and pan:el, as-sociatcd "';1h ,cou:nti«.
lutes an object data model.
Chapter 2: Data Models 63
Objects
•
Stream net Stations
•
•
• • •
•
Schematic Diagram
Bo sin ·a
Wotcrreorurc
Wole,-10 IW'o1 ushed I
Sob·bosi.n
Woter(eofure
Water IO
l~a:=I
WoterCocle
WoruCode:
~ -b
R:ourePoin 'c
Bosinttorurt RouteFtoturt lr'IUD
8cslnl0 RovtetO
BoW1Area
RouteT~e ul t i Dom
GO Olf"II
Three-Dimensional Data Models sel is, they also suffer from the same draw.
backs, particularly the trade-off between pre-
GlS in built environments are increas· cision. or voxel size. and data volumes.
ingly integraiing three-dimensional (3-D)
infonnation such as building heights, roof Many alternative "vector-like" 3-D
shapes. and other heigb1,rela1ed characteri.s, models have b~n propose<!. many defining
tics (Figure 2-43). This is in part to support a body element, in addition to 1he standard
analysis. and in part to generate visualiza. point, line. and polygon elements. Points are
,ions from at or near ground-level. for exam, used to create lines. lines for polygons. and
pie. building apperu-ance from a nearby road. polygons to create bodies. Muell like 2-D
Improved data e-apture allows for the rapid vectors. 3~0 vectors add indexiug schemes.
development of J.-0 data Iha, musi be inte- with 1he added complication of n z coordi-
grated imo an appropriaie data model. nate to any element abo\•e the base plane of
the data layer.
Several 3-0 data models have been pro-
posed. with ''Vector-like" models more com- One 3-0 model. developed by Bentley
mo,~y applied 1han "raster-like.. models. Systems. is called a rea/iry 111es/1 (Figure 2-
The lauer employ the concep1 of voxels, or 44). It combines a tltree-dimensional trian-
volume eleme,us. esseniially cubes of a gulated irregular ne1work with a.n image-
fixed dimension. :Flat or baseline areas lMwe based texture surface to create realistic rep-
zero voxels stacked over the surface. with a resentations ofJ-0 feamres. Complex 3-0
requisi1e number of voxels ·~1acked" a, each surfaces can be eff'icienlly represemed.
rasier cell 10 represent heigh 1. Wliile easy 10 demanding less storage space, \vhile proper-
access and simple for all the reasons a rasler ties like material or color can be associa,ed
-··---
·-
·--
·-
·-
··-
--
···
.·-
=-
·--
··-
f lautt 1-ll: ~ e.<crunplc of3-D St>Jiti:L! dtlta displtl.ycd for 11. n:gioo in Gcnntmy. The third dllncniioo ftids
u1 mn.ny pla.wm1g and a1$C$S'lncnt h.11,c1101l$. llud i11 becoming common iri mimagiog 1hc built c1.wironmcn1
(cour1csy City of Heidelberg and ilie Vai\·mity of Heidelberg).
Chapter 2: Data Models 65
with each of the triangular faces. A com- tured, stable. productive tools, supporting
pressed raster ''texture s11rface" 11tay be Stan to finish worktlows. from 2-D 10 3-D
wrnJ)ped around me 3-D TIN. yielding sub- data conversion. 3-0 spatiaU data ingestion.
stantial detaiL The images may be com- processing and organiza1ioru. and output to
pressed. using methods described later in project. video, and in1ernc1i,·e Web services.
this chapter. to reduce data volumes. Three-dimensional G!S processing tools are
While vector 3-D models are becoming available from other ve.ndor.s. and as a plug.
common. no ooe model form orstaodard bas in forQGIS.
been widely adopted. Three-dimensional
GIS products have become quite mature, for
example. the 3-D SP"tial analyst and 3-D
CityEugine products from ESRI are full-fea-
Fiat1rt 2-.W: .An c:un1ple of a rcali1y m«b as a J.-0 ruodcl. Surface gcomcl:t)• is recorded i.u a J.-01ri1U1gu-
latcd imj.UW' network.. while a corrcspondin; ''texture.. ,ur&cc is projcc:tedou to oorrcspondina f.accts
(courtesy Balltlcy Systems).
66 GIS Fundamentals
Roster DEM
Detailed view of
roster cells
6-45 650 654 658 6SJ 64&
Vector contours
fi&nn" 2-45: 0.1tn n:ay often be- rcprc~cutcd in 1,c,"Cnl dal.'1 models. Oigitaf clcv:r.1ion d:it11 arc C01JUJ1()1tl)' rq>-
rcscnted in n1,5ter(OEM). vector (cootOU1$), '-Lld TL!I{ data models.
Chapter 2: Data Models 67
binary decimol
00000001 1
00000010 2
00000011 3
00000100 4
00000101 5
00000110 6
00000111 7
00001000 8
00001001 9
00001010 10
.... ....
Flam-e 2---46: Billa.J)' rcpr«eutation of dc:ci:iual muub,c,ti.
68 GIS Fundamentals
Ead1 digit or colunm in a binary number grams, including GIS, particularly for data
is called a bit. and eight columns. or bits. are expon or exchange.
called a 1>)1e. A byte is a common unit for ASCII codes aJlow us to e.asily and w1i-
defining data types and numbers. for ex.1111. fonnly represent alphanwneric cbaracten
pie. a data file may be referred 10 as contain· :mch as lciwrs. p11ncnia1ion.othC'I" characters.
ing 4-byte integer numbers. Tl,is means each and munbers. A$Cll conve.rts binary num-
number is represented by 4 bytes of binary ben 10 alphanumeric cluu-acters through an
data (or 8 x 4 • 32 bits). index. Each alphanumeric character corre-
Several bytes. are required when repre. sponds to a spec.ific number bel\,.'een Oand
senting la,ger nwnben. For example. one 255, which allows any sequence of charac-
byte may be used to represent 256 different ters 10 be represented by a number. One b)1e
values. When a byte is used for nonnegative is required to represent each character in
integer numbers, 'then only values from Olo extended ASCII coding, so ASCII data sets
255 may be recor<led. This wiU work when are typically much la,ger than binary data
all values are below 255. but consider an ele· sets. Geographic data in a GIS may use a
vation data layer with values greater than combination of binary and ASCfJ da<a stored
255. If the data are not rescaled. then more in files. Binary data are 1ypicaUy used for
than one byte of storage is required for each coordinate infommtion. and ASC!l or other
value. Two bytes will store up to 65.536 dif· codes may be used for attribute data.
fereot numbers. Terrestrial eleva1ions mea·
sured i.n feet or meters are all below this
value, so two bytes of data are often used to
Pointers and Indexes
store elevation data. Real numben such as Data rnes may be linked by fi!epoimers.
12.19 or 865.3 typically require more bytes. ;,,dexes. or other strucn1res. A pointer is an
and are effectively split, that is, two bytes for address or index that connects o.ne file loca-
the whole part of the real number. and four tion to another. Pointers are a com.moo way
bytes for the fractional ponion. to o,ganize infonnation within :u,d across
Bi.nary numbers are often used to repre· multiple files. Figure 2-47 depicts ru, exam-
sent codes. Spatial and attribute data may ple of the use of pointers to organize spatial
then be represented as text or as standard data. In this figure. the polygon is composed
codes. rrus is panicularly common when of a se.1ofline.s . Pointers are used to link the
rasrer or vec1or data are convened for expon set of lines that fom1 each poJygon. There is
or unpon among <li!ferent GIS software sys- a pointer from each line 10 the next line.
tems. For exampLe. ArcGIS. a widely used fonuiog a chain tbat defines the polygon
GIS. produces se"eral export fonruus tliat boundary.
are in text or bina.ry formats. ldrisi. another Pointers help by organizing data in such
popular GIS, supports binary and alphanu- a way as to i.mprove access speed. Unorga-
meric raster fomiats. oh:ed dala would require time.conswning
One of the most common number cod- searches each time a polygon boundary was
ing schemes uses ASCII designators. ASCD to be ideotified. Pointers also aUow efficient
stands for the American Standard Code for use of storage space. In our example, each
lnfom1ation lnte.rchange. ASCH is a stan. line segment is stored only once. Several
dardi2ed. widespread data fon11a1 that uses polygons may point ro the line segment, a$ it
seven bits. or the numbers O through 126. 10 is typically much more space efficient to add
represent text and other characters. An pointers than to duplicale ihe line segment.
extended ASCII, or ANSI (American Shopefiles are a common vector spatial
National Standards lt1Stin11e) scheme. uses data format that uses an index 10 link files.
these same codes. plus an extra binary bit to Shapefiles were originally developed by
represent number:s between 127 and 255. ESRI as a way to store point. line, and poly·
These codes are then used in many pro- gon feanires. although they ba"e since been
Chapter 2: Data Models 69
l Arc Al
memory address 0011 O!OC 0101 0110 )
coordinates 1.9 5.10 8.8 11 _3 pointer
Al
f Arc A2
Oty OddN:SS 1000 1001 1010 )
A3 ool'd1notes 11.J 9.0 0.2 pointer
+Arc A3
memory oddt'tss 1110 1111
pointer
coordtf'lotes 0.2 1.9
A2
FJaure M7: Poinrcrs lltc used 10 Ofl:tilltt \'ttfOI' d11.1~. Poi.ntctt t ~ rcd11ndi111.1 s1orngc: 11.1.d
ii)cT~SC speed of 11«e$$,
adopted as a common fonuat for data inter· and another list The roads.sh.'< file conlains
change and analysis. Shapeftles are sup- indices that point 10 the segment records in
ported by most other GIS softwares that the .shp files. based on these- identifiers. TI1is
process vector data. speeds access, avoiding a search in the .shp
Shapefiles represent layers with a clus- file each 1ime it links segmems in a rood.
ter of files. Each file has the same base name The roads.dbf file also uses an index 10 poiut
but a different filename extension. indicated to 1he combined roads in 1he .shp and .shx
by a suffix. for example. the ".shp" in the files. A group of segments may be used to
filen.,me i.1xmndary.shp.·· A transportation fomi a line. and associated with a set of anri-
data layer stored in shapefile format might butes stored in a .dbf file.
have the base name of roads. with differem Because pointers and in.dices are key
suffixes for different files: elemenls in organizing the s..patial data. alter-
roads.shp ing them directly will usually cause prob·
lems. Typically. these indices are created by
roads.shx 1he software during processing. and updaled
roads.dbr as needed when data are added, modified. or
analyzed. Poimers may be ,·isible. for exam-
roads.prj ple. the O!D colunms in the .dbftables used
etc. with shapefiles. but manually changing the
The first three files above are all values will often n,in the data layer. You
required to represent a vector data )ayer
should know the identity and use of pointer,;
using sbapefi.le~ These files are connected in your data sets so that you don't change
using indices. munbers that identify coDl\ec· them inadvertently.
tiOD$ ;ind groll!pings for various components. Pointers~indexing, and multifile layers
The .shp files co111ain the coordinates that are not limited 10 vector data. Many raster
represent each road. organized by line seg- formats store a majority of the cell data in
ments. There is general infonnation for each one file. and additional, linked information
segmenl, and Lhen a list of coordinates and in an associated file. You m11St be careful
other data for the segment. This is followed when transferring a data lay·er 10 include all
by general information for the next segment. the associated files. For exa,nple. c.opying
70 GIS Fundamentals
sum along the run-length codes to identify a indexing than simple raster data sets. Point-
cell position. This is typically a minor addi· ers are used to link data elements in a tree·
tional cost. b,n in some applications the like strucrure. hence the name quad trees.
trade-off belween speed and data volume There are many ways to structure the dala
may be objectionable. pointers. from large 10 small. or by dividing
Quad rree representations are another quandrants. and these methods are beyond
raster compression method. Quad trees are rhe scope of an in1roductory text. Further
similar to run-length codings in t11a1 they are information on the s1ruc1ure of q1,ad trees
most often used to compress raster data sets
may be found in the references at the end of
when representing area features. Quad trees this chapter.
may be 1hough1 of as a rasttr data strucmre There are many other dma compression
with a variable spatial resolution. Raster cell methods that are commonly applied. LZW is
sizes are combined and adjusted within the a lossless compression metltod commonly
data layer 10 fit into each specific area fea. applied 10 image and raster data sets. partic·
nire (Figure 2-49). Large raster cells that fit ularly GIF iumges and TIFF fomtats. JPEG
entirely into o ne unifom1 area are assigned and wavelet compression algorithms are
the ,,,.Jue corresponding to that area: for often applied to reduce the size of spatial
example. the three largest cells in Figure 2· data, particularly image or other data,
49 are all assigned the value o. Successively although as implemented these are lossy
smaller cells are then fit. halving the cell algorithms. Generic bit· and byte-level com·
dimension at each iteration. again fitting the pression methods may be applied to any files
largest cell th.at will fit in each uniform are.a. for compression or communications. There
This is illustnned in !he top-left corner of is usually some cost in lime 10 the compres·
Figure 2·49. Successively smaller cells are sion and decompression.
defme<I by 1pli11ing ·'mixe<I cells.. into four
quadrants. and assigning the values o or b 10 Raster Pyramids
uniform areas. Titis is repeated down 10 the
smallest celJ size that is needed to represent We some1imes inten1ionally increase the
uniform areas at the required detail. size of our rasterdata sets wjthout increasing
The varying cell size in a quad tree rep- the resolution in a process known as p)•·a·
resentation requires more sophisticated miding. We create pyramids. to increase dis-
play speeds when \~ewed al' small scales
(..zoomed out"). Long redraw times often
hinder use of k1rge data sets. particularly
when ponning frequently. When displayed at
• • very small scales. the cell size of a data set
may be smaUer than the resolution of the
• computer screen. A raster data set 1,000.000
• pixels across bas 1.000 times the data that
can be displayed on a moniror with 1.000
• pixel horizontal resolution. :However. dis-
play software must wade through all
1.000.000 data elements in a row 10 pick die
I cell in 1.000 to display. While clever soft-
• • ware ran help, there are limils to bow much
we can spee<I up the redraws.
Pyramiding in effect saves subsampled
copies of the cells at various resolmious. ln
our example above. pyramids may do the
Flautt 249: Qnad tttc comprcu:io11. equivalent of saving every t·wo. every four.
72 GIS Fundamentals
e\'ery 10, every 30, and every I00 cells, all The Geospalial Dala Abs1rac1iou
wi1hin ll1e same ras1er data set. The software Library (GOAL) provides a u1iliiy to rrans-
then compares the display scale to die late among many common vec1or and raster
dimensions of the data set. and chooses the file fom1a1S. This free u1ili1y is flexible, and
most appropriaie ,cell resoluiion 10 display. often can be used 10 ex,end 1be reach of
Redraws are much (aster, and transparem to commercial packages. by firs, using GOAL
the user. to convert files from unsupported to sup-
Nole 1bai we say pyramids "in effecl'' poned 1ypes. and !hen imponing 1hese into
save copies of cells at various resolutions. rbe 1a,ge1 software.
This is !he simplest me1bod, but often no! 1be
most efficient for space or speed of access. Summary
Sophisticated indexing may be used to point
10 the cells a, !he appropriaie resolutions. In !his chapeer we have described rhe
main ways of conceptualizing spatial enti-
No1e that pyramiding comes at a cost. 1ies. and of representing tl1ese enti1ies as SJl'l·
both in !he size and complexily of !he rnsler tial fearures in a computer. We c·ommonly
data set Indexing schemes complica,e the employ two conceptualizations. also called
simple rasier data simcmre, and !he software spatial da1a models: a raster da1a model and
mus1 be able to navigate the indexing a vector data model. Bo1h models use a com-
scheme. Already large raS1er dala se,s may bination of coordinates, defined in a Carte-
be infla1ed from a few percen1 10 several sian or spherical systent and anribu1es. 10
times, although in practice it is typically less represent 0111spa1ial fean,res, Fearnres are
1han a doubling o.f size. usually segrega1ed by lhematic type in lay-
ers.
Common Fiie Formats Vecior da1a models describe 1he world
A few file formats are commonly used as a se.t of point, line, and area fea tures.
10 siore and traosfer spa1ial data. Some of Anributes may be associated wiu1 each fea-
these file structures arose from distribu1ion nire. A vector data model splits that world
fonnalS adop1ed by go"eou11emal agencies, into discrete features, and often supports
01bers were specified by software vendors, 1opological relatiooships. Vec1or models are
and some have been devised by standards. most often used to represent features that are
making bodies. Some knowledge of !he considered discrete. and are compatible with
iypes and propeniesof1hese file fonnats is vector maps. a common ou1pu1 form.
helpful 10 tl1eGIS practilioner. Raster data models are base,d on grid
Common geographic data fonnats may cells and represen, !he world as a "checker-
be placed into three large classes: raster. vec- board." with uniform values within each
1or, and anribu1e. Ras1er fonnats may be fur. cell. A raster data model is a narural choice
ther split into single-band and ornhi-band for represen1ing fe.atures that vary continu·
file iypes. Multi-band rasier data selS are ously across space. such as tenip,:ramre or
most often used to store and distribute image precipitation. Oata may be conv,erted
data. while single-band raster da1a sets are between raster and vector data models.
used to siore bolb single-band images and We use d.11a structures and computer
nonimage spatial data. Table 2-3 summarizes codes to represent our conceptuaJizations in
some of 1be moSI common spa1ial data for· more abs1racL but compu1er-corupa1ible
mats. fomis. These Slmcmres may be op1imized 10
Mos, GIS softwares provide some utili1y reduce storage space and increase access
for data impon and e.xpon from Slandard for- speed. or to enhance processing based on !he
mats
namre of our spatial data.
Chapter 2: Data Models 73
OWG, Au1o0est .dwg 1-JoU..·e b(nary 1..6 used by Au10De$k IO s10re goo-
gmphic data and <fmww,gs n AutoCAO (V),
Geodalabase,. ESRI .gdb, .mdb ESRlcool8Wlef tor many data types(R. V, A. I)..
GeoPad:age, open .gpkg Open siaodard for represent11g vectot and raslerdata,
standtwd conlC)&·bble with SOfile tR. v, A)
GeollFF. o;ien stan.. .TIF. TIFF All exlension for georetetencing Aldus-Adobe ptibic
®<d cl<lm8in TIFF format (R).
GPX. open siandard ,gpx Aspecifice,1ion based on XMLfO< t>osic: GNSS dal8 (V)
ln\erchaoge, ESRI eOO ASCII lex! r..e fOf veck>r and idertifying attribute dala
cv,
Keyhoh Malhlp I.an. KMl. XMI. extension for displaying and annotating features
guago, GooglO 8lld imagos CV. I, !'J.
Shape619, ESRI .shp, ,shx, dbf, ,Pf), Three or more binary flies that include lhe vecto, coor.
&~ OChOrs dl"l8IO, atttlbulo, endo!Mf informiat(ln (\I).
NLAPS. NASA vanous tn & d:reaory arnego d8t& from vanous landsat se101uos. in a SQOC•·
fi9d c&-ectory struduf& (I. R).
SOTS, U.S. Govern- oone Ss)atial Data Tf8Mfor Si&ndard, specdtG:s the spat.ial
menI objecl'S, attribules. reference sys1em (R, V, A)
74 GIS Fundamentals
Suggested Reading
Arctur, 0 .• ZeLler. M, (2004). Desig11iug (;eodnrnbnses: Cnse Studies 111 (;IS Daw
Modeli11g. Redlands: ESRJ Press.
Batcbeller. J.K.. Gittings. B.M .. Dowers. S. (2007). Tue perfonnance ofve<:lor ori-
ented data storage in ESRI's ArcGIS. Trm1sncrio11s i11 GIS. II :47-65.
Batty. M., Xie. Y. (1991 ). Model structures, exploratory spatial data analysis. and
aggregatioo. Jmemational Joul'nal of Geographical htfon11atio1t Sys1ems. 8: 29 1-
307.
Bognslawski. P.. Gold. C.M.. Ledou.,. H. (2011). Modeling and analysing 30 build-
ings wi1h a primal/dual data stn1cture. ISPRS Jo,m,al ofP!torogrt1111me1ry aud
Remote Se,,si11g. 66: 188-t97.
Bregt. A.I<.. Deoneboom. J.. Gesiok. H.J.. van Rauden. Y. ( 1991). Determination of
ras1erizing ,error: a case srudy with die soil map of The Ne1berlands. /111er 11nrioua/
Journal of (ieogMphical !11fo1111a1iou Systems. 5:361-367.
Carrara. A.. Bitelli. G.. Carla. R. (1997). Comparison of techniques for gene.rating
digital terrain models from contour lines. lnternational Journal of Geographical
J1,fon110Hon Sysrems. 11 :451~73.
Downs. R.M. ( t 998). The geographic eye: seeing through GIS. Tro11sac1io11s i11 GIS,
2:111- 121.
Holroyd. F.. Bell. S.B.M. (1992). Raster GIS: Models of raster encoding. Cr,111p111ers
1111d (;eoscie1,ces. t 8:419-426.
Lee. J. (1991 ). Comparison of existing methods for building triangular irregular net-
work models of terrain from grid digital elevation models. luforna;Jcnnl Journal
of Geogl'apl,;ca/ /11fon11atfo11 Sysrems. 5:267- 285.
Nagy, G.. Wagle, S.G. (1979). Approximation of polygonal maps by cellular maps.
Comm1111icatio11s ofthe Association of Computational J\lachiuery. 22: 518-525.
Peuquet. D.J. ( 1984). A coucepmal framework and comparison of spatial data mod-
els. Cortographicn. 21:66-113.
Piwowar. J.M., LeDrew, E.F., Dudycha. D.J. ( 1990). Integration of spatial data in
vecior and raster fommis in geographical lnformai.ion sysl'ems. lmen,atlonal
Journal of Geogrnpl,;cal /11fon1101io11 Systems. 4:429-444.
Rigau.,. P.• ScboU. M•• Voisard. A. (Eds.). (2002). Spatial Database.,: with Applica-
tion"" G!S. New Yor.k: Elsevier.
Shaffer. C.A., Samet. H., Nelson. R.C. (1990). QUILT: a geographic infomiation sys-
tem based on quadtrees. /memntioun/ Journal ofGeographical J11/o·n11atfon Sys·
/el/IS. 4:103-132.
Slocum, T.A., McMaster. R.B., Kessler, F.C.. Howard. H.H. (2005). Thematic C111·-
iogrnphy nud Geographic J!is11nli:atio11. 2nd ed. New York Prentice-Hall.
Tomlinso11. R.F. (1988). n,e impact of the 1tansition from analogue to digital carto-
graphic representation. 11,eAmericnu Cnrtographe,; 15:2490-.262.
Tuan. A.N.G. (20 13). Overview or three-dimensional GIS data models. /111er11atio11al
Journal ofFut11re Computer aud Co1111mmicmio11. 2:270-274.
76 GIS Fundamentals
\Vedhe, M. ( 1992). Grid cell size in relation to errors in maps and inventories pro·
duced by computerized map processes. Photogmmmetric Engineering and
Remote Ser,siug. 48: 1289-1298.
Wilkie, 0., Sewall, J., Lin. M.C. (20 11). Transfonniog GIS data into functional road
models for large.scale traffic simulation./£££ Transac;ious 011 Jlis11a/i:a1io11 and
Co111p11te,· Graphics. 18:890-901.
Worboys. M .F., Duckham. M. (2004). G/S: A Comp111i11g Perspectil'e. 2nd oo. Boca
Raron: CRC Press.
Zeiler. M. ( 1999). Modeling 0111· World: The £SRI Guide 10 (ieodatabnse Design.
Redlands: ESRI Press.
Zhi-Jun. !... \Veller. D.E. (2007). A s1ream network model for integrated watershed
modeling. E111;,-o,1111e111nl Modeling nud Assessme,11, DOI: 10. 1007/s 10666-007-
9083-9.
Study Questions
2.3 - Defute a data model and describe lh.ree primary differences between the two
mos, coruruonly used da1a models.
2.10 - Complete the following coordinate conversion table, converting the listed
points from degrees-miuu1es-secouds (OMS) 10 decimal degrees (DD). or from
DD 10 OMS. See Figure 2- 10 for 1he conversion fonuu la.
I 36°45'12" 36.75333
2 114°58'2"
3 85°19'7"
4 14.00917
5 275.500001
6 0.99528
7 183°19·22·
Chapter 2: Data Models 79
2.JJ - Complete the following coordinate conversion table, conve.rting lhe listed
poims from degrees-minutes-seoonds (DMS) ro decimal degrees (DD). or from DD
10 OMS. See Figure 2-10 for rhe conversion formula.
2 122°10·2·
3 15•0·12·
4 322.19861
5 152.65583
6 5.75
7 23°12'50"
2.12 . Calculare rhe radian and surface arc distance. in mere rs. assumi11g an Earth
radius of '6,378. 1kilomerers.
2.13 - Calculate the radian and surfac-e arc distance, in meters, assuming an Earth
radius of 6.378.1 kilometers.
2. 14 - Assume a spherical Earth with a radius of 637S.Okm. Calculate the gre.at circle
distances from St. Paul. Minnesota. latitude 44.9537°. longitude -93.09°. to the fol·
lowing points:
a) Chicago. latimde 41.8781°. longitude -87 .6298°
b) Reykjavik. latitude 64.1265°. longitude -21.8 174°
c) Buenos Aires. latirude -34.603 7°. longitude -58.3816°
2. 15 - Assume a spherical Earth with a radius of 6.378.0 km. Calculate the §feat cir-
cle distances from St. Paul. Miru,esota. latinide 44.9537°. longinide -93.09 . to the
following poims:
a) New York. latin,de 40.7128°, longitude -74.0059°
b) Paris. latimde 48.8566°. Jongimde 2.3522°
c) Tokyo. latitude 35.6895°. longimde 139.6917°
2.16 - \Vhat is vector topology, and why is it important? What is p.lanar topology. ruld
when might nonplauar be more useful than planar topology?
Chapter 2: Data Models 81
2.17 - Identify 1he number of times each of the following 1opologica1 rutes are broken
for the bt,ildillg outlines (red/darker) aud property parcels (green/light) polygon lay-
ers. Note that all layers are semi-transparent so that you may identify overlaps.
········· - · .. ···j ** ...... - - - - - · ·
••
..
•
.•'
.•
..•
•
2. 18 - Identify the number of times each oftlte following topological rules are broken
for the building outlines (red/darker) and properry parcels (greenlligbt) polygon lay-
ers. Note that all layers are semi-transparent so that yo11 may ide11tify overlaps.
:•----~ ---------a•___----,:
I
•I
II
I
~-,.;,~--,~l-..-~.-
1 ,,
•
I
--1-·=:-1 I
I I I
I I
I
•• •
: I
I I I
I
:.
I
I
I
• • '
..........·:
' ..
I
I
I
I
I
I
•---
_____..
I
I
I
I
2.19 - Draw multi-part and single-part tables fo r the United Kingdom, recording the
labeled coun1ties.
2.20 - Draw multi-part and single-part tables for Italy. recording cite Italian mainland
and labeled nllljor islands.
s-1',_r,ntond
~ Itoly
2.21 - \Vbat are the respective adva1.11ages and disadvantages of vector data models
vs. raster data models?
2._22 - Under what conditions are mixed cells a problem in raster data models? In
what ways may the problem of mixed cells be addressed?
84 GIS Fundamentals
2.23 - List the labels (A. B, C, etc.) of raster cells that would be assigned as water
(dark. blue) and not laud (light. green) under a majority coverage mle.
,... ;
·-·····~·······t·
4 ·• ~· .. !· ... ;,
...... .. ....
~
:
: .
,. .'
••
Chapter 2: Data Models 85
2.2.S - The following figure shows change in raster resolution. combini.1.1g four smaU
cells on tbe Jell 10 create an output for each corresponding larger cell on the right. Fill
in the cwo rasters 011the rigl11. for the in1ervaVra1io data (top}. and the nominal data
(bottom). Assume null values are not ignored, and a majority rule for nominal data.
2 2 1 1
2 2 2 2
1 2 1 2
4 5 3 null1
a a a a
a a a b
a b a b
c c c d
2,26 - The following figure shows change in raster resolmion. combining four small
cells on the left Lo create an output for each corresponding larger cell on the right. Fill
in the 1wo rasters on ct,e right. for the i111erval/ra1io data (top). and ct,e nominal data
(bonom). Assume null values are 1101 ignored. and a majority rule for nominal data.
2 2 I 2
null 2 2 2
1 null 4 3
3 null 3 4
a a b a
a b c b
null c c c
c c c c
86 GIS Fundamentals
2.27 . Comple'le the one-to-one (top) and mruty-10-one (bonom) raster tables in the
figure below:
attnbute table
(rows first,Sforl
upper left corner)
cell-lO t· count
2 2 3
>-
2 2 2
- I
2
2
2
I
I
I z I 2 3 I
' s 3 3
3 I
attribute table
2 2 I I cell· IO COUl'\1
2 2 2 2 I 4
2 8
I 2 I 2
s 3 3
' 5
2.28 . Co,nple'le the one-to-one (top) and many-10-one (bonom) raster tables in the
figure below:
at t r ibute t able
(rows first.start
5 7 2 3 3 upper-leH corner)
4 7 cell· ID count
9 10 6
5 I
8 8 3 4 3 7 I
2 I
7 7 4 4 3
8 7 4 3 2
2 1
5 7 2 3 3 attribute t able
7 cell·ID count
9 10 4 6
2 2
8 8 3 4 3 3 ()
.
7 7 4 4 3
8 7 4 3 2
Chapter 2: Data Models 87
2.30 . What are the main concepts behind object data models. and how do they differ
from other data models?
2.36 - Why do we need to compress data? Which are most collllllonly compressed.
raSler data or vector data? Why?
2.37 - What are poimers when used in the context of spatial data. and bow are they
helpful in. organizing spatial data?
88 GIS Fundamentals
2.38 - Write the ntn·length coding for each of the rows in this raster:
b b 0 0 0 c 0 0 0
c c b b d d d 0 0
b b b b b b b b b
e c f b a d f b 0
0 s a f f f b b 0
2.39 . Write tlie run-length coding for el'Ch of the rows in this raster:
c c c c a 0 a a 0
0 0 b b d d d 0 0
,e e e f f f f f e
<l 0 a 0 a 0 0 a a
c c a 0 0 b f d e
89
Introduction
GIS are d ifferent from orher infomia- Most d.1ta and analysis ill GIS use two-
tion systems because they iuclude coordi· dimensional, Canesian coord inate sys·
nares. Proper use of GIS requires rems. These systems are created via map
understaoding bow coordinate systems are prcjectio,,s. 1be sysrema1ic ma1hema1ical
established, how coordinates are measured rendering of latitude/longit'ude coordinates
on !he Eanb 's curving surface. and bow from 3-D spherical sysrems 10 2-D Cane-
these coordinates are convened for use in sian systems. Every map projection must
flat maps. This chapter introduces geodesy. distort surface geometry in some way due
rhe science or measuring Earth shape and 10 lhe Eanh's curvature. When we plo1 la1i-
loca1io11s1 and map p,.ojectio11s the tmns-
1 nide and longitude coordinares on a Cane-
fomlation of coordinate locations from the sian system. "straight" lines will appear
Earlh's curved surface 0010 Oat maps. curved and polygons will be dis1or1ed. This
As described in Chapier I. GIS prac1i-
dis1or1ion may be diffkull to de1ec1 on
1ioners commonly use two kinds of coordi· maps that cover a small area. but the distor-
nare sysrems. The firsr is a spherical tions become apparem as tlie mapped area
coordinate system with latitudes and longi- grows. We Limit the dis1onioo error by
rudes on the surface of an ellipsoid-shaped carefully se.lectiog our map p.rojec1ions,
globe. Geodesis,s meaSl~e. combine. and and by limi1ing the area over which we
optimally estimate a se.t of latirude/longi- apply any one given map projectiO>L We
tude coordinates using our most accurate 1.ailor the type. number and extent of our
and modem technologies. Taken rogetber, map projections to manage positional error
the precise coordinates at these high-accu- according to our sparial error 1olernnce.
racy points and the malhema1icaHy speci- Historically. rhe irregular shape of the
fied globe define ageodelic datum. Datums Eanh and sparse. m1connected S11rvey net-
are the fouodarion of sub5equenr posirions. works were large sources of t.mcenainry in
wi1h all other measurements made relative defining coordinate systems, coupled wilh
10 datums. limits on our measuremenr tochnologies.
Spatial data spanning large distances For centuries we assumed a spherical or
are ollen specified in la1itude/longit11de ellipsoidal shape for the Earth. but depar-
coordinares. Tn,e surface disiance. flight mres of lhe real Esnh shape from lhese ide-
distance and direction, and large areas are
alized globes led to different .es1ima1es of
more accurnrely specified when facroring coordinate locations. Surveys were difficuh
in the Eanh 's, cuivamre. over large areas, depending on oprical and
laborious physical measurements. In one
section of the Great Arc survey across
90 GIS Fundamentals
India, over half the surveyors died of illness, for tee.tonic plate movements. \Ve must con-
acciden1s. or preda1ion. As our measure- sider the epoch, or time of positional mea-
ments and analysis slowly improved. we sure me 111, to accurately place coordinates.
were able to quantify and integrate Eartb•s There is another factor tbat adds confu.
irregular shape, and our positional estimates
and acc11racies improved thro11gh time.
iion. We may choose many diffc-rem ways 10
project coordinates from our curved Earth
Wl1e11 sufficient new survey data accn1e. ellipsoid onto a Oat. Cartesian map surface.
we perfonn network-wide datum adjus1- We choose different map projec1ions 10 man-
me111s. The datum adjustment reconciles age this spatial distortion. Jnterp.reting a data
errors across the set of measuremenls, weed· set's coordinate system can be confusing
ing out blunders and mathemaiically disU'ib- without knowledge of map projections.
uting uncertainty across the network. A An example may help. Figure 3- l shows
datum adjustmen~ is viewed as our best esti- the location of a U.S. survey ma.rk. a pre-
mate of measured locations 11p to that time. cisely surveyed and monumeme-d point
Each datum adjus.tment results in a re-calcu- Coordinates for this point. measured by fed-
lation of coordinates for all existing datum eral aod srate govemment surveyors. are
points. as new pofots influenced the esti· shown at tbe top right of the fig,.ire. There
mates of previous loca1ions. are four different versions of the latitude/
Coordinate systems are also compli· longimde location for this point. The GIS
cated by large 1ectonic crnstal movements. practitioner no,y well ask. which latinidel
Continent-sized plates on tlte Earth's surface longitude pair should 1 use? This chapter
drill, rotate, and rise or fall. meaning all should allow you decide.
coordinates change tl1rough time. Prior to Note that there are also several versions
the l 960s, we viewed the continents as static of the Cartesian x and y coordinates for the
because we couldn' t detect contifiefilal dlifL point in Figure 3-1. The differences in the
Widely·aYllilable technologies now provide coordinate.values are too great to be due
centimeter-level accuracies over global dis- solely to datum diffe.rences or measurement
tances. so most orus can measure and adjust
shown in Figure 3-4. There was substantial Earth's curvature was greater near the Equ.1-
disagreemeui on the exis1ence and amoun1 of tor than a1 higher la1i10des.
n.,uening. Comple.,. repeated. aod highly Measurement effons through the 19th
accurate me.asurements established that an
ellipsoid was 1he best geometric model of and 20th cenmries led 10 a S<lt of official
ellipsoids with vario11$ equawrial and polar
the Eanh ·s surface (Figure 3-S). because 1he radii. Because early surveys could not span
the oceans. ellipsoidal para111e1ers were fit
for each co1umy. con1inem. or comparably
Ellipsoid
pole
l • ~
.. , . (y-y.i
"' large survey area. The Clarke 186o ellipsoid
was conunonly used in North America. and
is Oaner than 1he ellipsoid ,ve use 1oday. The
Bessel ellipsoid. common ill Europe. also
specified radii somewl1a1 diiferen1 than
1oday's bes, global estimates.
Since the 1980s. impro1r.red me.asure·
mems have yielde<l extremely accura1e
global measurements. Ellipsoids such as 1be
fl,otte,ung
GRSSO provide a "best" overall Iii to
foctor observe<! measurements across the globe.
and are now widely use<!.
pol<
Modern Datum Definition
Fiiu1•t .l-4: Au ellipsoidal model of tltc Eat1h's To create a darum. we must complement
tMpc.
our accurate ellipsoid by establishing a lati·
mde/lougimde net on the Earth ·s surface.
An ellipsoid is defined ,n port and estimating the coordinates for a set of
by two radii. o and b predsely measured datum poinls in this net.
His1orically. we've established our la1imde/
We may use the relotionship longinide S}~tem relati\'e to far-off stars. We
d • r · I) •o estimate rod11 use celestial bodies because their relative
positions are invariaOJ 1hrougll time: the
O• ~ angle measure<! be1ween tw() stars is
a, unchanged over the course of human histoiy.
By making precise measuremen1s to stars
from differem locations on Earth we can
w e establish the relative position of points on
~ ....,/
the E.anb's surface. Our zero longimde
.... \ n passes near 1he Greenwich Observatory
6
N ,
·, ...
;
..•.
,,.,,,••' } because it was the first place with a large
l ._ : ······ 0 enough ca1alog of celes1ial measurements 10
•. :: ... d,
"J· .. .. calcula1e latitudes and longi'mdes over much
e, .. s of 1he Eanb.
Establishing latimde/loogimde lines and
measuring point c-oordiua1es is complicated
Fia:m~ 3-5: Two arc5 i.llustratc the: su.r&cc m~- because 1be Earth sys1em moves in mulliple
s,11e:1ncnis aod c11kult11ioru med co cttim1uc the dimensions. The EMth spins 011 its axis. with
scroi,nujor and sc:ini,mil\Or 1U.c1. 11,c: art some precession or "wobble." while a1 the
length, m.1y be mc:.isorcd by , urfocc sun·~·,,
11od the angle$ from a,tronomical obsc:r,aoom. same time orbiting the Slut. Although
unknown prior 10 o,et 960s. large 1ectonic
94 GIS Fundamentals
plates slide, 1wist. and tilt through time over same point measured at a different time.
the surface of the Earth. so any object fixed Mis·alignmem will be greater for fumre
to the surface of the Eanh moves relative to measurements. and any data collected a1 dif-
mosl other fixed objects. ferent times won't overlay correclly. Oigi·
1i2ed locations of power poles o,igh1 end up
Before we discovered plate tectonics.
we assumed the conlinents were stationary three feet into a road in an independent
and 1ha1 we could establish a la1i111de/lougi· streets layer. or buildings might .appear to be
tude grid, fixed for au time with a zero lon- in rivers.
gitude passing through the Greenwich Given plate movemems. how do we
Observatory. Most points were considered establish a coherent datum framework for
static. and the main problems were thought measuru,g locations across 1be globe? Mos,
to be improving 1,ueasu.rement density. accu- countries are adopting 1.he concep1 of
racy, and coherence across all continents. dynamic datums and fixed epocb position·
The discovery of large plate moveme,n com- ing. where we keep 1rnck of the-epoch of
plicated this notion. Figure 3-6 illustrates coordinate measure so tbat data may be
point movement across time periods. called transformed to a common time. ·w e measure
epochs in geodesy. II shows a cominuously the location oflhe con1inems through time 10
operating. f.xed OPS station's drift through millimeter accuracies and we can calculate
time. in th.is case moving more than a meter the shift of coordinate locations from any
(3 feet) over 26 years. GIS data collected one time period 10 another using veloci1y
with an unknown epoch over this period models. These models are based! on a series
could diverge by <>ver tliree feet from the of measurements of the speed and direction
of drift for a network of poin!S. AS with dead
reckoning. we can extrapolate funire posi-
Point Pos,toor,, Epoch lion based on pasl velocilies ru1d din:clions,
and Shrf1 through time, filling in gaps between point re-.measure·
Trek Point, Orange ments. Our data can then overlay to within
County, CA the accuracie.s of our me.asurements. as
we've removed most of the posi tionaJ differ.
ence due 10 tectonic plate mo,•eoiems.
Most modem datums are at their base
dependem on a global measure,nent infra-
simcmre that consists of four main systems
(Figure 3-7): Very Long Range Baseline
106 cm, or Interferomeuy (VLBI). Satellite Laser
3 .47 ft at Ranging (SLR). Doppler Ort>itography by
N•OOZs·rw Radiopositioning Integrated by Satellite
\ (DORIS). and Gloool Naviga1io11 Sa1elli1e
Systems (GNSS). These systems allow us 10
precisely locate point locations and shifts on
tectonic plates through time.
...~ VLS I may be viewed as the base mea-
suremeuJ in es.1ablisbing our datums. A
global network of 3o+ radio telescopes
\ / receive signals front distant quasars (Figure
3-7). The radio signals vary so that a specific
Fia-u1•t 3-6: Au cxa:inplc of coordioatc drift with pa11en1 released a1 10 by the quasar may be
epoch. ot 1ncti1u1etun11 1imc. T11c coordi~tcs for
a pttciuly S1m'c)'t c.lP,0i111 shift dsJ?uJb time, wid1 recorded at different limes across the globe
the epoch for each pomt shown ~dJa,c,:nt to loc~· (1 1 through t~). Correlated signal analysis
tion (councsy R. Maher. KDM Maidi.an). allows calcula1ing the relati"e position
Chapter 3: Geodesy. Projections, and Coordinate Systems 95
QuDSclfS
Fundamental Measurements for the
ITRS/ ITRF Systems
~ VLB I ~
VLSI
DORIS ;::/ ,
~
antenna
@
DORIS
SLR ~
~NSS
vl
sat. ant.
SLR
®6)
s.1t. ont.
SLR , ?
@ GNSS
GNSS~
~
satellite
between these radio telescopes at any one station locations and mo,•ements. particu-
1ime to within millimeters, or hundredths of larly venical changes associated with crustal
inches. Funher. VLBI and related technolo- defonnaiion. ai near milHmeu~r levels.
gies can measure station velocities. both Taken together. these measurements are
absolutely, ai!ld between any one s1ation and the backbone of the lmematioual Terrruuial
any other. This provides highly accurate Reference~,,,.,,, (ITRS), and allow 1he cal-
measurements of 1.he Earth's rotation and culation of the recognized global datmns
variation, the orientation of the polar axis. known as lb.e J111ema1io11al Ten-estrial Refer·
the relative positions of the stations on the e11ce Fr(Tme.s (ITRF). mentioned earlier.
conrinems. aod crustal movements. again. all These reference frames defi:ne the precise
at millimeter ac<:uracies. locations and velocities of sunions as darum
These positional measurements are aug- poims relative to a standard larirude/longj-
mented by othersystems, mosl notably tude nel, at a gi\'en point in time (Figure 3-
DORIS. GNSS. and SLR. The DORIS sys- 8). This last qualification is imponant. that
tem measures Doppler silifts in signals sem the points are only specified at a given point
to satellites. t\'hich allows us to precisely in time. or measurement epoch, because the
locate appro,ciruately 100 ground stations. system is dynamic: the stations are all mov.
GNSS mcamre signals broadcas1 from con, ing relative 10 each other. \\'e can quamify
stellations of satellites to networks of lnm- the velocities and directions: of each mea.
dreds of fixed receivers s1a1ioned across the surement point. iu effect quantifying conti·
globe. quami:fying station location through ncntal drift (Figure 3-8). We can
time. The SLR measures laser travel 1imes to approximate where any point was at a previ.
and from orbiting satellites to estimate SLR ous time by applying these velocities to posi-
96 GIS Fundamentals
Fi&m·(' 3..$: TcclC)l1ic pl~lcs for ~ port:ion of the Ear1h. ~nd :a , ubsct of mcas1u-cd po$itioo$ iom the CTR.$,
t.bowUlg tbc m<:a~wcd direction and relative ,·clocity ofthifts. Note that the n:locit)• and direction of shitt$
vary subs.autially ,.,.;thin and runoug pbtcs.
ments fro m fLxed GNSS measurement instal- updates. This is because GPS became widely
lations, e.g. the U.S. Co11tinuously Operating available toward the end of t he 1983 update.
Reference Stations (CORS. Figure 3-9). a TbeGPS data had higher aC'Cttracies than
network of several hundred fixed GPS sta- those measured using traditional methods.
1ious disrribu1ed across the Nonb America. with improved speed and lowtr coses. How-
or the Canadi.an Active Control Stations ever. a nationwide integration of e.xisting
(CACS) in Canada. where the datum is sur\leys and new OPS observations was
known as NAD83(CSRS). NAD83(201 I) is untried, with unknown cost and speed. In
based 011 the GRSSO ellipsoid. but rite cemer response. NGS collaborated! with states 10
of the ellipsoid is offset by more than a creale limited-area High Accuracy Reference
meter from our bes1 estimate of the Earth's Networks (HARNS), also known as High
center of mass. Tllis was adopted to maintain Pre,;ision Geodetic Networks (HPGN). Geo·
consisteJtcy witb preceding d:tnuns. and to erally, there is a different NAD83(HARN)
reduce addj1ion.1.l computations and confu. for each state or small groups of states. The
sion among users. HARN and subsequent NAD83 uation";de
The NAD83(2011 ) includes recalculated adjustments are largely satellite-based. and
locations of t'111S of thousands oflegacy mark lhe transition from physical and op1ical
'"passive" control poinis. the traditional stan- surveying 10 GPS/GNSS surveying. As the
ing points for most local surveys. These sur- GPS·CORS network was extended. it
vey marks were monumented l'O serve as allowe.d more frequent datum improve-
local staning points. or be11ch m(lrks when m'1lltS. and geodesists assimilated the new
part of precis,,e vertical surveys. Marks often data and me1l1ods into a coherent continental
consist of a metal disk embedded in rock or analysis. over a series of datums:
concrete (Figure 3· 10). Their coordinates NAD83(CORS93). NAD83{CORS94).
NAD83(CORS96). NAD83'(NSRS2007).
and other charat1eristics are maintained,
often in an online,accessible database. The and finally the NA!>$l(20i i) we use today.
median network accuracy for control points Note that the NGS and others sometin1es \!Se
was reponed as better than I cm. altemative designations with only the darum
date. e.g .. NAD83( 1996) is ,equivalent to
There are a series of horizontal danims NAD83(CORS96).
released for North America between 1989
and 2007 thal were precursors to
NADS3(20 1L). remarkable given that almost NATRF2022
sixty years passed between prior dan1m The United States is t11111sitioning to a
set of sateHite-OOsed, dyn(lmic refereuce
frames, called the Nonlt American Tern?S-
n'ial Referonce Frame o/2()]1
(NATRF2022). T11is system is substantially
different from tl1e NA083(201 I). and differ.
ent from but related 10 tl1e ITRF. As with the
ITRF. the NATRF2022 also integrates tirne-
dependency in darum points, and is based in
pan on ITRS measurements. to detem1iue
positions. The system identufies darurn
points with epoch coordinates. reporting
point locations for a specific date/time. and
calculates periodic updates {or the point
coordinates. e.g .• ar five or ren year da1es.
Praclitioners may use epoch.coordinates as a
Fi.aut•t .,l..10: A bronze disk used 10 mouumclll 11 basis for subsequently positioning surveys or
sun·cy mntk.
98 GIS Fundamentals
digitizing points. lines, orpolygons, and for this and other reasons. neither the
may adjus1 10 a s1andard epoch. ITRF nor the NATRF2022 maiutain a zero
NATRF2022 will also identify continu. longitude through the Greeimicb Observa-
ously measured dauun poims. providing tory. Rather. both the NATRF2022 and ITRF
systems adop1 latimdenongitude nets tied to
near real-1ime coordinates and \'eloci1ies for
a se.t ofloc.atfons. Most of these locations celestial frames. These latimde/l;ongimde
will coincide with CORS bose s1atious. If nets have an origin al !he mass c.enler of the
using appropriate GNSS/GPS teclu1ologies. Earth. spin around the pole, and orllit the
new field data ma.y reference the CORS bose Sun according 10 precise measurements and
stations directly. calculating positions for the models. We might think of lhese as imagi-
current 1ime and date. The epoch may be nary \\ire frame of latitude and longitude
recorded in the m.etadata. and positions may lines that are tied 10 the mass ce11ter oftbe
be easily adjusted. to an epoch corresponding Earth, with the tectonic pla1es sliding under
to other data. the wire frame. The zero longitude passes
near. b111 DOI through the Greenwich Obser-
The NATRF2022 and ITRF coordinates vatory. ruid this loogimde drifts slowly
typically are 001 lhe same. although they through time, as the plate containing the
both at their base i11clude measurements Greeu"icb ObserVlltory drifts uuder our lati·
from the 1.ntemational Terrestrial Reference tude/longinide frame. All points will drift
System. The ITRF uses a larger set of Sia· with their plates.
tions to estimate con1inen1al plate dynamics
and average the pfate movements. and hence
A main question is to what do we attach
the two systems use different relative point the latitude/longimde wire frame? The larg-
velocities. The U. S. NGS adopts a conven- est NATRF2022 datum is tied 10 the main
tion that reduces coordinate velocities in North American Plate. meaning chat in a sta-
Nordi A1netita. and bente leng11ieus Ille listical sense the net mo\'ement for a set of
interval between required coordinate recal- points distributed across t11e Nonh American
culation. Some backgromJd helps understand plate averages to zero. Points Oil this plate
this choice. will move. but by small amounts. and bal-
ance each other out across the plate. This
If all our modem datums beld the longi- yields relatively slow velocities for points ou
tude fixed for a zero line tltrough the Green· lhe main plate. yielding measured c,oordi-
wicb Observa10ry. tben all other point nates that have small epoch differences for
coordinates across tbe globe would drift wilh longer periods. Titere is a less frequent need
the continents, some quite rapidly. and in to adjust NATRF2022 coordinate locations
different directions. Drift would be tower in on the main North American plate to accounl
England. and largest for tbe plate mo,•ing for coutinental drift. Under this scheme.
fastest relative to England. This wouldn't be coordinates on other plates move al rela-
the bes1 choice for most cow1tries. because it tively higher velocities. but the NATRF2022
would require more frequent calculations of is designed for U.S. territories.
new positions. For example, in projects
where accuracies of 30 cm would suffice. The ITRF datums are different. Move-
with a Greenwieh-fixoo Olongin1de. the con- ments are averaged globolly. so '!be swumed
tinental drift wouM surpass conunon accu- movement of a set of points distributed
racy threshold in .a decade or less. and data across all lbe Earth's large plates is zero.
will mis-align much sooner for much of the This means that even though the ITRf and
globe. We could l>ack-calculate to a stand.1rd NATRF2022 may inilially align. 1hey will
epoch so tliat data matche<l but would have diverge over time. At some time in the
to recalculate any time data were collected future, the government bodies " 'ill have to
more than a few years apan. incre~1sing work adjus1 for this divergence. bm tbat is pro-
and the c.hance for confusion or errors. jected 10 be seve.ral decades hence. given the
estimated rates of dive.rgence.
Chapter 3: Geodesy. Projections, and Coordinate Systems 99
WGS84 Datums
The World Geodetic System of1984
(WGS84) is a third set of datums, developed
and primarily used by the U.S. Depanmem
of Defense (DOD). II was introduced in
1987 and is u.sed in most DOD maps and
positional da1a. The datum is periodically
upd.11ed. e.g.. WGS84(1 674), and
WGS84(G I7,62). and there will likely be
more adjustments in Ille filntre. The ITRF
and WGS84 damntS have been aligned since
1995. and caa be considered equivalent for survey leg
most. bm 1101 au. G!S applica1ioos. Differ· G
ences betwttJl these datums are generaUy Fljttrt 3· 11: An eXAtUplc tnl'ln3t1b1iol) tuJ'\'C)'
only a few ce.nlimeters since 1995. m:twork. New posi1ions m,,y bc1 c::ikulntcd \'i:t
t"'3rious path&. rc:,·ealiog bluodm aud ioc:«-asiag
the acc:Uf'D.ey o( the sur,·c)'cd pc:,,sitious.
100 GIS Fundamentals
f
NA027
0 10 2() 40
Meters
Fi&lll't 3-U : Datum shifts in the ..»ordinates of a point for some ~rumon dllnmu. Note that !:be cstitwitc of
coordina1c po1itioo shi.tu approxim:udy 36 m from 1hc NAD27 GO t.bc NA.083(1986) danun. wlli.Jr: lhr: tbifls
2inoni NAD83 \'r::r$iou• 11.rr: 0.05 1n0r lr:u. Tbr: sbifi: 10 \VGS84(01 lSO) is aliO shown, here :l))pro.<timntd )•
().9S m. Cnkuln1ions o.re b:ucd on NOS data dtccts. NCAT. and HTOP softw.itrc.
Chapter 3: Geodesy. Projections, and Coordinate Systems 101
shifts fromearly, regional datums to modem, ered equivalent for many pwposes. The
global datums. The figure also shows 1be WGS84(G730) was aligned wilh !he ITRF92
subseq11e111ly smaller shifts for NAD83 damm. so 1bese may be substinned in dan,m
datums throug h lime. and relatively larger transfonuations requiring no better than cen.
dis1auce between NAD83 and WGS84/ITRF time1er accurades. Similarly. !he
datums. The figure doesn' t show 1he rela- WGS84(GI 150) and ITRFOO datums have
tively laige shifts from the NA083(201 I) to been aligned, and may be.substituted in most
the NATRF2022. bu1 is expected 10 be simi· subseque111 transforu1a1ions.
lar 10 1he shift from NAD83 10 WGS84. While locations in ~,e NA083(xxxx)
Datums shifts associated with datum and 1be ITRF/\VGS84 da1wns conunonly
1rausfonnations have changed wilb each suc- differ by over a meter. dan,m shifts internal
cessive daruin reali.zation. as summarii.ed in to these groupings have bec-0me small for
Figure 3· 13. Several datum pairs are cousid· recent datums. Differences between
1986 ITRF88
!TSF I
~In~
~ 10 I m O<IC" to ltw
• )'(:In, Chol\)U ol
1989 NA083(HARN/HPGN) a'"'""
I \ , WGS8A(Gno)
:ir ks.s.
1
I
&Jf~2~.
1994 :,:Z':;'""· .,,,._,.,."'
Mi;pro2'0crn upto2mccn
be~...ccn Oil t.:AO 8J
I
-WGS84(G730) - "'"' -
<lOtm
· ITRF92
1998
2001
,
,,::I3:RS96)
/
"'"'"'''" coos......
tu.Oe3 l"t'Cll.~ 9r.f'ICl'd ry
\ '-WGS84(G873)
\ !
WGS84(Gll50)- & !MOO
::c:~,
-~:'°'~ ITRF94
-
ol,;ncd. dffcl"MC.n
ITRFOO
I
I
2 CfflOI" .CU
l ..., ....,....,
DfTWHl'I INS
ITRF05
2011 NAD83(NSRS2007) ..,,.
:mr
/'''""'"'"" ITRF0 8
I
2012
I
NA083(CORS2011)
Zmer<'n
, - dt'UCF"« ,
'-.. WGS84(Gt762)
I
1
dffC"l"ICtt \.P
~
,o z"'
tr.!Mf 10 ~ MTiITTOll
\
,
I
!TRF14
Flgurf' 3-13: This gr,,tphic $~11nlll«ritc:$ the e,-ohnion of the three nui1i families of d.1t'Wn1 used in N0rth
Ammc.,. AJ tl1c datums b~·e been adjl1stcd. borizootal p<>sition,,l cliffcicnc« betw«o a.u.n·ey marks b"vc
varied. witlllll the nmges shown. ..Alig:ocd'' d11turus(c:.g:., WGSS4(01150) and ITRFOO) may be coosidCTCd
equi,'llkn1 for-mos1 putpOkS \\1tcn applyini cl.an.nu tnuufom'3tioits.
102 GIS Fundamentals
NAD83(HARN) and NADS3(xxxx) datwus threshold. You should verify die magnitude
may be up to 20 cm. but are cypically less of datum shifts between any dallllll pair. and
than 4 cm. so these danun realizations may apply a danim transformation if the shlft
be c-0nsidered equivaJen.l if accuracy limits approaches your error tolerance.
are above 20 cm. The different~ between
NADS3(CORS96) and NAD83(2011) are
often a few c,en1imeters, as are. the differ. Datum Transformations
ences among ITRF realizations. Converting coordinates from one datum
We must e1nphasize a few points about 10 another is t}]Jically done usin,g a datum
datums. First, different datums specify dif- t,rmsformntlon. A danim transfonnation
fereot coordinate systems. You do no1 e.,pec1 provides the latitude and longi1ude of a point
coordinates for any physical point to be the 1n one darum when we know lhem in another
same when they are expressed relative to dif· darum: for example. we can calculate the lat-
feren, danuns. itude and longitude of a survey mark in
NAD83(201I) epoch 2020.00 "·hen we
Second. different versions within a know these geographic coordinates in
danim family are different danuns. ITRFOS epoch 2020.0 (Figure 3-14).
NAD83(1996) is a different realization than
NAD83(20t I). and ITRFS8 is different 1han Datum transformations are often more
ITRF05. The datum is incompletely speci- complicated when they involve. older
fied unless tbe ve:rsion is noted. Many GIS datums. Many olderdatums were created
data sets refer to a datum without the ver- piecemeal to op1itnize fit for a country or
sion. for example. NAD83. This is indeter- continent. so simple formuL1s often do not
minate and confusing. and shouldJ1 ·, be exist for transfonnations among many older
practiced. it forces tbe user to work wil.h daru,ns. Specialized danm, lfansformations
a,nbiguil)I. which leads to blundtrs. may be.provided. us1mlly by gO\'l!nunent
agencies. As an example. in the 'United
Third, coordinates specified relative 10 States, the National Geodetic Survey created
officially adopted datums in most coun1ries NCAT. a danun transformation tool 10 con-
change through ,•ersions. The NATRF2022 vert between various NAD datums.
positions diverge from NAD83(20 l l ) posi-
tions by I to 1.5 meters for mosl poinls in Transformation among ne\ver danuns
North America. The NAD83(1986) datum may use more general mathematical trans-
realization is up to two meters differeot than fonuations between three-dimensional. Car-
the NAD83(CORS96). Differences in datum tesian coordin.11e systems (Fig11re 3-14).
realiza1ions depend on the versions and loca· Transformation equations allow conversion
tion on Earth. Data may 001 overlap cor- among mos, NAD83. WGS84. and ITRF
rec,tly for differenl datum versions. even if geographic coordinate systems. .aud are sup-
you use siandard ,datums es1ablisbed for a ported in large part by the improved global
country. measureme111s described in the previous few
pages. This approach incorpomt,es a shift in
Finally. all data layers used in an analy- the origin. a rotation, and a change in scale
sis should be converted 10 tlte same danim. from one datum to another.
unless you are certain that the shifts between
datums are smaller than the accuracy A danuu transformation is typically a
requirements for your analysis. You don·, m~rlti·Step process. ln past times, empirical,
need toconven all NA083(CORS96) data 10 gnd-based methods have been u.sed because
the NAD83(20 ti) dattun if your accuracy many early datt1ms were DOI strictly derived
requirements are 1.5 meters. as 1be danim from coherent malhematical surfaces. Later,
shift is rypically a few centimeters, but you a J\1olodei,sAy 1ra11iforma1ion was common.
do need to con\'en NAD83(CORS96) data to using a system of equations \\itb three or
ITRF(2008) when applying the same error five parameters. More currendy.. a Helmen
rra11sfor11urtfo11 is employed using seven or
Chapter 3: Geodesy. Projections, and Coordinate Systems 103
to
)(. Y. Z
z
cl""' d) rotate
x·.v·.z·
Z'
Z' :'
/ •
•• >.: •••
z'
... ··~.. ...i.··.··'.··-
- ~ ::;.,,____, ..
.,. Y'
..x·.... ...•
.
y•
)('
•
:,
ti;urt 3-J,1; Applic.,tioo of 3 modem 00.twn trnmfoniution, Oc:ogmpbic coordii1,11tcs (longitude. A. and
la1m1dc. (,) ate tninsfom1cd 10 a uew datum. b)•o) couwnion from scoarapbic to Cartcslan ~diuatcs lU
the old d.11.t\1.111 {d\tOU3b a sci ofcqu.1t-i01li th:u arc l'lOI d1own), b) appl>•illt al'l oritin shift,<) 11eAI.U11,
d) rotnting tbcsc f.hificd coordin.,t~ llOO e) converting a.hue lattct datum Cnrtcsuu, coordinates. X. Y. z·,
to the longihllie and latitude. ')..'. (i. in tbe t:1rget datum.
14 parameters (Figure 3-14). First. geo- Because of tectonic plate movement. the
graphic coorc!linates on the source datum are most precise geodetic measurements refer to
convened from longirode (1..) and latirode the epoch. or ftxed time period. at wbich the
(cj,) to X. Y, and Z Cartesian coordinates. An point was measured or datuL1l fit. The HTDP
origin shift (translation). rotation. and scale software includes options to calculate lhe
are appLied. This system produces new x·. shift in a location due 10 differem reference
Y . and z· coordinates in tbe 1arget datum. darums [for example. NAD83(CORS96) to
These x·.Y'. and Z' Cartesian coordinates are IVGS84(GI 150)]. the shift due to different
then convened back 10 the lougintdes and realizations of a damm [for.example.
la1in1des (): and f) in the target datum. NAD83(CORS96) 10 NAD83(2011 )]. 1be
More advanced 01etl1ods allow these shift due to measurements in different
seven transformation parnme1ers to change epochs (for exan1ple. NA083(CORS96)
through time,. to account for tectonic and epoch 1997.010 NAD83(CORS96) ei>och
other shifts. for a 101al of 14 parameters. 2010.0). and the differences due to all three
These methods are incorporated into soft- factors.
ware that c.alculate 1.ransfom1ations among Summarizing key insights. datums rep-
modem datums, for example. the Horizou1al resent our best estimate of weU~measured
Tirue-Dependen1 Positioning (HTDP) tool coordinate locations at a given time. and
available from the U.S. NOS coordinates in one datum may differ from
(""'"·ngs. no aa.govrroo LS/Htdpi coordinates in tlle previous datum for the
Htdp.shtml). HTDP converts among recem same poims. These diffel'l.~>ces may be small
NA083 dan,nis and most ITRF and WGS84 and ignored with little penalty in some spe·
datums. cific instances. typically wh.en the changes
104 GIS Fundamentals
between datums are smaller th.an the spatial Finally. GNSS positioning i·s good
accuracy required for our analysis. However. enough that we can measure the change in
many da111m shifts are quite large, up 10 tens position over relatively sbon 1ime periods.
of meters. One sltould know tJ1e magnitude so the same feature measured at two differ-
of the datum shifts for the area and datum ent da1es. even with the same dacum. will
transfom1a1ions of interest. have two differem sets of coordinates.
Second. we should use the appropriate Unless proven othenvise, all data should be
da111m transfonnation. There is no generic couvened to the same coordinate system,
transfom1ation between NAOS3 and based 011 the same danun. at the :same epoch
WGS84. Rathcr.11,ere are transfonuations date, or time period. We must note the epoch
between specific "crsions of each. for exam- of data collection, and convert all data 10 a
ple. from NAOSJ.(CORS96) to standard epoch. uuless we know that the
WGS84( 1I 50). Our default practice should coordinate drift is less than accwacy needs.
be to use die proper da1um 1ransfonruuion. If not, dalll may misalign. We should record
for the proper version. 10 transform all data the original danam under which ·1he data
to the same datwn. were collecte<I, and d1e iype andl method of
any datum transfonnation applied to the
data.
Chapter 3: Geodesy. Projections, and Coordinate Systems 105
Heights
TheGeold
We often need 10 specify the eleva1ioo of
objec1s. e.g.. ·10 measure slope. drop. or visi·
bility. We adopt a gravity.related vertic.al
reference called a geoid as a s1aniog poinl
for measuring heiglus (Figure 3· IS). We
don1 t use the ellipsoid as a height reference
because Earth densi1y and hence gravi1y
varia1io11s cause regions 10 dip below or
bulge abo\'e a reference ellipsoid. The
world' s oceans. our historical reference for
heigb1. approximaiely follow a gravi1a1ion·
ally·defined geoid surface.
The geoidal surface ""Y
be 1hough1 of
as an imaginary sea thal covers the entire
Eartn. affected only by Earth's gra,;1y. This
geoid rises above or dips below lhe Eartli's FJaurt 3~1.S: lkpic1ioos oftbc. E:uth ' s lf!"''i1y
field, M c"im..,tcd &om 5tltcUitc musurcmcnl$.
ellipsoid by up 10 I00 me1ers across 1he Tbe, c sliow 1be uodulation1,, g~-11dy e:<aJgCl'3tcd.
globe (Figure 3-16). Allhough ii may al 6rs1 iu the Earth's a,nwity. and hence the a:co1d (cour-
seem diflicul110 believe. me "average" tesy Unl\•usiiy of Texas C«itcr for Space
R.c•cn:n:b, nnd NASA) .
ocean surface near Iceland is more limn 150
me1ers ''higher" than the ocean surface
Fta,u.,. 3~16: Owid~1 bei1lu.1: \'lll)' :tcrou tbe globe. This figure depicts potiti\'e gcoi<bl beiilus in lig]Hcr
tones (gcoid above die elhpsoid) ru.d oeg11.1h·e 1-eoi<bl heights in darker lonet. Note lhat geo1clal beig.l11s ~re
posi1h·c for large arc-u ocar Iceland and lhc Philippines (A aod S. rc,spccti,·ely). while large ucgatin: values
att found south of India (C), Coiuinnual nnd OOU1nry botdo"S arc sho,,,, in ,,1lite.
106 GIS Fundamentals
nonheasl of Jamaka when measured relative The geoid is a measured and interpo-
10 our mathema1ical ellipsoid. Since gravity lated surface: uulike an ellipsoid . the geoidal
pulls in a direcrio:u that is perpendicular 10 surface is not defined by a simple mathemat-
the geoidal surface. and !he pull of gravity ical equation. We use a number of metJ1ods
near the ellipsoid surface is stronger in some to measure the geoid, historically with trigo-
areas than odiers. these geoid variations rep- nometric le,•eling. and more in modem times
resent persistent bulges and dips in the mean with various types of grm·imete,-s (Figure 3-
ocean height above and below our mathe- 17). These devices me&S\lle the absolu1e or
matically-defined. ellipsoid. Variation in relative gravitational force. and are placed or
ocean heights due to swells and wind-driven
waves are more apparent at local scales, but
are much smaller than the long-distance
geoidal undulations.
We define a geoid using a three-dimen-
sional eq11lp()le111inl surface. along which the
pull of gravity is a specified constant. or
more precisely. the gravita1ional potential
energy is at a con:stam. We refer 10 an equal
pull of gravity when describing the geoidal
equipotential surface, a familiar non-mathe-
matical description. Geodesists use the more
precise, physically defined term gra1'irn-
1io11al potential. r.-elated
to the amount of
work required to rnove au object against 1be
Earth's gravitatioruil force. Pure water in a
lake or ocean. absent wind or other forces,
will have a surface that c:onfonns to a gravi·
tational equipolemtial surface. Absent these
other forces, the gravitational force 1'leve1s·· Fls;urt .l.17: A po11.1blt fic:ld p,'1nte1cr, a:ii iru1n1,
the water. u1eut used for meauiriog gnm.1ationol force MII
field loe,atiou (l)a,id Morutiaux).
H ___.,,--- Orthome1nc
·
height
e\\ipsoidOI ne,g'nt ~ -- ~ ---------••'?.eoid
,
,,,,.-------- ,l N ~- ·;;:;;,;,;;~;-···-_,
ellipsoid. e.g. GRsao fl \,
_... -- ----- ,/
'·
Flautt J..18: EUipsoitbl. or1bome1ric, 1\nd gcoi:d.'1.I bc:igJus a.re in1c:m::btcd. Valucs for N are highly cx..,j•
gcrnted in this figi.u-c - Ibey .are typic:.all)' much leq: than H , We use this formuln lo c~lcul~tc ortbometnc
hcipu (elC'\-atioo) when we know d1e elli.psoidal Ji.eight (coomionly from OPS). and g,eoidal bciglu (from
uatioillll models).
Chapter 3: Geodesy. Projections, and Coordinate Systems 107
c.arried on or above the Earth's surface. surfaces (figure 3· 19). This height line may
Recent iucreases in airtiome gravimetry and bend due to small undulations in the equipo-
various sateUjte systems have substantiaUy tential surfaces. The height paths are 001 tile
improved our measurements of the geoidal same as a straight line normal to the ellipsoid
surface. and up to the surface. and not the same as a
Sec.ause we have two refe.rence sur- straight line tltat is nonnal to the geoid sur·
faces. a geiod and au ellipsoid, we also have face at 1he slarting point. bu l lhe differe.nces
two different kinds of heights. Elevaiion is between these paths are usu.ally quite small.
typicaUy defined as the distauce above a We emphasize that a gooidal surface
geoid, often caUed height above meau sea usually differs from the average sea height
level. but more unambiguously referred 10 as measured a1 ru1y one location. A local mean
the crtlr<Jmetric height (Figure 3 -IS). sea level may be h.igher or lower 1ban a
Heights above an ellipsoid. or ellipsoi· global mean sea level becau,se 1100.g.ravita·
dal heights (or height above ellipsoid. HAE), tioual forces sttcb as persisteoi winds. ocean
are used in some coordinate system cakula~ currents. temperature. and salinity variations
tions and for some global navigation systems cau cause persistent locally high or low areas
such as GPS. but ellipsoidal heights are not in the oceans. These non,gravitlltional differ-
our standard way to measure elevation. ences in height can be up 10 a meter (3 feet).
These heights are illustrated in Figure 3-1 S. perhaps small on global scale. but significant
with the ellipsoidal height labeled hand for local or regional measti<ements. Our
orthometric height labeled H. The difference geoid follows a gra,<itational equipotential
between the ellipsoidal height and orthomet- surface which best matches. in a least
ric height at any location is shown in Figure squares sense. mean sea level when aver·
J. 1Sas N. and bas various names, including aged across the globe. The local, long-term
geoidal l,eig/,1 and geoidal sepam#o11. average sea height is differem from the
global spatial averaging of sea height
Onhome1ric heigh1s are defined as 1be defined by a geoid at many locations.
verticaJdistamce measured from a reference
geoid to tile ground surface. along a line that
is at right angles to ioterveniog equipotential
equipotential surfaces
orthometric height .. . . . .... .~
)i
.. .. .
.... ~ .. . .
..... i" . _. :"'.:.-... -.
---. . ·•·.. •. . . . .
.. . .. . .. .. .. . . . .. ... .. .. ... . . .....·..
. . ......
. ....
•.
.
. ..... . . . . ...... • . ... . . . ...
. ,-
.. r : .......·,.. ··~ .._. . .
., . ... -·. . . ..
. . .. . . . ... . . - . ..._...-~ .
-
•
. - ·- •.
. .· ... ...
- ":
.. .. -. ·, . . . . . .. .. . . . . . : . .. . -: .
- geold, W •conston1 volue~.c·.i.~--~-·- ~-- - - - - - - ·
"
0
.
Figort- J-19; 1'Ceigb1& in. North America are refc:reo«d to~ geoid. corm-pondiug to a givm equipotential ,ur-
facc. All the points Oil ao equipotential sudacc luwe the: saiue ira\'itatiorul puJI. Surfaces m11y be c1wi1ioocd
as faye11 with d ec~ :uing l trcngth lll di$tllnCc inc-re211cs from the E2.11h ' 1 m.,.u center: Heights arc uru11ll>•s~c·
ifted o., 011lio1n-etric, mc-.inin~ 11t ri~11 anslc, to nU cquipotcn1:ial , 1.1rfnces along 11lcir p;ath: BccAtlSe potenMl
sutfae<s may tmdula1c. orthoructric J>Clghts may be cur, ed liuc&.. although urua.lly only slig)ldy so.
1
108 GIS Fundamentals
Vertical Datums and Heights cisely meas.ured from there to other points.
Vertical angles and surface distance mea-
To specify heights we define a reference surements are used 10 estimate horizooatal
surface called a ,,enical dntum, and specify and vertical distances at a series of points.
orthometric heighls relative to that surface. Over short distauce. the relative heights can
Modem venical datums l}']lically idemify a be measured with millimetric ac·curacies.
specific geoid, although they may choose Tens of thousands of kilome.ters of vertical
surfaces that average across regions, so do leveling over more than a ceorw:y have pro·
strictly follow tbe geoid surface everywhere. vi<k!d the relative heights for thousands of
We measure gravity 10 estimaie the specific points with.in North America.
geoidal surface. and c.arefully measure orth-
ometric heights on the ground surface. For However. because leveling is difficult to
example, Canada and the U.S. defined the adjust for gravimetric variation between sta·
equipotential surface at a gravity value of tions. and because gravimetric measure.
mems have historically been sparse. long-
W0= 62.636.856.0 m•s·2
distance leveling nelworks provi<le relative1y
as the reference for their future vertical imprecise onhometric heights over the entire
damms. and strive to measure lhis surface network. Recent improvements in measure·
with sufficient accuracy to specify heights ments and analysis have revealed continental
across Nonh America. biases in height measurements and geoidaJ
Governmems have often adopted surface estimates in North America. leading
.. hybrid" geoids that combine their own pre- to current efforts to increase the density and
cise venical surveys with gravity measure- qua lily of gravity measurements to better
ments and models. primarily due to historic specify the geoid.
gaps in gravimelric measurements, and also The current reference geoid~ named
10 ill3lth IO!!g-tefiil pffittice. In illail!llliid GEOIDI 8. tOillbifles GNSS dMa. sa1elli1e
Australia. heights, are relative to measure- gravimeuy. and airborne measurements with
ments averaged over 30 tidal gages spread some lrigonometric leveling data. T11e
along the coast, because they have an NASA GRACE mission. launched in 2002,
approximately I-meter decline in the geoid measured the distance between a pair of sat-
height relative to tidal gage measurements ellites as they orbited the Earth. The satel-
from the northeas.t 10 the southwes.tem part lites were pulled closer or drifted farther
oftbe country. Tlleirvenical damm is from the Earth due 10 variation in the gravity
related to be deviations from a specific field. and precise inter·sate.l,lite distances
geoid. Various European countries adopt allow highly accurate estimates of gravity.
base points near clilferem long-term tidal Because the orbital path changes slightly
gages. or if 1.andlocked. for points related 10 each day. over time we obtained nearly com·
gages in adjac-ent countries. Because tJ1e plete Earth coverage. The ESA GOCE satel-
zero height may dlilfer. you must be caref\11 lite launched in 2009 used precision
when mixing standard heights ac.ross coun- accelerometers to measure gravity-induced
tries. velocity change. Combined, the GRACE and
The most current damm in the U.S. and GOCE observations substautiaUy in1proved
Canada is also a hybrid danm1. combining our estimates of the gravitational field aod
gravimetric and surface measurements in the geoidal shape. averaged over large areas.
official venical damm. They combine past Short-distance variation in gravity fields is
1dgo11ome1ric let~liug smvt9s (Figure 3-20)
provided by surface or airbome measure-
along witll current gravimetric measure· ments. In the U.S.. the GRAV·Omeasure-
ments to establish a vertical reference. Trig- meot campaign qi.,ntified local anomalies
onometric leveling starts at a known by flying gravimeters over sparsely mea-
elevation, often a seaside bench mark. Dis· sured or highly variable regions. Together
tances and elevation differences are pre- with surface gravimetry. we've subslantially
Chapter 3: Geodesy. Projections, and Coordinate Systems 109
horlzontol diST011-Ce .. h
............................................................................
•
~ measured ong e
oonzonu1l ond vtrt,col
• d,sronce-. h or:d v, may
be colculoted by;
h • d · C.OS(U)
v. cl . sill ((I)
Flg11tt ~20,: Lc, ·d ing ~ttn'C)'$ Olien cniploy optic:al 111<':\fl.trcmcnlt of \'ez1ical 1\ng:fc (a.} with u\dlstu·c ·
mc1115 of suril!CC d.ist3JKC (d) and bio,,·fedgc of trigooomc:tric rcfotionsltips to c~lc1.1fa1c boriiout.al di$--
tanec (h) and Ycrticll.l di&tai)Cc (v).
improved our mapping of gravita1ion1l .. sur- Care should be taken ,,11e11 combining
faces in 1be p.as1 20 years. These measure- height data referenced 10 dil'rerem ,·enical
ments led to a seriesof geoid e.stimates and danims. panicularly due 10 rnulliple geoid
vertical datum improvements. with the most updates nea.r the turn of ~te 21st century.
current named the North American Venicai Vertlcal danuns are 1ypicaiiy deveioped ut
Damm of 1988 (NAVDSS). based on concert with horizontal datums. so we
GEOID20tS. should pair our horizontal/v,ertical datums
As with horizontal darums. there are when combining/convening coordinate data
various "lega,cy'' vertical dan1ms, oo longer (Figure 3-21). 1ha1 is. we should conven our
s1andard. bu1 imponant to know about horizontal data to a standard datum. and our
because heights may be referenced to these vertical data 10 1be corresponding ,~nical
datums. The National Geode.tic Vertical datum.
Datum of 1929 (NGVD29) was the first Al ibis writing. the U.S.. is estimating a
widespread danuu in North America. based new modern global geoid. n,imed
almost entirely on trigonometric leveling GE0102022. defining lite g_eopotential field
from seaside benchmarks. This was replaced 1ha1 best approximates global mean sea level
by the first version ofNAVDSS. using a1 a coruis1en1 data collec1ion epoch. In addi·
hybrid geoid named GEOI003 and associ- tion. three refined. regional. gridded models
ated with the NADS3( 1996) horizonllll ,,~Ube fit for North America and overse.as
datum. and hru been subsequemly improved territories. These are called n1odels because
to the curreo1 GEOIDIS. Each successive we measure geoidal heights a1 poiois or
improvement. integra1e.d increasingly den.se along lines al various pans of tlte globe. but
and accurate gravimetric measurements with we need geoidal heights everywhere. Equa-
improved horiiomal measuremems and tions are $la1is1ically Fil to relate 1he mea·
analy~is. primarily from belier GNSS. sured geoidal heights lo geographic
coordinates. Given any set of geographic
110 GIS Fun(l.amentats
coordinates, we may then es1ima1e the onho· ti) in Washington state to zero in Florida.
metric height There are also significant local anomalies in
The GEOID2022 will be integral to a height due to measurement error and spar-
new North Ameriocan vertical datum, called sity, and geophysic.al dynamics such as post-
glacial rebound. earthquakes. and sea level
the Nonh American-Pacific Geopo1emial
Datum of2022 (NAPG02022). This work is rise. This means all heights will change.
in concert with developing the hori2001al some by more than a meter (3 feet) when we
NATRF2022 da11un. and will hannonize adopt the new heigh1 datum.
horizontal and ve11ical positioning. The new The NAJ'GD2022 is a conceptual shift
ver1ical datum wi ll integra1e exteusjve new in how we ,,~U measure heights going for-
airborne gravity s.urveys of the entire U.S. wards. Past practice measuroo h,eigbts from
with pre,~ously d:at.a to yield a geoid surface eslablished physical benchmarks. GNSS
wi1h estimate accuracies within I cm. systems such as GPS now allow· us to rap·
The NAPGD2022 vertical datum will be idly. accurately. and inexpensively deter-
a substantial improvement, but wiJI signifi· mine ellipsoidal heights at any point on the
cantly alter heights across North An~rica surface of the Earth. When combined with
(Figure 3-22). Ne-wer. more accurate mea- ao accura1e model of geoidal heights. we
surements revealed acontinentaJ bias in may then use the method in Figure 3-lS 10
geoid heights. tilting from over 1.2 meters (4 calculate orthometric height Cu,rrent tech-
nologies will yield centimeter-level height
accuracies quickly. a1 relatively low costs.
everywhere. This reliance on GNSS will
grea1ly increase the speed and accuracy of
NGV029, no geoid determining elevations. but is dependent on
with an accW111e model such as tltal produced by
NAD27, NAD83(1986) the NAPG02022.
App,......_.n
O•t""-"'
H.o(NC-..,
, - ... ,_ - oeoii
_.... ' ....... O~DO
u~ a;,,
\ ,µ,--vr:. - 01,ioo,
- ,.,.,or!;
- ll,lg I
f i&unJ.21:: £,cpcctcd shift io heights when cOO.Ugiug from tbc NAV08:S to NA.P002022 (cour1csy
!<GS).
112 GIS Fun(l.amentats
Il
l• llllAW•
...... l· I
· ~ ·-·
O.tum: t.NGVD 1t2t
[!Miff' ll!n) l• ((.-tm) l· J
,.,,..._ ., ....... 0 -
r, G((m mocMt: 0 ...... - jGlOI011A l· I
i.,,- ~,
..
-
lOIIQlluda: ~ 124 1'636
-eo--
,g--~
Q flle~I
0 16tlllS
......
- • eh~•----' ....
Fia,U1~ 3,2J: An Cltamplc o( the application o(tbc ,·mica! danuu tn1nsJ01matiou soflwan: VDarum.
Chapter 3: Geodesy. Projections, and Coordinate Systems 113
surface, absenl wind, waves, and other fac. fac.es than the same orthometric heigh! at a
tors. The wat<?t level in a s1ill bathtub. pond. more southerly location. An. onhomeiric
or lake has tb.e same equipotential surface at height of the water surface at the soutl1 eod
one end as another. Gravity ensures this. of the lake will be higher than al the n0f1h
Second. the equipotential surfac-es are closer end. For example. in Lake l\.1ichiga1l a large
together wbeu nearer 1he mass center o( lake in Nonh Aruerica, 1.he ele\'ation of the
Ean.h. A.1; 1he equipo1en1ial surfaces con- 1he water surface. at the south end is approxi-
ve:rge, or become ;,denser," I.be wa1er surface mately 15 cm higher than the elevation of
seems 10 dip below our fixed onhome1ric the water surface at the nonh end.
height Dynamic heights are most often used
Because water foUows an equipotential when we're interested in relative lteigbts for
surface, and because the Earth's polar radius water levels. particularly ov·er large lakes or
is less than the equatorial radius, the onho- connected water bodies. Because equal
metric heighr.s of the water surface on large dynamic heigJns are at 1he s.ame water level,
lakes are usually different at the north and we use them when interested in accurately
south ends. For example, as you move far· representing hydrologic dr<rp, head. pres-
1her nonh in the Nonhen> Hemisphere. the sure. and 01her "ariables related 10 water lev-
equipotential surfaces converge due 10 the els across distances. But these differences
smaller polar radius with increasing gravita- can be confusing when observing bench
tional puU(Figure 3·24). An onhome1ric mark or sea level heigh1s. aud mtderscore
height is a fixed height above the geoidal again that our height referet1ce is no« mean
surface, so 1he northern orthometric heigh! sea level. but rather an estimated geoidal
will pass through more equipotential sur- surface.
:.·.,
. ......... .
-" ._
I ~
- - ·-· i~e~~,c~~
• • ~~~~
• _....---
e.--
u,·- -· .... . - : J, \
:~:ght-::::::;;.:=;!(
' : ..•:,."••
I ·•
.
•
. • •
*I() - ••• :.. • ••••,-~ ,
•·:~'.'-;"-,.'>"';::;;::.,.__:.;::...=:_;;;----<' .. o·•
• ..N . , •
•'T'"
..... · ..• 2N · ;:·:- ..
.:.i·:..
. . . . . . . . . . . . - .. • O
~
.·, ,••
.... .
• • • •: .• • • .• • " ~· •, • • • ·, • • "'II -·._.:-·:,r•1,ll, '
.,. .-·-,w. :r ••"" ··--·•'
. • •• •.•-•••:-"'•-;-·•t•.•~•• ...._;,,,·•··~
.- . ..
. . •. - -•.--
. - .
. .
.. ... • . ·-..I
.. . -
• '•
H•,•·
.- ...
·- ---.. . .. .
• -
"''-" -------v-
. . ...-ir-- --,-. · 9•"'d
·-. . . •. . .
- . • ---- ~-.----
• ·- -.
f lgurr 3...2-4: An iUu,1.rl'llion o(how d)'t'l.1inic hei;h1s ruwl ort1toincttic bcigbrs in:1.y diffct, ~)d bow
cqu~) ord1oe>otctric heights 111:iy c:<>1TCq,ond to di.ffc:rcnt heights 11bovc the wiiitc:r lc:,"c:I 011 a 111.fi:C: M:c.
8ec-~usc c:quipotc:utial 6'.urfaces com·crg;c, tbc water kvcl al tbc northern aud soothcm el'.trcru« of a
lake will baYc diffc:l'C'ot ord10mc:tric bc:13bu. Dynam.ic bc:iai1t&aod water levels att cqu.al across au
c:quipo.1cntiilll sur&cc.
114 GIS Fun(l.amentats
fi&11rt' J-25; Anilhn:1:n11ion of mean 6Cll lC"\-cl .and otbC1' mc•un, .111 n NOAA long·tcnn ttd.tl gage.
Note tba1 the 1ncao sea. JC\·cl at d:us station i.s 6.64 - 2.34. or4.3 feel abo,-c the NAVDSS zero h¢iglu,
Local Sea Level Datums Data for measured tidal stations are
available from the NOAA web page:
Warer height measurements along the
U.S. coast are typicaHy reference to local sea https://tidesaudcurrents.noaa.gov
level darums. As note<! earlier. mean sea These sites report mean sea level, as well as
level is not zero for almost all pou1ts along mean high. low. and extreme water levels
North America 's coastline. Elevations are (Figure 3·25). Most imponamly. they also
measured relative to a geoid. Zero ele,•,nion repon the NAVOSS orthometric heights for
coincides with zero mean sea level at only each 1idal station. allowing a conversion
one standard coastal station in Canada, near from local sea level heights 10 standard onh-
the centerof the continent. Variaiions in ometric heigh rsused to specify elevation.
gravity, currents. s alinity. tides. aod wind
produce me~,n sea levels that are different Figure 3-25 shows data for a station in
from zero by up to several meters (!Os of Seattle measured since 1$99. Mean sea level
feet) around the rest of the cominenral rim. has a local reference height of 6.64 feet,
But we still need ·10 know the ocean level meaning the sea level averages thai height
along rhe coas1Line for ""'"Y practical pur· above lhe long-tenn measuremeut of a given
poses. including cons1niction. flood protec- law water height. The NAVOSS .height at the
tion. aod water managemenr. \Ve have same point is 2.34 feet. which yields a sea
esrablished a network of long-term. refer- level height of 6.64 - 2.34. or 4.3 feet. As
ence measurement stations along the coast- strange as it may seem at first. the mean sea
line. We precisely measure both sea level level at this Seanle sration bas 3JJ elevation
and the station orthometric height so that we of 4.3 feet. Any point nearby that has an ele·
can tie our suu,dard elevation to local water vation less than 4.3 feet will be below sea
heights. !eve~and will likely nood frequently if there
is access to the sea. Local coi,stntction.
Chapter 3: Geodesy. Projections, and Coordinate Systems 115
water level measureJnenls. or otJ1eractivities This mean sea level offset varies by
depeuden1 ou. sea level will reference 1his loca1io11. for example. Pon Sau Luis. CA.
staiion measurements. and 1he 4.,-fooi off- bas a ven.ical offse1 of 2.7 fee1. and Vaca
set between mean sea level and seaside orth- Key. FL. has an offsel of -0.8 feet When
ometric beigln1s. beiglus of sea le\'el are impor1an1 for au
analysis. projec1s should reference die near-
est local sea level datunt
116 GIS Fun(l.amentats
11and 12• Poi111s roward tl1e edge of Ille map I, ,,,,. ellrpsoid
4,,,. surface
surface. sucb as Dand E. are stretched apan.
The scaled map disran<:e between d and e is
grearer tl,an t.he distance from D 10 E mea-
sured on the surface of the Eanli More sim-
ply put the d israoce along the map plane is • mop
grearer tlian tlle corresponding disrance projection surface
along the curved Earth surface. Conversely, •1tght" source /
poinrs such as Aand B 1ba1 lie in berween I 1
and I 2would appear compressed 1ogether. d,stonce Ob < A§ E
The scaled map disran<:e from a 10 b would distance de > OE
be less 1han the surface measured distance
frolll Ato B. IDistortions ar I 1and I 2are zero. Flipi1·e 3-28: Distortion dutin3 inap projccciou.
Tlus side \'ic\\' shows bolh cxptuuion t1ud conl..
prcuion of llfC.aS on .1 plon,, r mnp.
118 GIS Fun(l.amentats
Figure 3.2s. This planar surface intersects surface of the ellipsoid. and by approxima-
the globe at a line of m1e scale, the solid c~- tion. Earth.
cle shown in Figure 3-29. Disionion Figure 3-30 illustrates the calculation of
inc.reases away from the line of true scale. both the great circle and projecti.on. or Cane-
with features inside the circle compressed or
reduced in size. ,«hile foam res outside the
sian distant~ for two poims in 1be sou1bcrn
U.S.. using the spherical approximation for-
standard circle are expanded. Calculations mula introduced u1 Chapter 2. \Ve use a
show a scale error of -1% near the center of spherical approxunation of the Eanh's shape
the circle. and increasing scale error in con- because il is accurate enough for illus1:ration.
centric bands outside the circle to over 2% The difference between I.his simpler spheroi-
near the outer edges of the projected area. dal method (equal polar and equ~torial radii)
An approxinution of the distance distor- and an ellipsoidal method is typically much
tion may be obtained for any projection by less 1han 0. l%. and always less ihan 0.3%,
comparing grid coordinate distances 10 g,~at so typicaUy less than 50 cm (15 f~t) in our
cfrcledisrances. A great circle distance is example.
defu1ed on the su.-face of the spheroid or Projected (canesian) coordinates in this
ellipsoid (Figure 3-30). The circle distance is example are in the UTM Zone I 5N coordi-
the shortest p.1th li>etween two points on the nate system, and derived from tbe appropri-
ate coordiruue transformation equations.
/
/
,/
--- ----
/
/
I' I
'
I
r
\
\
--..... ~
I
I
\ I
' I
' ' ',
Flgurt 3-29: Approxim111c- Cfl'OI' due to projection distonioo (or a sped.fie oblique 111tfeoppbic proj«-
1ion. A pbnc- intcnl«ti the itobc :u :1. i ~ndsrd cif'('lc. This suul&rd c-irtlc: de finct" li.nc-of1ruc sc-:1.Jc,
where there- i, no didancc distortion. Oistor1io11 incren.scs away from tbi, llnc. ,md ,wic:, from · 1% to
o,·cr 2% in this example (adapted from Snyder. J9S7),
Chapter 3: Geodesy. Projections, and Coordinate Systems 119
Amted with 11te coorWnales for both pairs of A straight line between two poinls
points in bo1h lhe geographic and projoc1ed shown 011 a projoc1ed map is usually no1 a
coordinates. we can calcula1e tbe distance in straight line nor the sbonesr pa1h when trav-
the two systems. and subtract 10 find the eling on the Earth's surface. Conversely. the
length dis1orUou due 10 projec1ing from lhe shoriest dis1ance be1weeu poinls on lhe Earth
spherical surface to a Oa1 surface. surface is likely to appear as a curved line on
No1e !hat web,based or olher software a projected map. Tl1e distortion is impercep-
may report incorrect distances. because they 1ible for large scale maps and over short dis.
may use a spheroidal approximation. use dif- tances. but cxis1s for mos1 1,nes.
ferent Earth radii, or may use compu1ation- Figure 3.31 illustra1es straigh1 line dis·
ally efficiem b\lt more approximate fonnulas 1onion. This figure shows tbe shonest dis·
or algorithms.. so it is best 10 calculate the tance path (the-great circle) beiween Seattle.
values from tl1e original fornmlas 10 ensure USA. and Paris, France. Paris lies almost
accurate resi1ats. due east ofSeanle. bm lhe sbortes1 path
1racesa route nonh of rut eas:1-we.st line. This
shortest path is dislorted and appears curved
. ..,,
(
(
or
Fia:utt J.Jt: Curved representations , i:rn.igbt liucs are a nunifcsmion of projc:1:tioo distonion. A great
circle ptuh. s.bown 31:bo\·c, is tbe shor1cn totUc when Jlying Ctoin P;1tii to Sc-im k. 11:od comsnonly nppcan
dis-lortcd whco displnycd.
by the Plate Carree projection commonly (plane). The orientation of the developable
used for global maps. surface may also change among projections:
Projectioos may also substantially dis- for example. the axis of a cylind:tr may coin-
ton the shape and area of polygons. Figure cide with the poles (equatorial), 1he a.xis ,.nay
3.32 shows various projections for Green- pass through the Equator (transverse). or be
land. from ao approximately ''\utprojected" at an angle (oblique).
view from space through geographic coordi· Note thal while the most common map
nates cas1on a plane. 10 Mercator and trans- projections used for spatial data are based on
verse Merca1or projections. Note ~,e a developable surface. many map projec-
changes in size and shape of the polygon tions are not. Projections with names such as
depicting Greenlamd. pseudocylindrical. MoUweide. s.inusoidal
Most map projections are based on a and Goode homolosine are exa,nples. These
dea·elopnbles ,oface. a geometric shape onlo projeclions often specify a direct mathemali-
which the Ea.nb's surface is projected. cal projection from an ellipsoid <>nto a Oat
Cones. cylinders, and planes are the roost surface. They use mathematical forms not
common developable surfaces. A plane is related to cones. cyliuders, planes, or other
already Oa1. and cones and cylinders may be three-dimensional figures. and may change
mathematically "cuf' and "unrolled" 10 the projection surface for dilferem parts of
develop a flat surface (Figure 3-33), Projec- the globe. but generally are usecl only for
tions may~ characterized according 10 the display, and not for spatial analysis, because
shape of the deve lopable surface. as conic tl1e coordinate systems are uot stric1ly Carte·
(cone). cyliudrica/ (cylinder). and a:i111111ha/ sian.
Chapter 3: Geodesy. Projections, and Coordinate Systems 121
b)
,.. ""' ""' .,.,, ..,.., ,... ..... ,..
..... ....
-\ ..
,..,.
..... -.....
. .....
. ~
:-_, r;j,df ...,.
""' ""' .....
.. --J- --; >
l . • . ,::. ..,.., ,... """
d)
- - - - l:'wlrlCl'I
fiaurt J..Jl: M..ip projci:tio1u: C""o di.s1on 1he shape and .area of rcatures. M ill11s1rntcd " i•h 1bese v.iriou,
projoctioos o:f'Grcentand. U'Oln •) approximatcl)' uoproj«tcd. b) j<:O~p1i.ic coordinate,; on a plane. c) a
i'.\kre:Hor projection. ,md d) ll tr.t1U1\'c:rtc Mertlltotproj «tion.
projection
surfoce
\..._ developoble)
Cylindrical s1.;rfoc.e Conic
Flautt 3,..33: Projcc::tion , urfacct are dcti,"C'd from cur,:cd ·•<1c,•d op11blc" '1tt·£,cn th:u l)U.Y be 1natbnnati·
call)' '\anroll cd'" 10 a flllt , urfocc.
122 GIS Fundamentals
-----
.,
- --
r--+--1-
-- -;_~---1--.--
ssole-error -f%
standard poralleJ.
line oftrue-scale
Mop from
-devtlopeG
,yr1t1der
'
·~
,.
I
..
N
•
\
\
Fla:utt J...35: Tntns\"Cl'$C Mcrcat01' (TM) projcctioo (top). and an i.Uus:trntiou ohbc scale distortion associ·
atcd with the projc,ction (bonom). 11ic TM (>tOjc:ction dis:1or111 dittllUCcfl in an ctiM\'cit din:ction. but b/1$
rdali\'cl)' 1in!c distortion i11 n nqr1h-'°uth d1rcc1ion, 111.is TM intmect• tbc spbn-c along 1wo Ji n«, ;tud dis,.
tortiou ium:aset; wi th distance &ow ll~sc liuc, (bottom. $d3ptcd from Snyder. 19$7),
Chapter 3: Geodesy. Projections, and Coordinate Systems 125
The State Plane Coordinate Sys- and photogranunetric mappfog, and the
tem zones and SPCSs are often adop1ed for G!S.
The Siate Plane Coordina,e Sys1em One State Plane projection zone may
(SPCS) is a standard se, of projeclions for suffice for small staies. Larger siates com·
the United St~tes. The SPCS specifies posi. monly require several zones. each with a dif·
1ions in Cartesian coordina,e sys1e11>s for ferent projeclion. for eac.h of severa1
each state. There are one or more zones in geographic 2011es of 1he stat.e. For exan1ple,
most states, wi~1 slightly different projec.tion Delaware ha-s one State Plane coordina1e
parrune1ers in each Srn1e Plane zone (Figure zane, while California bas six, and Alaska
3-36). Muhip.le Slate PL1ne zones are used 10 has IO Slate Plane coordinate zones, each
l!mil distortion errors due to map projev corresponding to a different projection
HOUS. ,,~thin 1.he state. Zones are added to a state ro
ensure acceptable projection distortion
State Plane systems ease surveying. within all zones (Figure 3-3 7. left). Zone
mapping. and spatial data developmen, in a boundaries are defined by county. parisll. or
GlS. pariicularly when whole coumies or other municipal boundarie.s. For example.
larger areas are covered. The State Plane tl1e Minnesota souih/cemral zone boundary
system provides a couuuon coordinate refer- nios approximately eas1--west through the
ence fof horizontal coordimues over comny stale along defined counly bow1daries (Fig-
t~ mult1-coun1y areas while limitiog distor- ure 3,37. left).
tton error to specified maximum values.
Mos, states have adopted zones such 1hat Most State Plane coordinate systems are
projection distortions are kepi below one based on one of 1wo 1ypes of map projec-
part in 10,000. Some s1a1es allow larger dis· tions: the Lambert confonual conic or the
tortions (e.g.. Montana. Nebraska) for 1he transverse Mercator projections. Because
sake of having o,~y one State Plane zone. distortion ill a ttansvffle Mertaror rncreases
SPCSs are us,d in many types of work, with distance from the cemrnl meridirui. 1his
including property surveys. property subdi· projection type is most often used with states
, isions. large -scale construction projects.
1 or zones Iba! have a long north-south axis
(e.g.. Illinois or New Hampshire). Con-
versely. a Lambert conformal conic projec-
North zone
First
stondord
pomllel
Second
standard
parallel
South zon~
Fl Jure 3-37: The Suite Pla11e zoucs of Minnesota.. and dc1ails o(tbc. stMdatd pll.r:'LIJel plllcoru:m fottbc
M mucsot::i cmti;2l Stale Pbnc ionc.
lion is most often used when 1he long axis of been defined. The Stale Plane system was
a state or zone is Du the east.west direction first introduced with NAD27, and modified
(examples are Nc>rih Carolina and Vitginia). for NAD83 s11ch 1ha1 for many states. the
Standard parallels for the Lambert con- coordinates are different for any point. Tbere
fomial conic proj«tioo. described earlier. will be a new Slate Plane system ,,~th the
are specified for each State PL1ne zone. introduction of the NATRF2022 .
These parallels are placed at one-sixth of the Conversion among State Plane projec·
zone width from che nonb and south limits tions may be further confused by the various
of 1he zone (Figure 3-37. right). A zone cen- definitions nsed lo translate from feel lo
tral meridian is specified at a longitude near meters. Geodesy measurements are based on
the zone center. This central meridian points the metric system, while for a lo11g period of
at grid north: however. aUother merid•1ns time. survey measuremeurs in 1he United
converge 10 this central meridian, so they do States used a non-standard conversion of one
not point to grid north. The Lambert confor, meter equal to exactly 39.97 inches. This
ma! conic is nsed for State Plane zones for yields a conversion for a U.S. sun·ey/001 of:
3 t states.
As noted earlier. 1be transverse Mecca· l foot • 0.30-'8006096012 meters
tor specifies a central meridiau. This central
me.ridian defines gridnorth in Lhe projection.
A line along the central meridian points to The rest of the world adopted an imer.
geographic and grid nonh. and specifies the natioua/ foot of:
Cartesian grid direction for the map projec-
tion. All parallels ofla1imde and all meridi·
ans except the central meridian are curved I foot • 0.3048 metel"S:
for a transwrse Mercator projection, and
hence these lines do not parallel the grid x or
y directions. The 1ransverse Mercator is 11sed The U.S. survey foot is officially aoon-
for 22 State Plane systems (the sum of states doned as of December 31. 2022, but many
is greater than 50 because both the trans, older data sets were created 11su,g 1he U.S.
verse Mercator and Lambert confonnal survey foot. The slightly longer me1ric-to·
conic are used in :some slates. e.g .. Florida). fool conversion factorwas often the default
Finally. note lhat more titan one version in many jurisdictions for S1a1e Plane coordi·
nate systems within the Uoited States. and is
of lhe Stale Plane coordinate systemhas
usually specified as sFT in me1ada1a. Users
Chapter 3: Geodesy. Projections, and Coordinate Systems 127
should verify the units for any State Plane Dislances in the UTM s.ystem are speci.
data set. Jl"rtlcularly those that originated tied in meters north and east of a zone origin
before 2023. (Figure 3-39). They values are known as
unw 11or1M11gs. and increase in a northerly
direction. The x values are referred to as
Universal Transverse Mercator UT}.1 eas1i11gs and increase in an easterly
Coord lnat:e System direction.
The Universal Transverse Mercator The origins of the UTM coordinate sys-
(UTM) coordin.1te sy>tem is another Stan, tem are defined differently depending on
dard. distinct from the State Plane system. whether l11ezone is north or south of the
The UTM is a global coordinate system. Equator. In either case. the IIJTM coordinate
based on the trnns\le.rse Mercator projection. system is defined so that all coordiuates are
It is widely used in the United States and positive within ~1e zone. Zone easting coor·
other pans of North America. and is also dinates are all greater than zero because the
used in manr other countries. ceocral meridian for each zone is assigned an
The UTM system di\ides the Earth into easting value of 500.000 meters. This effec-
zones that are 6 degrees wide in longitude tively places the origin (E = OJ at a poim
and extend from SO degrees south latitude to 500.000 meters west of the central meridiatt
84 degrees nonh latinide. UTM zones are All zones are less tban 1.000.000 meters
numbered from I to 60 in an easterly direc. wide. ensuring 1hat all easti11gs will be posi-
tion. starting at longitude ISO degrees West tive.
(Figure 3-38). Zones are further split north The Equator is used as the northing ori-
and south of the Equator. Therefore. the zone gin for all north zones (Figure 3-39. left).
co111aining most of England is identified as Thus, the Equator is assigned a northing
UTM Zone 30 North. while tile zones con- value of 2ero for uortli 2oues. This avoids
taining mos1 of New Zealand are designated negMive coordinates. because all of the
UTM Zones 59 South and 60 South. Direc- UTM north zones are defined to be north of
tional designations are here abbreviated. for the Equa1or.
example. 30N in place of 30 North.
'
JS SS
fia11~ J--JS; \ff?,.-( ionc bo1.md..irics nnd zone d«igm1or1. Zoo« orc $1:< degrees wide aud rnun\,,crcd from I
10 60 &om 1hc tn1cruacional Jn1c Linc. JSO°\V. Zoo" 11te alw identified by 1bcir position n.ortb and soulh of
the Equ:uot, c.3--, 201,c 7 North. Zone 16 South.
128 GIS Fundamentals
1:}r~ bot..r,d~r In
.,w,2<1'- - I •
Qf:(I
wu( --1-1--.i t}
.:one t:ounclories er.
- ·H - E126
Qf'l9ln ~.~+-~ :i•·d
- tui'
-
N O C1 t/1,c' l,"q\AtQr CC'l'!fn): m,·,itli,JM
E • 0 ol 5C0.000 ffl<'(r,, Yn:,1 .:1 El29" lt'lrl'.
of tr.-: Cf'1'1trol trrer,t_!,on ,s I/.,,,!;)(
ri&uff' J-3$1; 1.JT'M zoue l 1N. lllld .SlS. P~r,,ruclcl"5 .are t.imila.r, \\;lh tbc notable di.ffct'C'occ Qf 1t false
oorthiog, iu 1bc s.oulh zooc:s 10 iuain1ain all coordi.nate1 at posi1i\·c uumbc:1"5.
Universal Transverse Merca1or zones values at tbe Equator are set to equal
south of the EquaJor are slightly different I0.000.000 meters. Because the distance
than those uonh of the Equator (Figt1re 3-39. from the Equator 10 the most southerly point
right). South zones have a false 11w1M11g in a UTM south zone is less than 10.000.000
value added 10 ensure all coordinates within meters. this assures that all nonhing coordi·
a zone are positive. UTM coordinate values nate values will be posi1ive witb.iu each
increase as one 1u,oves from south 10 north in UTM south zone (Figure 3-39).
a projection area. Jf the origin were placed al The UTM coordinate system is common
the Equator with 3 value of zero for south for data and study areas spanning large
zone coordinate systems. then all the nonh- regions. for example, several State Plane
ing values would be negative. An offset is zones. Many data from U.S. federal govern-
applied by assign,ng a lillse northing, a non- ment sources are in a UTM coordinate sys-
zero value. to au <lligin or other appropriate tem because many agencies manage large
location. For UTM south zones, the nonhing areas. Many state government agencies in
Chapter 3: Geodesy. Projections, and Coordinate Systems 129
the United St~tes distribute data in UTM these two zones, and the sta·te of Georgia
coordinate systems because the entire state straddles zoues 16 and 17. If a unifonn.
fits predominantly or eotirely into one UTM statewide coordinate system is required, the
zone. choice of zone is not clear. and either one or
the other of these zones must be used. or
As note<! llefore. all dMa for an analysis some compromise projectio.n must be cho·
area must be in the same coordinate system
if they are to be analyzed together. If nOI. the sen. For example. statewide analyses in
data will 001 co-occur as they should. 11,e Georgia and in Wisconsin are often con·
large \\~dth of the UTM zones acconuno- ducted using UTM-like syst,ems that invol\'e
dates many large-area analyses, and many moving the cenlral meridian to near the cen-
s1a1es. national forests. or multi-county ter of each state.
agencies have adopted the dominant UTM
coordinate system as a standard. States 1ha1 Other Common Projections
fall predominantly or entirely within a zone
often adopt a UTM zone for much statewide l11ere are map projections that are often
data. e.g .. Utah and UTM zone 12 (Figure J. used for ,isualization but rarely for spatial
40). analysis. These maps are often global or
eover very large areas. and may split.
We must note that the l.ITM coordinate severely reshape. or bide pans of the glolle.
system is not always compatible with Directions. distances. and areas are typically
regional analyses. Because coordinate val- not measured or computed in these projec~
ues are disconlinuous across UTM zone tions. as distortions are roo great. Common
boundaries. analyses are difficult across global projections include v.ariams of 1he
these boundaries. UTM zone 15 is a different Mercator. Goode, Mollweide, and Miller
coordinate system than UTM zone 16. The projections. There is a trade-off that must be
stare of Wistoasiil app!6xiillarely sttaddles
12 13 14 15 16 17 18
Flam"f' 3,-IO: UTM zonc-s for the lower 48 <:on1iauowi sta.tcs o(dtc Uniicd St.atC'S of Alne:rica. &ch UTIA.
zone is 6 dc*1"tet wide. All zo11ts i.u the No,1.bcrl, Hemisphere tltt l'lorth zon es. e.g., Zooc 10 No,th.
130 GIS Fundamentals
P'OJected projected
coordinates. COOrdtnote:s.
e.g .• e9
UTM WCS8-<(¢116~) lowo Stote Plone
Zone 15 No<th. WCS84(G!762)
geogrophte
coorcf111tOtes.
WCS8t.(G1762) _ _ _ , , / proJeetion. on
bock projec~ exact conversion
on exact cor"iVerslon
proJecred pl'o,ected
coord1note:s. Goordlnotes.
eg .. dotum tronsformotlon. •9
UTM WGS84(Gt762) e.g. vlo HTDP lowo S tote Pfone
Zol'le !S N()t'!h, NA033(201l)
?.£on.~
on exact
conversion
geographic
cool"dinotes.
WGS84(Gl762)
9eo9raphic:
coordrnotes.
NA083(2011)
J p<OJ~ChOO,
on exact
conversion
Fia;ntt 3-42: \Ve may proj«t bctw~rn most coordinate syst..-mi ,·UI ~ back (or iuvmc) and forward pro·
jcction cqwuiont. Thcs.c eakullltc c!'<ac:1 ~OjJ"l'lpbie c001di.na1cs from projected ooordii.u,~ (o). and d1e:l.'l
new projected coordi,1.tle$ from the geopphie coordii1J1ta. We nnut inkr1 JU1c:x1r., '-'ce wlm1.1 projection
c:ooversiou ioc:ludc, ll dtltmn cba'.lge:, A datum m11,fomlatio11 mu,1 be ,1sed toe:w,-cr1 troln one geodetic
danun to *-DOChcr (b),
Coordinat e System Identifiers this site as the source for their projection
definitions.
Most commonly used. established pro-
jections have been assigned standard identi- At about the same time. the National
fiers. The EPSG IDs are most comn101t Geographic Instinne of France. lGNF. devel-
assigned and maintained by tile E111upea11 oped a set of standard coordin.'1e systems
PetrQ/eum Su,,·ey, Group (EPSG). Petroleum and identifiers for projections, many not
engineers were among the first that needed u~ or specified by the EPSG •t that time.
to sbare completely defined. unambiguous. These are sometimes specified with dat.1,
defini1ions of coordinate systems across 1he although the EPSG are more commonly used
globe, with all projeclion parameters and when both specifications exist. Finally.
datum in standard fonnats. They have sin<:e ESRl. a large GIS software company. has
joined the lnt~mational Association of Oil defined codes for commonly-used projev
and Gas Producers. with a geomatics com- tions that are not specified either in the
mittee that m.aintaius and publishes coordi- EPSG or IGNF.
nate system standards. along with ru.1 onli.ne Coordinate system identifiers are often
registry. curren!ly al https://epsg.orgl displayed when selecting coordinate systems
home.html. Software vendors typically use in software. or on the main map sc.reen, and
132 GIS Fundamentals
-
ners were far from I.heir intended location.
NoCRS(or~pofoctSo,t)
Survey errors were inevitable giwn 1he large
areas and number of different survey parties
.........
" *-tly Uffd COOtdin1te flltifflll'ICI sys.--. involved. Rather u,an iu,~te end less dispute
and readjustment the PLSS specifies tha1
NAOAIVfM10,-. USN EPSQ."'ltil4 boondaries established by !he •1>pointed
Wortlf.>)1otl 13AJ$<100
USA.,.~QPMU#- ESRS:IOZOO'l
PLSS surveyors are unchangeable. and tha1
NAO ..t98.J..101LVT"..,~ISK ESltl:102334 town.ship and section corners must be
WOSSA/PIWOO M•l'CtlOI' EPSG:3167 ac.cepted as true. The typical section con-
Fltui~ 3~3: E:,ca:anplc o( sofm'l'U'C: Jjs1i11a coor- 1ains approxi,ruuely 640 acres, but due in
di1121c $}'Items. $p«-ifying thci.rEPSO, IONF, or pan 10 errors in surveying, sec1ions larger
ESRI ldi:111:ifien.. than 1200 acres and smaller than 20 acres
were also established (Figure 3-44).
in documemation describing data se1s (Fig- The PLSS is a slandardi.zed method for
ure 3-43). The EPSG or other identifiers are designa1ing and describing the location of
oflen ustd as a sltonhand descrip1ion of1he land parcels. meant 10 address 1be shortCOlll·
full projection. and reference 10 them may be ings of meres (Ind bounds surveying. the
confusing if one is unaware of the system of most common prior method. Me.1es and
standard identifie:rs. boouds describe a parcel rela1ive 10 fean,res
on the landscape. some1imes supplememed
with angle or distance measurements. Under
The Public Land Survey System meies and botwds a parcel description was
Fortbe bendit ofGIS practi1ioners in often ambiguous. PLSS replaced me1es and
the United S1a1es. we must cover one final bounds in !he early 1800s, used for nearly all
land designation system,. known as the Pub- land ou1side lhe original thineen colonies.
lic Land Sw,·~ · S;<1e111. or PLSS. The PLSS An approximately unifom, grid system was
is not a coordinate system. bu1 PLSS poims established across the landscape , with peri-
are often used as reference points in the odic adjustments incorporated to accoun1 for
Uni1ed S1a1es, so tlie PLSS shoo Id be well 1he an1icipa1ed error. Parcels we.re desig-
understood for work lhere. nated by their location wi1hin Ibis grid sys-
tem.
The PLSS di"ided lands by nonb-south
lines. 6 miles apart. mnning parallel to a The PLSS is i!llportant 1oday for several
principal meridia1J.. East-west lines were sur- reasons. f irst since PLSS lines .are often
veyed perpendicular to lhese nonb-south properiy botwdaries. they fonn namraJ corri·
lines. also at six lllile imervals. These lines dors in which to place roads. powerLines. and
form square townships. Each rownship was other public services; they are often evident
further subdh~ded iuto 36 sections, eac.h on the landscape (Figure 3-45). Many rood
section approximately a mile on a side. Each intersections occur at PLSS comer points.
section was subdi.vided further. to quaner- and these can be ,~ewed and rererenced on
sections (one.hair mile on a side), or six- many maps or imagery ustd for GIS data·
teenth sec1ions (one-quar1er mile on a side). base developmem efforis. Thus. 11te PLSS
Sections were numbered ia a zigzag pattern often fonns a convenient system to coreg:is-
from one 10 36. beginning in the nonhe.ast 1er GIS data layers. PLSS comers and lines
corner (Figure 3-44. squares numbered I to are often ploned on go,·emment 111.1ps (e.g..
36 wi1hin 1he 1hicker-boundary square). I:24,000 quads) or available as digital d.1ta.
Becaust 1he priu-,ry purpost of the
PLSS survey was to identify parcels. lines
and comer locations were considered static
Chapter 3: Geodesy. Projections, and Coordinate Systems 133
"
,.
28
..
27 25
•• ., ••
7
17
..
l3 311
• • 22 23 •• 21 n ,.
.. ..
20 ~
t ~ ) 2 I
,, ,. " ,. ,.
''
• • ,o
.. ..
7 21 25 ,0 2128 28
"
.. ,, ,. ,. ,. ,.
,," ' "' • • • ' ' ' • "
•• " 10
•• 33
"
•• ..
20 22 23
3 2
•
,0 ,.
20 r,
,. ,.
.. ,. • •• -" II
••
• ' •o
••
"
7 11
,.
7
" "
• 5
23
• 3 2
38
• "
23
" ,. ,. ,. " "
20 21 23 ..
28 27
"' 2S
"',. ,. 28
,,"'
27
7
• • 10
" "'
•• " •• .. .." "
33
" 35
"' 31 32 33
.. 20 21 22 23
3• •
,,
2
' '
,,
5 2 I
•
,. ,." • •
10 II 7 10 II 1
• •• ..
~·
,0 20
- ,. 33
27
.. "
35 38
•• " •• " " "
>3
••
Flaurt 3-44: Exampk of ,Wtiou in the size and shape of PLSS s«tious. Most s.cctious are tepro.'(i·
ouudy one mile squ.a~ wi.1h stttiou liuc-.s p:i.mlld or papc:ndicul~ 10 the ptiuwy ruc.ridi nn, as illustrated
b>• d1.e fOV.'Ut;.hip in lhc tapper left of LIU.S figure. How·c ,·cr, ndju.urncnts due 10 diffctttl.1 primnzy 11'teridi:1.ns,
d1£fc:ren1 s.111"\-C)' partict.. and CITOl"5 resuh m irre:gulnr Kctioo sizes ~od shapes..
I I I I I I ·I l I J
•
- . ~
Flentt 3,..45: PLSS lirics ntc often ,·i.tibk on the fa.ndicape. Roads (li$1u lines on 1bc inugc. :ibo,·c left)
of!cn follow tbc 1cc1ioi1 nucl 1ow11sl1ip lines (above right).
134 GIS Fundamentals
s2ols21
$2'9'1'628
Suggested Reading
Bossler. J,D, (:?002), Darnms and gelX!eti~ system~ ln J, ll9ssl« (Ed,). Manual 9[
Geosparinl TeclmologJ: London: Taylor and Francis.
Brandenburger.A.J.. Gosh. S.K. (1985). The world's topographic and cadasll1ll map-
ping operations. Photogmmmelric E,,gi11een'11g (Ind Remote Sensing. 5 J:437-444.
De,mis. M.L. (2018). The State Plane Coordinate System: History. Policy. and Flnure
Directions. NOAA Special Publication NOS NGS 13. National Geodetic S11rvcy.
Washington D.C.
Doyle. F.J. (1997). Map conversion and the UTM Grid. Photogrommenfc £11gi11eer-
i11g and Remote S,,11si11g. 63:367-370.
Elitborpe, J.A.Jr., Findorff, 0.0. (2009). Geodesy for Geommi,s a11d GIS Pr,ofessio11-
ols. Acron: Copley Custom Textbooks.
Featherstone. \V.E.• Kulm. M. (2006). Height systems and ,•enical datums: a, review
in lhe Australian context Journal o/Sparinl Scfe.uce. 5 I:21-4 1.
Flacke, W.. Kraus, B. (2005). Worl.ing 11;tlt Projections and Do111111 Tro11sfon11atio11s
in ArcGIS: Theory 011d Pmctical Examples. Norden: Points Verlag.
Iliffe. J.C .. Lon. R. (2008). Damms m,d Map Projections for Remote Se11si11g, GIS,
a11d S11n·eyillg. 2nd ed. Boca Raton: CRC Press.
Maling. D.H. ( 1992). Co<>rdi11nte S)>tems n11d Mnp Projectio11s. London: George
Phillip.
Meyer. T. H.. Roman. D.H.. Zilkoski. D.B. (2006). What does height really mean?
Pan Ill: Height systems. S111,•eyi11g a11d La11d lllfon1101io11 Systems, 66: 149-160.
NOAA Manual NOS NGS 5. State Plane Coordinate System of 1983. Imp://
" ~vw.ngs.noaa.gov!PtJBS_LJB/MannalNOSNGS5.pdf
Schuh. H.. Behrend. B. (20 12). VI.BI: A fascinating technique for geodesy and
astr0nomy. Jo11nwl ofGeody110111ics. 61:68·80.
Schwanz. C.R. (1989). Norri, Amerlcm, 00111111 of /983, NOAA Profess.i o11nl Paper
NOS 1. Rockville: Naiional Geodetic Survey.
Smith. D.S .. Roman. 0 .. Hilla. S. (2017). NOAA Tee/mica/ Repo11 NOS NG62.
National Oceanic and Atmospheric Administration.
Smith. J. (1991). /111rod11crio11 10 Geodesy: The His to,, , 011d Co11ceprs ofModem
Geodesy. New York: Wiley.
Snay. R.A., Soler, T. ( 1999). Modern terrestrial reference systems. part I. Profes-
sio110/ Sw,·eyor, 19:32,33.
Snay. R.A .. Soler. T. (2000). Modern terrestrial reference systems. part 2. TI1e evolu-
tion ofNADS3. Professio11nl S11n·eyor. 20: 16-18.
Snay. RA., Soler. T. (2000). Modero terrestrial reference systems. part 3. WGS84
and ITRS. Professiorwl S111w,y or. 20:24-28.
Snay, R.A .. Soler. T. (2000). Modern terrestrial reference systems. part 4. Practical
considerations for accurate positioning. Professfoual Sur,·eyor. 20:32·34.
Snyder. J.P. (1993). Flatre11i11g the £arth: Ji,.o Tho11sa11d l~rs ofMnp Projectio11s.
Chicago: University of Chicago Press.
138 GIS Fundamentals
Snyder. J.P. (1987). Mnp Projec1io11s, A Wo,-ki11g M01111nl, USGS Professio11a/ Pnper
No. JJ96. \Vashing1on D.C.: United Stales Government Printing Office.
Snyder. J.P.. Voxland. P.M. (1989). A11 Album oJMnp Projec1io11s, USGS Professio11nl
Pnper No. 145.1. Washington O.C.: United States Government Printing Office.
Soler, T.. Snay, R.A.(2004). Transforming positions and velocities between the I111er-
national Te:rrestrial Reference Frame of 2000 and the North American Datum of
t983. Joum nl ofSw,•eyiug E11gi11eeri11g. 130:49-55.
U.S. Coast and Geodetic Survey Special Publication 235. The Sta1e Coordinate Sys-
tems. hnp://www.ngs.ooaa.goviPUBS_LIBlpublication235.pdf
Van Sickle. J. (2010). Basic GJS Coordi11a1es. 2ud Editio11. Boca Raton: CRC Press.
Vanicek. P.. St.eeves. R.H. (1996). Transform•tio11 of coordinates be1ween t1vo hori-
zontal geode1ic datums. Jo11111n/ ofGeodesy. 70:740-745.
Wolf. P.R.. Gh.ilani. C.D. (2002). £/e111e111my S11n-eyi11g. 10th ed. Upper Sad.die
River: Prentice-Hall.
Yang. Q.. Snyder. J.P.. Tobler. W.R. (2000). Map Projectio11 Tra11sfor111atio11: P1i11ci-
ples a11d Applfcntlous. London: Taylor & Francis.
Zenk. D. (2014). Correct use ofNAD83 realizations and geoid model. Mimuso1a
SUM'e}'OY, 22:16·18.
Zilkoski. 0., Richards. J.• Young. G. (1992). Results of the general adjustme nt of the
Nonh Ame.rican Vertical Datum of J988. Sun·eyiug aud laud Juformariou S)1s·
tems, 53: 133-149.
Chapter 3: Geodesy. Projections, and Coordinate Systems 139
Study Questions
Canopus
1) measure
d.0
2) radius • i0
Center of 3) circumfer ence
the Earth = 2 Jt r adius
140 GIS Fundamentals
3.3 - Assume the Earth is approximately a sphere (not an ellipsoid). Also assume
you've repeated the measurements of Posidonius. What is your estimate of the radius
of the Earth's sphere given the following distance/angle pairs. Note that the distances
are given in meters, and angle in degrees. and calculators or spreadsheets may require
you emer angles in radians for trigonometric functions ( I radian a 57.2957795
degrees):
a) angle - 2° 59' 31.33325". distance • 332.000 meters
b) angJe = 9° 12' 12.77201", distance = 1.020.708 meters
c)angle = 1° 2' 12. 15566".disiance = IIS.200meters
3.4 - What is an ellipsoid? How does an ellipse differ from a sphere? What is the
equation for the Oanening fac10J1
3.5 - Provide three reasons why 1.here have been various estimates for Earth"s ellip-
soid radii
3.6 - Define tlie geoid. Tell how it differs from the ellipsoid, and from the surface of
the Eanh. Describe how we measure the position of the geoid.
3.8 - How does magnetic north differ from the geographic North Pole?
3.10 - Why are there multiple datums. even for tl1e same place on Earth? Define what
we mean ,'1!en we say there is a datum shift.
3.13 - What is the difference berween an orthometric height and a dynamic height?
Chapter 3: Geodesy. Projections, and Coordinate Systems 141
3.14 - Use the NCAT software available from the U.S. NOAAINGS website (https://
" 'vw.ugs.noaa.gov/NCAT/) to fill the following table. Note that all of tilese points
are i.n the continental United States (CO NUS) and longimdes are west. but entered as
positive numbers.
NAD27 NAD83(86) HPGN
''" S101c 11;.l!tuck ............ l.,t,t116t 111•,tuct, lon<Jl111dit
""''""
I Cd,I (S) 32''..1~· Uf'~A,C 3l'A.c1~1szr U7"0H!>l20l' 32•....·151110· 111'09',51201'
3.J S . Use the NCAT software available from the U.S. NOAAINGS we bsite (https://
www.ngs.noaa.gov/NCAT/) ro fi.U the following table. Note that all of til1ese poinls
are in the continental U1tlted States (CONUS). and longintdes are west, but entered as
positive numbers.
NAD27 NAD83(86) HPGN
Po• s,o,, lll••t# ""9- tm,1'41' ...,.... 1q· ,1.,q, '>')<'...
2 V~tOl'I A1°1r!5' ll'zP18'06" ,U"lT:iA-3571' 11.t'llnO " " ' 4f?r54.Je'l' 1221110,M·
3.16 - Use the World Wide Web version or download and Slart the HTDP software
from the U.S. iNOAAINGS site (at the time of this writing. Imps://
"~vw,igs.noaa.govfl'OOLS/Htdp!Htdp.shtml). and complete the following table. Use
the tooJ for a horizontal displacement between two dates. Enterepoch start and stop
dates ofJauuary I. 1986 and January I. 2015. respectively. Specify a zero height or z
for your danun 1ransfom1a1io11. Use the spherical Earth approximation fonnulas
described in Chapter 2 when calculating the surface shift distance, in centimeters
(cm). assuming a radius of 6,371 kilometers. Report the ground shift from 1986 10 the
2015 time period.
3.17 - Use the World Wide Web version or dow1tload and start the HTDP software
from the U.S. NOAAINGS site (at the time of this writing. https://
""~v.ogs.noaa.gov/fOOLS/Htdp/Htdp.shtml). and complete the following table. Use
the tool for a horizontal displacement between two dates. Eoter epoch start and stop
dates of January I, J986 aod January I. 20 IS. respectively. Specify a zero heigbt or z
for your datum transfomiation. Use the spherical Earth approximalion fomtulas
described iu Cl1ap1er 2 wbeu calculatulg the surface shift distance. in cemimeters
(cm). assuming a radius of 6.371 kilometers. Report the ground disiance between
points from tlte 1986 to tlte 2015 time period.
6 24°33'30" 8t°'.t!H9'
8 ctt51~998-34 WOO'S99970"
Chapter 3: Geodesy. Projections, and Coordinate Systems 143
3.J8 - Use the VDatum software (available at the time of this writing at https://vda-
nuu.noaa.gov/) to complete lhe table. Note that all lo11gin1des are west. entered as
negarive numbers.
3.19 - Use the VDat-um software (available at the time of this writing at bttps://vda-
tum.noaa.gov/) to complete lhe table. Note that lougirudes are west. entered as uega·
tive numbers.
....
NAD27
........ NAD83( 2011)
--
s ..,...
"""" t~ttof'I (m) la•tMc ~)11\Klc -~IO'I (m)
3.20 - Use the VOatum software to calculate the orthometric height change. in centi-
meitrs, forlhe listed NADS3(201 I) geographic coordinates. when switc hing from tl1e
NAVD(geoidl2A) as the source. 10 geoids for 2009. 1999. and 1996 respectively..
.... SlC1c
""""'°" ...........
OCOiO'llA
de...aton (m)
,.,.
~
........ · ((lfl). ~ t (cm).
lo 9'0099
~l(cni)
lo Qt0d9t>
I
"'' ll',ooo· .u1"cow 0
•• ••
48~30'00" .122W·o;,· ,co
--
'NcsfW'IQ10tl
'
s 41°0000• ·69•oo·oo· 200
7 2s'Of!OO· ·81.)(>00' I
144 GIS Fundamentals
3.21 - Use the VOatum software to calculate the orthometric height change. in centi·
meters. for ct,e listed NAD83(201 l) geographic coordinates. when switching from the
NAVD(geoidl 2A) as 1he so1irce. to geoids 2009. 1999. aod 1996 respectively.
"'" Sl41fl
"""'"' """"""
o)ff;-1l2A
......... ......... ...........
d.11('4'1 (cffl) .1.he1Ql11 (cm). ,0,t,c1·,i.otcm)
.
cbal,on (ml
, ,.,..,..,. .c~"'oo·oo- ·9f00'(!()' 200 ,. 11
••
4 TtllO.t 30•15cv ·91 d'OO' 200
3.23 - a) You wish 10 dredge a cbaunel near Naples. FL. near the NOAA tidal gauge
station 872511 O. You wish to maintain a channel depth of 30 feet below the mean sea
level. What is ·the channel depth elevation expressed as an NAVDSS height, in feet?
(the search function ill the upper left comer of the tides website, described ill the !ides
section of this chapter. will help find the station. Then look for a tides and water lev-
els, datums section)
b) Wbat is !be mean bigh-bigh water mark. expressed ill feel. as a NAVDSS height?
3.24 - What is a developable surface? What are tJ1e most common shapes fora devel-
opable surface?
3.25 - Look up the NGS control sheets for the following points, and record their hor-
izontal and vertical datmus. latitudes. and longitudes:
DOG. Maine. PLO= P006 l 7
Key \Vest GSL. Florida. PID =AA l645
Neah A. Washington. PID • AF8882
3.26 - Look up !lie NGS control sheets for 1he following points. and record their hor-
izontal and vertical datums. latitudes. and loogimdes:
Denver. Colorado. PID = KK 1544
Loma East. CA. PID • AC60?2
Austin CE. Texas. PID =DN7664
Chapter 3: Geodesy. Projections, and Coordinate Systems 145
3.27 - Using the spheroid fonuula given in this chapter>calculate the great circle dis·
1ance 10 1he neares1 kilome1er for 1be control poims in question 3.25 • above from:
. DOG to Neab A
- Key West 10 DOG
- Neab A to Key West
3.28 - Calcula1e 1be grea1 circle dis1auce for u,e control poims in question 3.26 -
above fro.m:
· Denver to Loma East
. Denver to Austi.n CE
• .,.\ us1io CE 10 Loma Eas1
3.29 - Describe the S1a1e Plane coordina1e sys1em. Whal l}'pe of projections are used
in a State Plane coordinate system?
3.30 - Define aud describe u1e Universal Transverse Mcrca1or coordina'le sys1em.
What type of developable surface is used with a UTM projection? What are UTM
zones. wllere is the origin of a zone, and how are negative coordinates 8\IOided?
3.31 - Wbat is a datum trru1Sfom1a1ion? How does it differ from a map projection?
3.32 - Specify which 1ype of map projection you would choose for each country.
assuming you could use onlyone map projection for tile entire country. 11te projection
lines of iutersection would be optimally placed, and you wanted to minimize overall
spalial distance distortion for the counuy. Choose from a transverse Mmcator, a L1.m·
ben conformal conic. or an Azimmbal:
Benin Bhutan
Slovenia Israel
3.33 - Specify which 1}'pC of map projection you would choose for eacb country,
assuming you could use only one map projection for the entire counuy, the projecr'ion
lines of iu1crsec1ion would be op1in1ally placed. and you wanted 10 minimize overaU
spatial distance distortion for the counuy. Choose from a transverse Me1ca1or. a Lam-
bert conformal conic, or an Azimuthal:
C11ile Nepal
Kyrgyzstan The Gau1bia
3.34 - Describe 1be Public Land Survey System. Is i1 a coordinate sys1em? What is its
main purpose?
146 GIS Fundamentals
147
........
Fiaurc "'-1: ~1;ip11 lui\'c 5c:r\'Cd to store gcognaphic: knowJcdgc f~ 111luS! tbc pMI 4,000 )'C.llr$.. TI1is c:ui)' m~p
of oor1bcm Europe .bows 3pprox:imatc sbnpcs aod rclnlJ\·c l~.aborui.
148 GIS Fundamentals
title t-J:::::=-~==:::-;-::::;:-:;:;:::";::-=:=:=-;--;-;---,
Recreotionol Londs of t he Contiguous U.S
- '-
doto pone
D Por!</Forest
• Lake ~;;;;,--• -+l- scale bar
- Stote bousdory
- •
0
·-·-
~, !IIOO
Maps often depict coordinate lines (Fig- and is described in detail in Chapter 5. This
ure 4-4). When the Jines represent constant chapter focuses on human-guided coordinate
latin1de and longitude they are called a gm1i- capnire, primarily from images and maps.
c11le (Figure 4-40). These lines may appear and on map output.
curved. depending on the map projection
and scale. Maps may also depict a grid con·
sis1ing of lines of constant Cartesian coordi· Accuracy vs. Reso lution
na1es. Grid Jines are typically drawn in both We start by contrnsting positional accu-
the X and Ydirections. and appear straight racy and spatial resolution. as these concepts
on most maps (Figure 4-4b). Graticules and are often confused. 10 the detriment of digi-
grids are useful because they pro,ide a refer· tal data development. Accuracy is how close
ence against which location may be quickly a value is 10 the truth-here. how close an X
estimated. or Y value in our data set is to t.he 1n1e value
Aerial and satellite images are often for a feature on the Earth. Fonnal accuracy
included as a base i,n digital maps. blll they as.sess1nent is covered in cnapier l4, bow·
are not maps. although the line is becoming ever the concept is importailt enough to
blurred. Uncorrected images usuaUy provide introduce here. We typically quantify accu-
a distorted. non-Cartesian rendering of the racy as some average or range of errors. e.g..
Eanh's surface unless they are processed to stating 1bat measured average error for Light
create orthoirnages. In addition. images are pole locations was 2.5 metMS. AU data have
not maps because features of interest such as positional etTOr. typicaUy due 10 the way we
cities, rivers. or mountain ranges are not collected or processed the d.ata.
explicitly ide11tilied. However. in1ages are a Spatial re.solution in a data sel or display
rich source of geographic information. and is the smallest feanire 1ha1 is indi,•idually
standard techniques may be used to extraCI resolved. we.often note the resohirion of a
features through manual digitizing. raster data set as the original collected cell
described later in this chapter. or through size. e.g.. a I0-meter resolu1ion satellite
image classification, described in Chapter 6. image. or a 6-inch resohnion aerial photo-
Outdoor digiui2ing. usually with the use of graph. The resolution is often not the same
Global Navigation Sa1eUi1e Systems (GNSS) as the accuracy. Much image data have an
such as the U.S. Global Positioning System average resolution that is smaller than their
(GPS) is a co:mmon source of digital data, average accuracy. Forexample. a drone
.... .
\)
'"
• .....
.... ........ ' .....
.....
\
.....
.,,, .,,.,,
""
\
N
/JI
'
.,...
'"" •
0 b
fiaure 4-4: Maps oAen depict l.ines representing (o) a grotirole ofc-ooslaut llititodc anc:l lo11gitudc or (b) t1
grid o( cow1ai11 X arul Y c-oordina.1cs.
150 GIS Fundamentals
Scale
Resolution All displayed spatial data have a scale. a
relationship between a distance ou the screen
D
I or paper and a corresponding distance on
Earth. Map scale is often reponed as a dis·
tance conversion. such as one itlch to a mile.
meaning one inch on the map equals one
Accuracy ,;;-7 \ mile on Earth. They may also be expressed
or ~ \ as a tmitless ratio. such as 1:24.000. indicat-
ing a unit distance on a display ~s equal to
Erro ~ 24.000 unitS on Earth's surface. Digital maps
most often use a third method to repon scale.
as a bar or line of known distance, labeled on
the map (Figure 4-6), although many map-
ping programs automatically adjust the scale
f'ia:m't 4-5: A <Ulta $:Cl way have .i resolution, hac according to the display zoom.
a 5-cm tt'solution i.imac. a.iMi a lower acC'Utacy.
here: wi1.b posi1iot1s (l(fsc-1 b)' SO e1u (ron, tnit Displayuig a scale as a distance conver-
lit1es. sion orunitless ratio may be in error on static
digital document such as a pdf, jpeg. or png
image may be collec1ed wi1h a 2.5 cm ( I file because the 5(:ale is altered by zoom
inch) resolution. but the underlying GPS and level. This changes the magnifiC>tion on an
image processing may yield average errors electronic display without the. ability to auto·
over J meter - a substantially lower accu- matically recompute scale. Digital docu-
racy than resolution {Figure 4·5) ments should most often depict a graphic
Vector data abo have both accuracies scale bar. embedded in the map.
and resolutions. The accuracy is detennined The notion of large vs. small scale is
by the error associated with point and vertex often confused because scale implies a ratio.
locations. Accuracy is how close the coordi- A larger ratio signifies a large-sC1lle map. so
nate pairs that define fe.atu.res are to the tme a 1:24.000 scale map is considered large-
values. Resolution is a bit Jess clear with scale rela1ive to a l:100.000 sc.a]e m,1p.
vector data. Boundary-<1efu1ing vertexes Many people mistakenly referto at: 100,000
may be any distamce apart. and so there is no scale map as larger scale d1a11 a l :24.000
one resolution for a vector data set. There is scale map because it covers a larger area. II
a distribmion of intervals between adjacent helps to remember that features are larger on
vertices. and the resolution is typically some a large-scale map (Figure 4-6). and that
threshold length. .e.g .. the average distance large-scale maps often show more detail but
between venices. The resolution for veclor less area. Notice in Figure 4-6, the larger-
data may be set by image source materials. scale map at the top shows deta,ls of Tokyo
as ii can't be smaUer than image resolution. city. Tokyo shrinks in the successively
but the operator may further degrade resolu- smaller.sc.ale maps. but largeradditional
tion by sampling ,it larger intervals. Resolu- areas are covered.
tion is also limited by the equipmenl used Because maps often report an average
and operator anentiveness wbile digitizing. scale, and because there are upper limi1s on
the accuracy with which data ca.n be plotted
on a map, large-scale maps generally have
less geometric error than suLSll-scale nLSps.
Small errors in measurement. plotting. and
hardc.opy printing are mag:oified more on a
small-scale map than a large-scale map.
Chapter 4: Maps and Data En~y 151
a) Lorge-Scale Mop
b) Medium-Scale Mop
Asio
....
i.' : 11
Pocifrc oc eon
, ()
Sou!h
Cm>S.o 1500
c) Small-Scale Mop
;
-~ Asio
~ • ·. ·-. . Paci~
.. Ocean
.. . '
.•
lnd1on
• • •
Oeeon . ••
• 3.000 km.
Fi.&ur~ 4-6: M!f> co,·crnge. relative di.&tan«. t.od dctn.il cban.gc from larger sc11,lcs (top) to
smallc-r seal«. {bottom).
152 GIS Fundamentals
Flgtu't' -'· 7: A o,11pnuikcl'chooscs the nuncrial1 i11id methods wed 10 produce a lU:1.J.>, :.lid 10 impose$:.
fonit oll spatinl dc~il. Herc, tbc choice ofan irspul image witb ll 2:50·mctu resolution (left) rcodcn ii
impos~ible to r(¥esent all the details of the real Ja.k,: boundaries (right). in this e~ruplc. fc.atures
s.miillcr than appro:Uruatdy 250 metm on a side 1My not be faithfully rcpresentC'd on the lllap.
Chapter 4: Maps and Data En~y 153
regions. errors of omission may dominate. nal source materials 10 collect data at the
For ex.,mple. the prairie pothole region of required precision.
central North America is dominate<I by
Displaced Omitted
(5o(s
....... o~
. . .0..... ~(s
..
4.Oo,O'
"· ..
"'Oo.'•,
'(/
.. ......... ······,,
···...
rood displace~···.•. smoll polygons exoggeroted rood
from polygons om,tted 'Mdlh due to
stondol'(J symbol
The person digitizing affects the geo- automatically sampled at a fixed time or dis.
metric quality of manually digitized da!O. tance frequency. perhaps 011.ce each meter.
Operators differ in visual acuity. band steadi· The stream sampling ra·te must be speci·
ness. attention to detail. and ability to con. fied with care to avoid O\'er.. or under-sam..
centrate. Wort quality may vary through
pied lines. Too sbori acolie(lion interval
time due 10 fatigue or loss of focus wilh results in redundant points. Too long a col.
repelitive lasts. Oigi1izers should take fre- lec1ion interval umy cause the loss of import·
quent breaks, do quality ruid consistency ant spatial detail. When using time-triggered
checks. and compare to other digitizers to s1ream digitizing, the operator must continu..
ensure accurate, consistent data collec.tion.
ously move the digitizing puck: pausing for
a period longer than the sampling interval
The Manua l Digitizing Process digitizes multiple points clustered together.
This often creates a 11rat's nest•· of Jines that
Manual digitizing involves manually most later be removed.
positioning the puck or cursor over each Ult'·
get point on an image or nmp and collecting ~'ltfinimum dis1m1ce digit·izing avoids
coordinate locations. This position/collect some of I.he problems inherent with time·
step is repeated for every point 10 be cap· sampled streaming. In minimum distance
nired. and aJI features of interest recorded. digitizing. a new point is 001 recorded unless
it is more 1han some minimum threshold dis·
Lines ba'Ve a star1;11g node. lhen a set of 1ance from the previously digitized poim.
veJ'lice, defining the line shape. and an end· The operator may pause witJ1out creating a
ing node (Fig11re 4- 12). Hence. lines may be rat' s nest ofline segments. The threshold
viewed as a series of straight segments con· must be chosen carefully- ueilher too large.
necting vertices and nodes. Polygons are missing useful delai~ nor too smaH. in effect
coonec1eo littes 1ha1 enclose areas. reverting back 10 s1team digitizing.
Digitizing is often in fX>i'11 mode. where
the operator must depress a button or other- Node and Line Snapping
wise signal to the computer 10 sample each
point. or in srnum, mode. where poims are Posi1ioual errors are inevitable when
data are manually digitized. These errors
should be small relative to the intended use
oftbe data: for example. 1be positional errors
may be limited 10 2 meters when only 5·
node""' meter accuracy is required. TI1eseacceptable
errors may still prevent the generation of
correct networks or polygons. For example.
river flow analysis may be in error because
major tributaries do not connect. or polygon
feamres nmy be incomplete because their
boundaries don't close. These smalJ errors
vertex~ mus1 be removed or a\'oided during digitiz ..
iug, Figure 4-13 shows some common digi·
1izing errors.
Undershoots and o,·erslioo1s often occur
when digitizing. Undersho<Yts are nodes that
do nol qujte. reach the line. or another node.
and oversboo1s are lines 1ha't cross over
existing nodes or lines (Fignre 4-13). Under-
Flgutt 4· 12: Node, define th<: starti.11; and c-ndiu;
pomts of lines:; \'Miccs dc611c line sJt11pc.
shoots cause unconnected networks and
unclosed polygons. Overshoots typically do
158 GIS Fundamentals
overshoot
undershoot
b-- l
dong ling pseudonode
( node '-.
POiygon ,,,.._
lober point,,,.,,. ·
m issing
IObel point
11.ne crossing
- no node
FIR,UJ'f' 4-13: Conuuondijilizing ctrOtt.
not cause problems when defining polygons. mnce tllay be interpreted as a tlliuimmn dis-
but they may cause difficulties when analyz- 1ance benveen fe~1tures. Nodes or vertices
ing line ne1works_ c.loser 1han this distance are moved to
Node snapping and li11es11(lppi11g are occupy the same loeation (Figure 4-14).
used to reduce. undershoots and overshoots While digitizing. an existing node or vertex
while digi1i2ing. Snapping is a process of bec.omes "magne.tic.," and pulls a new node
automatically setting nearby poims to have or vertex 10 it within !he snap distance. Node
the same coordinates. Snapping relies on a snapping pre\rents a new node from being
snap rofertmce or snap distance. This dis- placed within the snap distance of an already
existing node. Remetllber that nodes are
ofter snapping:
J
.····. '•
:' ,... •• O'Ytl'ShOol
·......
Fiz urr .a..J4-: Uodc:nboots.. o\'cnhooc.. :iod 1:u11PJ,>in.g. Sn:ipping mny ;otn nq,ck,. or m,,y pln.cc n oock onto
a ncart>y line scguKnt. Snappiog docs not occur lf the 1)00es a11dfor lmes arc separated by WON: t1Wl lbe
snap toleranc-c.
Chapter 4: Maps and Data En~y 159
used to define the ending points of a Jine. By poinlS I through 5. and a separale polyno-
su.,pping 1wo nodes 1og.ether. we ensure a mial func1ion 61 10 poin1s 5 1hrougb I I (Fig-
connec1iou between digitized lines. ure 4- 15). Consirauus force 1bese fi111c1ions
Line snapping, sorue1imes called edge to connect smoothly. usually by requiring
1he rrrs1 and second derh'llli'ves of the fouc-
snapping. ma,y also be spe1:ificd. Line snap-
p,~g mserts a. node at a line crossing and tions robe continuous at 1be Lntersection
c!Jps the end when a suL\ll o,·ershoo1 is digi- point This means the lines have the same
tized. This fo.rces a node to connect 10 a slope ai the in1ersection point and ct,e slope
lS cbangmg al 1he same ra1e for bo1h lines a1
ne.arby line \\~hile digitizing. but only when
the ,mdersboo1 or overshoot is less 1han 1he 1~e intersection point. Once the spline func.-
snapping disi.ance. WJ1e11 used properly. lit!<! uons are calculaied, they may be used 10 add
and node snapping reduce lhe nUD1ber of vcnices. For e.~ample. several new venices
undershoo1s and overshoo1s. Closed poly- may be automatically pL1ced on 1he line
gons or iniersec1ing lines are easier 10 digi- between digitized vertices 8: and 9. leadina
tize accurately and efficiently under node 10 1he "smoo1h" curve shown in Figure 4-15.
and line snapping. Da1a may also be digi1i2ed wi1h 100
The snap distance must be carefolly many vertices. High densities often come
selec1ed. lf100 short. !hen Y1apping has little from s1ream mode digitizing. spline or
impact Consider an opera1or and equipmen1 smootJ1mg func,Uolts, or classified image
tha1 rarely digi1izes endpoims wi1hu1 5 data and then ra.s1er.to-vee1or conversion.
meters of .their intended location. If the snap Redundant venices may be removed \\~thout
1olerance ,s se110 1 me1er. few poini.s will be sacrificing spalial accuracy. This helps by
snapped 1.0 taiget lines. wilh frequem dan- reducing file size and improving processing
gles and incomplele polygons. Conversely. speed. Poin1 tl1inning can fix 1his.
we satrifice accuracy if the SliAjl rolerailfe is 1:i:lilYp6lnt rhiillling metl1ods use a per.
se1 100 large. Wii.b a snap tolerance of IO peod1cular "weed" distance.. measured from
meters. feature boundaries less tJ1an 10
me1ers apart C'1llno1 be recorded. The snap one polynom.o t:c;uot-on
d,s1ance should be smaller 1han 1he desired 1s fit to points or,(:
fhrOugh ,,.,._ y • f(x)
positional accuracy. such that the needed
de1ail con1ained in the digi1ized dala is
recorded. Carefid selec1ion of 1he snap dis-
1ance should reduce digilizing errors and sig-
mficantly reduce time required for later Of the JOin po1n1
edi1ing. r(x) • 9Cxl smoothed
,rope of t(x) • sJope ot g{x) line
change: in ~ of f(x) ~Is
lhc c ~ in slope of q{x}
Reshaping: Li ne Smoothing and
Thinning
•
Digilizing software may provide 1ools 10
smooth. densify. or 1hin points while enter-
ing data. One COlllmoo 1echnique uses spli,,e
f1mc1ions 10 smoo1hly in1erpola1e c,rrves 10
::::::,.,29_ _ _..;:.._,u
belween digittized poinlS. and 1hereby bo1h
smoolh and density lhe se1 of vertices used O .secQnl.1 polyf'lo!WOI tat.lOIIC)n
to represent~. line. Aspfo1e is a se1 of poly- ts fit to poks (Ne,
no,.mal func1.oo.s d1at JOID smoo1hly (Figure lhrcugh f:lieVtt\, V • g(x)
4-15). Polynomial func1ions are fil 10 suc- f l11:1u't :f-15: Spline: ime:rpobtion to s:moo1b
cessive se1s of poin1s along lbe vertices in a digitized lines.
line: for example. a fimc1ion may be fi110
160 GIS Fundamentals
a spanning line. to identify redundant points. and a new spanning line is drawn to connect
The Lang method. exemplifies this approach. 4 points (Figure 4-16c. d).
A spanning line connects two nonadjacent The proce.ss may be repeatOO for succes.
vertices in a line. A prede.termined munber sive sets of points in a line segment on1il all
of vertices is spamned. The initial spanning venicei have been evaluated (Hgure 4· 16e
number has been set 10 4 in Figure 4-16. to h). All close ver1ices are viewed as not
meaning four points will be considered at recording a significant change in the line
each starting point Areas closer than the shape. a11d be11ce are expendable. A balance
weed distance are shown in gray in the fig- must be struck between the removal of
ure. A straight line is drawn between a start- redundant venices and the loss of shape-
ing point and tl,e search endpoint. the founh defining points, usually through a careful se1
poim down the line (Figure 4-160). Any oftest cases with successively la~er weed
iotem1edia1e points that are closer than the distances.
weed distance are marked for removal. In
Figure 4-16o. no points are within the weed The.re are mru.ty variants on this basic
distance. therefor~. none are marked. The concept Some look only at three immedi·
endpoint is then moved to the next closest ately adjacent pouus, testing 1be middle
remaining point (1Fig1~e 4-161>), and all poi111 agai11s11he line sporuied by its nvo
imermediate points tested for removal. neighboring points. Others constrain or
Again., any points closer than the weed dis- expand the search based on 1he complexity
tance are mrutt<I for removal. shown as of the li11e. Rather than always looking a1
open circles. Note 1ha1 in Figure4-16b. one four points, as in our example above, more
point is within the weed distance and is points are scrutinized when the line is uot
removed. Once all points in the initial span- complex (nearly s1raight}. and fewer whetl
ning distance are checked, the last remaining the line is complex (many changes in direc-
endpoint becomes the new starting point.. tiod).
e)
•
f)
c) g)
• •
d) h)
• •
Fia;uN" 4-16:~e L::mg algoriduu i, 3 ~~nou Liue·~llnning method. l,Jl tb~ lAng method. ,·crti.c cs ~re
rc1uo,·cd. or 1hinucd. when they ate v.-'l1h.i11 a weed distance to a i-p1uuun; line (adapted from Weibel 1997).
Chapter 4: Maps and Data En~y 161
'l . ..! •
1ir ·:·: jr·
:·:: :
. ''.':'j
tt ..
Fla111't' 4- 17: Skclc1oniting,, n fonn oflinc d1i1u1i11g tbnt i-'" ofln1 npplied nfrcr , c~u di@i1iiing.
162 GIS Fundamentals
to isolate a segment for future dele.1ion.. or to rately, and the new features can be confi.
modify only a portion of the line. Coordi- dently placed in the correct loca'lion.
nates are typically altered by interactively Amomatk remo\lal of sbon uad-ersboots
selecling and dragging points, nodes. or \1er- may be perfomied without introducing addi-
1ices to 1heir bes, shape and location. Points 1ional spatial error in most instances. A shon.
or Line segments are added as needed. dis1ance for an undershoot is subjectively
Groups of features in an area may be defined, but typically it below the error
adjusted 1hrough illternc1ive n,bbershee#ng. inherent in the source materials, or at least a
Rubbersheeiing involves fitting a local equa- distance that is insignificam wbeo consider-
tion to adjust the ,coordinates of features. ing the intended use of the. spatial data.
Polynomial equations are often used due to
their flexibility and ease of application. Slivers
Anchor points are selected, again on the
graphics screen. and other points are Slivers are particularly pemfoious errors
selected by dragging inieractively on ~1e that often come from poor digitizing. These
screen to march point locations. All lines and are small gaps or overlaps along shared
points excep, the anchor poims are internc- polygon boundaries (Fignre 4-19). Slivers
tively adjusted. either fonn gaps that are empty space. which
don't accurately represent adjacency among
All edits should be made with due atten- polygons, or overlaps which ere.ate spurious
tion to dis1ance shifts during editing. On- polygons. and don·, represent any real fea-
screen editing to eliminate undershoots n1res.
should only be perfonned when the "true..
locations offearures may be identified accu-
~-
_
Edito,
{11
·-
"""'
Clo.
INeg«e Fiaurl" ;I. JS: 01$ lOftW1'1"C' provides for a flexible tlnd con~,lctc
set ofediting tools:, Thc,c tools pro,·idc for the rapid, pre:casc•
./..( V.-:SbleSdecben.
=~-, coutroUcd 1:r<ation and modifica1ioo of ooordiuaic, and anri·
s_... bmcs of 'J)IUi:tl datt1 (C'OUrlcs:)' ESR[).
Ophon:
Chapter 4: Maps and Data En~y 163
~
overlaps'/ --gops 1wice. Digitizing and editing tools. described
in tile previous sections. usually include sev-
I I eral auto-complete options. These incorpo-
rate existing polygon edges when digitizing
I ' an adjacent polygon. a111omatically closing a
\ L__ _ new polygon boundary. The-re can be no
gaps or overlaps, because. the 1001uses the
¢9itiled
lliv.s existing edge 10 help create a new polygon.
fl.3utt -l, 19: G:i.ps 11nd O\'c-rfo.p s i ii pol)'@:On bound·
These tools ns1ke digitizing both more effi-
:mes result in ~ll\·cn, which \l~u..,Uy must be cien1 and more accurate.
rq>aircd priorllo data di,tributioo.
Featu res Common to Several
Layers
Slivers are also undesirable because
each spurious polygon includes an emry in One connnoD problem in digitizing
the attribute table. Sliver polygons nmy be derives from represelllation of feanires that
quite numerous, in extreme cases ou1num· occur in multiple data layers or images.
boring the tnte polygons in a da13 layer, This These features rarely have identical loca-
adds storage and processing overhead but no tions ou each source. and often occur in dif-
value to the data set. fereur locations when digitized into their
respective data layers {Figure 4-20). For
SLivers mos:1 ofceu occur when snap dis.. ex80lple. water boundaries ou soil survey
tances are set too short. so that nodes aren' t
maps rarely correspond exactly to water
automatically drawn together while digitiz-
ing. Slivers may also occur when snapping boundaries found ODhydrographic maps.
only to nodes. and not lines or edges. panic- Features may appear differently OD dif-
ularly when tl1ere are reL1tively long, straight ferent maps for many reasons. Perhaps the
illo\Jnl
-
••
.... ~ Sv,,a,"
••
• ••
-.·.
·..•
•.
.--------------~
,::::, l\<rh C""'*"' Coun,y ~ (NCOO'f)
·.,
<::) T;rrr:.>sStt Coun-y 8olxiday {lNC-IS)
164 GIS Fundamentals
maps were made .for different purpose.s or al edges wilJ be encou1uered and sl1ould be
different times. Features may differ because understood.
the maps were from different source materi- There is a trade-off between data vol-
als; for example. one map may have been umes, n.~lutiou, and area coverage. rest1lt-
based on ground surveys while anotl1er was
i11g in da1a often spli1 imo 1iles for delivery.
based on aerial pbotograpbs. Digitizing can Consider a state\\~de elevation r~1ster, gener-
also compound lhe problem due to differ- ally useful. bu1 cumbersome given tha1 most
ences in digiti2ing melllods or optrators. users desire detailed data over a small subse1
There are se,·eral ways to remove this or the state. A data reques1 may resuJl in sev-
••common feature'" iuconsiste:ncy. One eral tiles. which 1hen need be recombined
involves removing inconsistencies while re- before analysis. High-resolUlion images or
drafting 1he data fiom conflicting sources 01her raster data may challenge lhe uansfer.
into a new base layer. Redrafting is labor storage. or display processing capabili1ies of
intensive and time consuming. but forces a common comp11ters. and so da1• are broken
resolution of inconsistent boundat)' loca- into more manageable pieces for delivery.
tions. Redrafring ;also allows several inputs Data may also have edges due to differ-
to be combined imto a si11gle data layer. ences in the time of da1a collec1ion for adja-
A second. oft,en preferable method cent areas. For example., digital .aerial
involves establishing a <!master" boundary photograpl,s are collec1ed for ruos1 of1he
that is 1he highest accuracy composite of the agricultural k1nds of 1he U.S. on an annual
available data sets. A digital copy or overlay basis. Data are collected in blocks over the
opera1ion es1ablishes the common features growing season. and because of weather.
as a base in all the data layers. and this base schedules. and equipmen1 failures. several
may be used as each new layer is produced. days to weeks may pass between data collec-
For exan1ple, watt, boundaries illight be tiou 011 adjacent blocks. MMy fearurois. such
ex1racted from 1he soil survey and USGS as crop cype. stage of developmen1. we1land
sources, and rhese data combined in a third size. or harvest state may have cbaoged
dala layer. The th;rd dala layer would be be1ween image collection dates and be dis-
edi1ed to produce a co01posi1e. high-qua Lil)! continuous or inconsis1em acros.s blocks.
water layer. The composite water layer and hence die dala layer.
would then be copied back imo b<>lh the soils Ditrerem imerpreters may also crea1e
and USGS layers. This second approach. disconlinuities. Large-area mapping proj-
while resuhing i.n visually consisteJll spatial ects typically employ several interpreters,
data layers. is in many inslance.s only a cos- each working on differeo1 areas of a region.
01e1ic improvemem of 1he da1a. If 1here are All professional. large-area mapping etfons
large discrepancies ("large" is defined rela- should have prolocols specifying tl1e scale,
tive to the required spatial data accuracy), sources. equipmenL methods. classification,
tbeo 1he source or the discrepancies should keys. and cross-correlation 10 e11Sure consis-
be idemified and 1he most accur.ue data tent mapping across block or data tile bom1d-
used, or new. higbe.r-accuracy data collected aries. In spi1e of these effons. scme
from the field or original sources. differences due 10 human interpreta1io11
occur. Feature placement. category assign-
Map Boundaries and Spatial Data men1, and geuera1i2atiou vary among inter-
pre1ers. Tliese problems are compounded
Digital data are ofien created or deliv- when extensive check,ing and guidelines are
ered in 1i.les. giving 1heJll edges. Disco,11inu- not enforced across the projec1, especially
ities often occur at tile boundaries. These when adjacem areas are mapped al ditrerem
errors may be lacking in many newer. large- times or by two different organizations.
area daia collected wi1h digi1al methods. bu1
Chapter 4: Maps and Data En~y 165
}' ...
} ' 8""': Tlge, Conversion
""""'
}' 1._t From OLG
use of spatial dat.a. Me1ada1a allow 1he effi-
cien1 transfer of infomiation about da1a, and
infom, new users abou1 lhe .geographic
}' " - ' From !Jlte,cl>onQe l';fe ex1ent. coordinate system. q;uality. and other
} ' " - ' From 5>7 data characteristics. Metada1a aid organiza-
> "-tFromSOT5
} ' " - ' From Vl'1'
tions in evaluaiingdaia to deteonine ifihey
are sui1able for an in1ended 11se. e.g.. 10
review accuracy, coverage, or information
fla;nrt 4-21:.An example of a coovcrsion utiJ· needs. Meiada1a may also ai<l in dam upda1es
ity. here froiu. chc ESRI~GlS 50fu,'at'C. D:ua
may be eo1J,-erted from ouc of SC'\-Cni) fonn111s by guiding the choice of appropriate collec-
to an ESRl·5JXci.fic digi1"l dilta. tion methods and fo nnats for new data.
166 GIS Fundamentals
Most governments have or are in the The CSDGM consis1s of a standard set
process of es1ablishing s1andard methods for of elements that are presented in a specified
rep<>ning me1adata. In the Uni1ed S1a1es, the order.11,e siandard is flexible a11d may be
Federal Geograpbic Data Committee extended to include new elements for new
(FGDC) bas defined a Coment Standard for ca1egories of iofonoa1ioo in 1he fu1ure.
Digital GeospatiaJ Metadata (CSDGM) to There are over 330 diffen.~11 elements in 1be
spec.ify the content and fom1at for me1ada1a. CSOGM. Elemenls have slandardized long
The CSDGM ensures ~,ai spatial da1a are and short names and are provided in a stan-
clearly described so tha1 they may be used dard order wilh a hierarchical numberutg
effectively witbht an organization. The use system. For example, the westernmost
ofthe CSDGM also ensures that data may be bounding coordinate of a data set is element
described to other organizations in a stan- l .S. l. I. deimed as follows:
dard ma1,mer. and that spatial data may be 1.5.1.1 Wes! Bounding Coordina1e - wes1-
more easily evaluated by and transferred to emmost coordina1e ofthe Iimi1 of coverage
01her organizations. expressed in loogirude.
4. Spatial_Rererence_lnformation:
4.1 Horizontal_Coordinate_ System_Oefinition:
4.1.2 Planar:
4. 1.2.2 Grid_Coordinate_System:
4. 1.2.2. 1 Grld_Coordinale_System_Name:
Universal Transverse Mercator
4. 1.2.2.2 Universal Transverse Mercator:
4.1.2.2.2.1 UTM- Zone Number: 10-19
4.1.2.4 Planar_Coorditlate_lriiormation:
4.1.2.4.1 Planar_Coor<!inate_Encoding_Method:
coordinate pair
4.1.2.4.2 Coordinate_Representation:
4 .1.2.4.2. 1 Abscissa_Resolution: 2.54
4. 1.2.4.2.2 Ordinate_Resolutlon: 2.54
4.1.2.4.4 Planar_Distance_Units: meters
4.1.4 Geodelic_Model:
4 .1 .4.1 Horizoniat Dalum Name: North American Datum 1927
4. 1.4.2 Ellipsoid_Name: Clark 1866
4.1.4.3 Semi·major_Axis: 6378206.4
4. 1.4.4 Denominator_ol_Flauening_Ralio: 294.98
4.2 vertical_Coordinate_System_Dennilion:
4.2.1 Altitude_Systern_Oefinilion:
4.2. 1 .1 Allitude_Datum_Name:
National Geodetic Vertical Datum ol 1929
4.2. 1.2 Allitude_Resolution: 1
4.2.1.3 Altitude_Distance_Units: feet or meters
4.2.1.4 Altitude_Encoding_Method: aurlbute values
4.2.2 Depth_Sys1em_Dennition:
4.2.2.1 Oepth_ Datum_Name: Mean tower low water
4.2.2.2 Depth_Re$0lutlon: 1
4.2.2.3 Depth_Distance_Units: meters or feet
4.2.2.4 Depth_Encoding_Method: attribute values
FiauN' 4-22: e..~,mplc of n unnll portion of the FGOC TC<:ornmn,dcd mctfd;i.11' for,i i:100,000 K"lc
dcTfred digital cbtti 5<:1.
Chapter 4: Maps and Data En~y 167
Polygons define area bomtdaries, such as ified value. for example. a Io•c isoplelh
counties or census lrllCIS. Each polygon is defines 1be positiou on the landscape a1 ihat
given a color. shading. or pa11eni corre- 1emperature. Lines 1ypically do 001cross. in
sponWng to values for a mapped variable. that there cannot be two different tempera.
rures a1 the same location. Howe\•t r. iso,.
Doi de1111ity 111ap11 show quaori1a1ivc da1a
(Figure 4-24. bot1om left). Oois or 01her pleths often depic1 eleva1io11. and cliffs or
poin1 symbols are p!011ed 10 represen1 val· overhanging terrain do have multiple eleva.
ues. OOls are randomly placed in the poly- tions tH the same location. ln this ease. the
gon such that the number of dots equals the lower e1e,111ions iypically ~ss "under" 1he
101al value for the polygo1t Each do! on the higher elevations. and 1he isopleth is labeled
ex,mple map represen1s 50.000 people: "i1b the tallest height (Figure 4-25).
however. each point is not a city or other Once the features 10 include on a map
concentration of inhabitants. Note the posi· are defined. \Ve must choose the symbols
tion of pointS in 1be dot density map relative used 10 dtaw 1hem. Symbology depends in
to the city locations in the fe.anire map pan on 1he type of fearure. For example. we
direclly above it in Figure 4-24. have a different set of oplious when repre·
Isopleth maps. also k11own as comou,. senting cominuous features suchas elevation
maps. display lines of equal value (Figure 4- or pollution concentration tb.an when repre·
24. bo11om right). Isople1b maps are used 10 senting discrere features. We also must
represent con1inuous surfaces. Rainfall. ele- choose among symbols for each of the types
vation. and temperature are fe.atures that are of discre1e features: for example. 1he set of
commonly represen1ed using isopleth maps. symbols for poinls are generally different
A line on the isople1b map represents a spec- from those for line or area features.
0 b
Salisbury
\r.
,
i \r. •
~ Major J
J Roods
,"" t
F13.utt +.23: E.,:aiuplc of o) a detailed. $C'Octal-puiposc: lllllp. bctt III portion of a United States Gcologi-
c~I Sun•cy ,unp. rut.db) 111 ipccialitc:d uup fc>cutmg on a specific icl of sd«tcd fc:.ntff:t,. hctt showing
roads. n~ fc.itum; choKn for depiction on the m3p depend on tbc intended m11.p ~1se.
170 GIS Fundamentals
Feature mop
• Citic,
............ HigtlWI:)'($
-....,";,,.""...,,."".....
Fltul'f' ~..24: Conunon h:itdcopy nup typct depicting t.hc 1torthcat1cm United S1:ucs.
//~/·'/
We also .must often choose whether and
bow to label features. Most GtS software
provides a range of tools for creating and
placing labels. and in all cases we must
choose the label foot type and size. location
relative to the feature, and orientation. Pri.
mary considNations when labeling point
features are label placement relative 10 the
point locatiol!1, label size. and label orienta·
tiou (Figure 4-27). We may also use gradu-
ated labels. d,a, is. resize them according to
some variable associated with the poinl fea-
DD ·. ·.. •:-.··.
·:-.·:-
. .': •:.:..:.:'
:..:. .
! ~-,! ~-: ·:·:
n1re. For example, it is c:ommou to htwe .•·r,t ..•. ,•.}
larger features and label foms for larger cit- ·~·~.... .,...
• ;-j.· :~·~· ;-:.,
ies (Figure 4-27). Labels may be bent,
angled. or wr~pped aro,md features to
improve clariiy and more efficiently use Fle;111T ,i.26: Ex:unplcs of poi1u (1op), li.nc(mid),
;md ;u-c., (bottom) ,ymOOls us.c<l to di~iogui.sh
space in a ni.1p. amoug featutd of diffe:r¢1lt l)p,cs. Most GIS soft·
ware pro,·idcs II sc1 of im.ndtud tvntbols for
point, line. mu, 11;nd c0111:i11uou1 Surface- fc:ilUJU.
172 GIS Fundamentals
The legend should be exhaustive. Exam- Other elemen.ls are 1ypicaUy smaller and
ples of each different symbol type that located around the periphery or embooded
appears on the map should appear in the leg- within the main data pane. These other ele·
end. This means each point, line. or area meuts include map insets. \vhich are smaller
symbol is drawn in the legend with some data panes that show larger-or smaller scale
descriptive label. Labels may be next to. views o( a region in the primary data pane.
wrapped around. or embedded within the Good map compositions usually group
features. and sometimes descriptive numbers related elements and use empty space effec.
are added. for example. a range of continu- tively. Data panes are often ,grouped and leg-
ous variables (Figure 4-29, upper left). Scale end elements plac,ed near each other, and
bars, north arrows, and descriptive text grouping is often indic.atoo with enclosing
boxes are typic.illy u,cluded in the legend. boxes.
Map composition or layout is another Neophyte canographers should avoid
primacy task. Composition consists of deter- two teodeocies in map composition. both
mining the map elements. their size. and depicted in Figure 4-30. First. it is generally
their placement. Typical map elements, easy to create a map with automatic label
shown in Figure 4.3 and Figure 4-4. include and legend generation and placement. Tite
one or more main data panesor areas. a leg- map shown at the top of Fig11re 4-30 is rypi·
end. a title, a scale bar and north arrow, a cal of this automatic oompo:sit'ion. and
grid or graticule. and perhaps descriptive includes poorly placed lege11d elements and
text. These each must be sized and placed on too small. poorly placed labels. Labels
the map. crowd each other, are ambiguous, and cross
These map elements should be posi- water/land or other feature boundaries. and
tioned and sized in accordrutce with their fonts are poorly chosen. You should note
iillpoilauce. Tlie map's ,uost inljl<liffiit data that automatic map £Ymbol ·selection and
pane should be largest. and it is often cen- placement is nearly always suboptimal, and
tered or 01herwise given visual dominance. the novice cartographer should scnuinize
these choices and manually improve theni.
Area Legend
Graduated
Symbols
Le.gend
@ [
oO
.. "" oO""
Une Legend Sca le Bor s North Arrows
,..
7 (Pj A.. -+ ~ s
Flptt 4-19: E.,:amplcs o( lcgcod c1cmctUS and rq,rcscut-fltioo of S}'Olbols. Some S)'lllbol.s may be
grouped iu 11 «mrp11ct way to communieiu c the ,-alu« :usociatcd wi1b each aymboL
174 GIS Fundamentals
Spa.in
4S"O'O"N
~lf'(f(f'
.- ..... •
{ Legend'
• Cily ~lf'O'rtN
,.
JS-0-0".
-
....... RNor
~
•
Height
N
... "' '--+
10"C'O'W s:•O'O,, 11'()!(1' S"O'tr'E
...... """
Spain
•City
,,._ Rrtffl'r
.., H<>9"1
- ,45•0'C)"N
•
N
-+ .../ Madrid
•
O"O'O"'N
-1,,,.
,. • ..,.,,'?.r
,)?t,·
($' ~"
3S•O'O"N '"a<'<>•n ~.,1:,,·~e"
•l
1o·ovw
.. "
$W"W
"' '· ...... ·=-
Fiaurt' -1..30: An cx..,mplc of poor in.1p ,ksign (1op). T11it IOJ> pa.ncl sl10ws 11 nwubcrof1nis1l\kct conb11os1
for the ocopbytc carcopnplm, inclodmg ,mall labels (citi«) 3.Ud mllJn.,tr:bcd fon1, (graticul.c labels,
title). poor labclfog (clly label, o\•crlapping. ambiguously placed. aod crosslnj; dis-1inctly shaded .mas).
unlabc-lcd (cafW'cfl (occ11ru 1111d sc11t). poori)' pfaccd sc11Jc bar 111\d legend. ru)d u11bJ!ll:utccd ope:~, ap:w:c on
the left side ofdx m.,p. Thoic p:rob!c:mr :ire not prci;c111 in die improved amp design. '110wn in the lower
paocl.
Chapter 4: Maps and Data En~y 175
The second common error is poor use of for the Atlanlk Oce.an and Mediterranean
empty space. those par1s oflhe map_"itbou1 Sea. The emp1y space is more balanced in
map elements. There are two opposite. ten- 1ha1 it appears around 1he major map ele-
dencies: either to leave too much or unba.1- ments in approximately equal proportions.
aneed empty space or 10 cluner lhe map in an
M 1101ed earlier. this is only a brief
a11emp110 fill all empty sp_ace. Nole duu the inl:rocluction to canography. a subject cov-
map sho,1;n a'l the top of Figure 4·30 leaves
ered by many good books. s<>me listed a1 lhe
large emp1y spaces on lhe lefl (wes\em) end of this chap1er. Perhaps the bes1 com-
edge. wi1l11be A1lan11c Ocean devoid of fea- pendium of examples is the Map Gook
tt1res. The cartographer may address this in Series. by ESRJ. published annually since
several ways: by changing tlte size, shape. or 1984. Examples are available a11he time of
ex1en1 oft~ area mapped: by adding new this writing al www.esri.com/mapmuseum.
features. such as data panes as insets. addi- You should leaf1brough se,·eral volumes in
tional 1ext boxes. or other elements: or by this series. with an eye 1oward critical map
moving 1he legend or 01her map elemems to design. Each volume coma ins mant beami·
that space. The map shown at the bottom of ful and infonnative maps. and provides lech·
Figure 4-30. while nol perfect. fixes 1hese niques wo11h emulating.
design Oaws. in pan by moving the legend
and scale bar. and in pan by adding labels
176 GIS Fundamentals
Coordinate Transformation
Coordi'1nte trn11sformatio11 was once a PLSS surveys. These depict the original
common operation in the development of PLSS boundary lines. as well as lakes and
spatial data for GlS. Ii is becoming less fre. wetlands, and in some instances forests.
qucntly needed as we have worked through grasslands. or other feanires. We may wish
digitizing the legacy paper maps on which to compare past and current con-ditions. but
we stored most of our spatial data. and new the PLSS is a system of land division, with
image sources uow routinely correct1he geo- 110 coordinates or projection associated with
metric distortion inherent in images. How- the lines. We need to register the paper maps
ever. some older 111aps still remain 10 be 10 a projected coordinate systelll! prior to use.
convened. and some image systems generate The PLSS line intersections nmy be
images with acceptable geometric accura- used to register the original maps to a pro-
cies, although nol ortl10-correc1ed. Coordi- jected coordi1m1e system. As noted in Chap-
nate u-ansforniations are covered here for ter 3. PLSS lines often became propeny
completeness. boundaries. and subsequent roads often fol.
A coordinate transfonuation aligns spa- lowed these boundaries. SectioUt line inter·
tial data into an Earth-based m.ap coordinate sections may be surveyed directly. or
system. This alignment ensures featttres fall extracted from other registered data suc-h as
in their proper relative position when digital aerial images. and hence used to transfonn
data from different layers are combined. the original maps to a projected coordinate
Within the limits of data accuracy. a good system. Features on I.he original maps. snch
transforma1ioo helps avoid inconsistent spa· as lakes or wetlands at the time of the origi·
tial relatiooships such os faru1 fields on free- nal surveys. may be digitized rutd compared
ways. roads under water. or cities in the to current ones.
middle of swamps. except where these truly
exist. Coordinate trnnsfonnation is also
referred to as ,-egisrmtiou. because it "regis· Control Points
ters" the layers to a map coordinate system. A set of control polutS is used to trans-
Coordiuate transfonna1ion is most com· fonn the digitized data from tlie digitizer or
monly used to convert newly digilized data photo coordinate system to a map-projected
from an art>itrruy image. digitizer. or scanner coordinate system. These control points are
coordinate system 10 a standard map coordi- used 10 estimate equations lhat we use for
nate system. T11e input coordinate system is the coordinate transformation (Figure 4-32).
usually based on the input device. An image Control points are dilforent from other digi-
may be scanned and coordinates recorded as 1ized features. When wedigitize most poinls,
a cursor is moved across the image surface. lines. or areas. we do not know 1he nmp pro-
These coordinates are usually recorded in jection coordinates for these fea•ures. We
pixel. inch. or cemtimeter units relative to an simply collect the digitizer X and Ycoordi-
origin located near the lower left corne.r of nates I.hat are estabLished with reference to
the image. Before these newly digitized data some arbitrary origju on an image or digitiz-
may be used with other data. these "inch· ing table. Control pcints differ from other
space" or "digitiz,er.. coordinates must be digitized points in that we kno,v both the
transfom1ed imo nn Earth-based map coordi· map projection coordinates and ·the digitizer
nate system. coordinates for these points.
Figure 4-31 depicts the application of a These two sets of coordinates for each
coordinate transfonna1ioo in data develop- control point, one for the map projection and
meJJI. Early su1'1eys were often stored on one for t!,e digitizer system. are used 10 esti·
paper maps; in this instance. the original mate the coefficients for transfonna1ion
Chapter 4: Maps and Data En~y 177
equations, usually through a stalistical, leasl may be displayed on screen just prior to dig-
squares process. The u-ansfomuuiou equa. itizing a new map. Conirol points may then
tions are 1hen used to convert coordinates be entere<l. lhe new map anached to the digi-
from the digitizer system to the map projec- tizing table. and the map reg istered. The new
1iou system. data may then be displayed on top of the pre-
The transfonnalion may be estimated i.n viously registered data. Tbis allows a quick
the initial dig.iti2ing steps, and applied as the check on lhe location of lhe newly digitized
coordinates a:re digitized from the map or objects against corresponding objects in the
image. Th.is •~on-the-fly•· transfonnation sn1dy area.
allows data 10 be output and aual)'2ed with Control poin1s should oJeel several crite-
reference to map-projected coordiruues. A ria. First. they should be from a source with
previously registered data layer or image the highest feasible accuracy.
Soul"Ce laytr.
sconned. old
'
mop
matched
control
poln1s
•Yst~m ~
Second, control points should be more accu- na11hings (N), and are related to the X and Y
rate than the desired overall positional accu- digitizer coordi11a1es by the equations:
racy for the spati•I data. TI1ird. control
points should be evenly distributed through-
out the data area. A suf!icieut number of (4.1)
control points should be collected. abo\'e the
minimum to improve !he s1a1istically fit
transfonna1iou functions.
(42)
The coordinates of control points should
be kno"" 10 a high degree of accuracy and
precisioo. Because high is subjectively
defined there are many methods to deter- Equations 4.1 and 4.2 aUow us 10 move
mine control point locations. Subcemimeter from the arbitral)' screen or digi·tizer coordi-
accuracy may be :required for propeny nate system 10 the projec1 map coordin.1te
boundary control points. while a few mete.rs system. We know the X and Y coordinaies
may be acceptable for large-area vegetation for eve!)' digit.iud poiot. line vecnex. or
mapping. Common sources of comrol point polygon vertex. We may calculate the E ru1d
coordin.1tes are 1raditiooal mrnsit and dis- N coordinates by applying the above equa-
tance surveys, GNSS measurements. exist- tions 10 eve!)' digitized point.
ing ca11ome1ric qwality maps. or existing Te and TN can be thought of as shifts in
digital data layers on which suitable feanires the origins from one coordinate system to
may be identified. the next. The 0 1 and b1 parameters incorpo-
rate the c.hange in scales and rotation angle
Transformation Equations between coordinate systems. The affine is
tile mos1 commonly applied coordiua1e
Different base equations can be used in transformation because it provides for these
coordinate transformation. Tbe most com- three main effects of translation. rotation.
mon is the affi11e C()Q1r/im,1e rmn.tfon11atio11, and scaling. and because it often introduces
which employs linear equations 10 calculate Jess error than higher-order polynomial
map coordinates. Map projection coordi- 1ransfonnatio11s.
nates are often referred 10 as eastiogs (E) and
Pr0Jec:110J\
Cont rol point s coorc1tno1es tUTM)
ID x y E N
I 1030 +100 1 500.0834 5.003.6835
10 2 08 ·69.1 504.092 J: 5.002.4995
l -200 ·690 504.907 5 s.ooz.,995
9 -600 _,,.o 506.493.3 5.001.6735
'5 -1020 -t172 508.1013 5.001.6517
6 · IOI 7 108 508.0901 4.999.3840
·860 7~8 507 4759 • .996,s,oo
8' -40.0 457 505.6892 4.998,022.0
9 ILO 36.8 503,679? 4,998,~80
10 63.0 3,0 501,657.9 4.998.479 ~
11 630 177 !>01.6691 4,999.1160
12 630 643 501.680.3 4_997,2960
13 1060 <77 500.0053 4.997.9435
FIJUN' .a.J2:Au c:t11.mpk of c0111tOI point loc:1ti011.S (ro1u a n:wid d.11a layer. ruM! corrcspondi.n,g di1itizcr
:md 111;lp projccli:'On coonlinMcs.
Chapter 4: Maps and Data En~y 179
The affine system of equations has six minimizes the ro011110011 square en-or
parameiers 10 be es1ima1ed: TE• T..,. o1. o2,b1. (RMSE). The Ri\1SE is defu1ed as:
and bz. Each control poio1 provides E. N. x.
and Y coord.iaales. and allows us to write
two equations. For example. we may have a RMSE:
e1l • e2·
l
e3l .. ,. e;,l (• .5)
conrrol point consisting of a precisely sur- n
veyed center of a road intersection. Th.is
point has digitizer coordinates orx • 103.0 where the ei are the residual dis1ances
ceo<imeiers ~od Y • -100.l cemu11c1ers. and be1ween the trne E and N coordinates and the
corresponding Eanh-based map projection E and Ncoordiua1es in <he ou1pm daia la)'er:
coordioaies of E • 500.083.4 and N •
5.003.683.5. We may 1hen ,vri<e 1wo equa-
tions base<i on th.is control point:
nates. It provides one index of transform- including the higher-order squared and XY
ation quality. Trnosfon11a1ions are fit. The cross.product terms. and all estimated
RMSE will usually be less than the tme parameters \\~II vary.
transformation error at a randomJy selected
point. because we are acti\'ely minimizing
the N ao<l E residual errors when we statisti· A Caution When Evaluating
cally fit the transfomrntion equations. How. Transformations
ever, tlie RMSE is an index of accuracy, and Selecting the "best'' coordinate transfor-
a lower RMSE generally indicates a more mation 10 apply is a subjective process.
accurate affine transformation. guided by mul1iple goals. We hope 10
Estimating the coordinate transfonua- develop an accurate 1ransfonna1ion based on
1ion parameters is often an iterative process. a large set of well-dislributed control points.
Typically. control points are en1ere<I. the lsolated conrrol poims 1ha1 subs1an1iaUy
affine 1ransforrna1ion parameters estimated. improve our coverage may also contribute
and the overall RMSE and individual point E substantially to our 1ransfom1ation error.
and Nerrors evahuated. Suspect points are There are no clear mies on the number
checked. blunders fixed. and the transfonua- of points versus distribution of points trade.
tion re-estimated and errors evaluated until a off. but it is typically best 10 strive for the
final transformation is estimated. widest distribution of points. We wam at
Other coordinate transfonnations are le.ast two control points in e.ach quadrant of
sometimes used. A eonformnl eootdiunte the working area. with a target of20% in
11·m,sfon,,ario11 is similar to the affine. each quadrant. These goals are often 1101
except that it requkes equal scale changes in possible. The lransfonnation equation
the X and Y directions. Thisresults in a sys- should be developed with the following
tem of t<1ua1ious witli only fou, unknown obseJVations in mind.
parameters. and s,o the confonual may be First, bad control points happen. but we
estimated when only two control points are should thoroughly justify the removal ofany
available. control point. Every anemp1 should be made
Higher-order polynomial rransfomia- 10 identify 1he source of the error, either in
tions are sometimes used 10 transfonn the collection or in tl>e processit1g of field
among coordinate systems. An example of a coordinates. the collection of image coordi-
2nd-order polynomial is: nates, or in some blunder in coo:rdinate tran.
scription.
Second. a lower Ri',tSE does not mean a
beuer 1!l1Jlsfonua1ion. Tbe RMSE is a useful
1ool when comparing among tm13sfom1alions
that have the same model fonu. for example,
Here. tliecombined powers of the x and when comparing oneaffme 10 another affine.
v variables may t>e up 10 2. This allows for The RJv!SE is not usefol when c<>mparing
curvature in 1be transfom)ation in both the X among differeol model fonns. for example.
and Ydirections. A minimum of six control when comparing an affine to a s,econcl.order
points is required 10 fit ibis second-order polynomial. The RMSE is typically lower
polynomial 1mnsfom1ation. and seven are for higher-order polynomials than ao affine
required when using a 5lalislirnl fit The esti- U1U1sforma1io11 but this does 1101 mean !he
mated parameters Te, TN, o1• o2. b1• and b2 higher-order polynomial provides a beuer
will be different i;o equations 4.1 and 4.2 lit High-order polynomials allow more Oex-
when compared 10 4.7, even if the same set ibilily in warping the surface to fil the c,on-
of comrol points is used for both statistical ttol points. Uuforrunately. this warping may
fits. We change tile fom1 of the equations by significa111ly deform the non-control-point
Chapter 4: Maps and Data En~y 181
coordinates, and add large errors when the Control Point Sources
1ransfomu11ion is applied 10 all da1a in a
layer (Figure 4-34). Tlms. high-order poly- Control points may be obtained from a
nomials and o thers should be used with cau- variel)' of sources. Tradilional ground sur·
1iou. veys based on optical surface measurements
are a common. although decreasingly used
Finally. independent tests make 1he beSI method for de1enuining coMrol poim loca·
comparisons :among transformations. A tions. Federal. state, county,. and local gov-
comple1ely independent se1 of widely dis- ermneu1s au maintain a set of accurately
tributed test points is ideal. but these rarely surveyed locations (Figure 4-35). and these
exist \Ve often use a ''boots.trap" approach points may be used as control points or as
that successively removes poims. One poim s1aning points for additional surveys. Many
is \\~thheld, tile trnnsforma1ion estimated, of these known points have been estabLisbe<l
and the error at 1he withheld point calc,,. using traditional sur\'eying techniques. The
lated. The po;m is replaced and tl1e nex1 ground survey network is often qui1e sparse
point withheld. fitting the sallle type of and insufficiem for registering many large·
transfomi.'ltion. The equations will be scale maps or images. The global position-
sligh1ly different. The error at this second ing system (GPS). GLONASS. and Galileo
withheld point is 1hen calotlated. This pro- are Global Navigation Satellite Systems
cess is repeated for e.ach co01rol point. and a (GNSS) that allow us 10 establish control
mean error calculated. points. GNSS. discussed in de1ail in Chapter
5. can help us obtain the coordinates of con·
rrol points that are visible on a map or
image. GNSS are paniculariy useful because
FlfUtt 4-.34: An illustrati.oo tbat ~\1.SE should not be used to c,0mparc di.ffcr«ll otod« tnrufoi:-
11utioni. .-\bo\'C' :1.-re ponions of :1. 1r:1nrfonncd inug:c duit was n:3i1tmd 10 :t toad n.:twot~. l}'u•
11rc:n L5 i11~crsti!i:11 t~ l S wide!~· di.strib1.11cd c~1m;>I poi111~ B,«ausc the Jrd-onlcr po~)Uoru1:1l 1s
qu11c flexible m 6ttu1g th<, pou1ts aod ~1ng RMSE, 11 dlMorts ar~s between t~c ~ontrol
~ints. Tli.is is shown by the poor matc:b b<twC'ffl un11ac and , ·cc:1or roods (abo\'c naJu), AltJs.oua;b
11 has a bi3bcr R.\.fSE. the fitst~rdcr 11:1.nrfomuition ou t1tc left is bcttcro,•crnll.
182 GIS Fundamentals
Suggested Reading
Bolstad, P., Gessler, P., Lillesand, T.M. ( 1990). Positional uncertainty in man.ually dig-
i1ized map dala. lmematfouaf Joumal ofGeographical /11/on11a1fo11 Systems.
4:399-412.
Kimerling. J.A .. Buckley. AR., Muehrcke. P.C.• Muehrcke, J.P. (2016). ,Wap Use:
Reading. IJ.11alysis, a11d I111ei·pretatio11 (8th ed.). Redlands: ESRI Press.
Laurini, R.. ThompS0!3. D. (1992). F111tda111e111als of Spatial J1ifor111atio11 Systems.
London: Acadenuc Press.
Maq11ire. D.J .. Goodchild. M.F.. Rl>ind. D. {Eds.). (1991 ). Geogropl,;cal l11for111ntio11
S)·$tems: Principles and ~pplicatio11s. Harlow: Longman Scientific.
McBratney. A.B .. Santos. M.L.M.• Minasny. B. (2003). On digital soil mapJ>ing. Geo·
der111a. 117:3-52.
Nagy, G.. Wagle. S.G. (1979). Approximation of polygonal maps by cellular. maps.
Commuuicafior,s ofthe AssociaJ;ou of Computatioual ,\facl,;11eiy. 22:518-525.
Peulter, T.K.. Chrisman, N. (1975). Canogmphic data structures. TheAme,icm, Car·
togrnplrer. 2:55-69.
Peuquet. D.J. (1984). Acoucepmal framework and comparison of spatial data models.
Canogmpl,;ca. 2 1:66-113.
Chapter 4: Maps and Data En~y 185
Zeiler. M. (1999). ,\{odeli11g Our Jll>rld: Tlie ESRI Guide to Geodatabase Desig11.
Redlands: ESRJ Press.
186 GIS Fundamentals
Study Questfons
4.4 . Identify tlte kind of generalization at the labeled locations a through d in the
map below. !ell. compared to 1he ·~nub" in 1he image. below right. Categorize the
generalizations as fused, simplified. displaced, omitted, or exaggerated.
c-
R
~bin
Chapter 4: Maps and Data En~y 187
4.5 - Identify the kind of generalization at the labeled locations a tl,rough d in the
map below. left. compared co the "truth" in the image. below right. Categorize the
generalizations as fused. simplified. displaced. omitted. or e..aggerated: or if it
doesn't fil in one of these categories. then categorize it as "other:· and describe the
generaliza1ion.
, c,
;? •
•
I
u)
a ·'
\ • ~
• •• \ 'd
•
• ~
b- '
_ _e f'-.n+a
4.8 - Which map typically shows more detail - a large-scale map or a small-scale
map? Catl you give three reasons why?
188 GIS Fundamentals
4.9 - Complete the following table that shows scale measuremenls and calculations.
4.10 . Complele the following 1able d1a1shows scale measuremems and calcula1ions.
4.11 • What is snapping in 1hecome.,1 ofdig.i1izing? Wba1 arc undershoors and over-
shoots. and wby are they undesirable?
Chapter 4: Maps and Data En~y 189
4.12 - lde11tify a characteristic feature or error in digitizing at each of the labeled let-
ter locations in the drawing below: for example. node, overshoot. missing label. ere.:
"-----...h .s
g
,4 e
4..13 - lde,11ify a characteristic feature or error in digitizing at each of the labeled let-
ter locations in tile drawing below: for example. node. overshoot. missing label. etc.:
d
+5
e +6
0
+4 g
+3 +2
190 GIS Fundamentals
4. 14 - Sketch t.he results ofcombined node (open circle), vertex (closed circle), and
edge (lines) s,~apping with a soap tolerance ofa) a disiance of5 units. and b) a dis-
tance oft O11n;ts. as shown by the snap circles. Note the radius and not the diameter
of lhese circles defines the snapping distance.
[__
4. 15 - Sketch t·he results of combined node (open circle)."'"°' (closed circle), and
edge (lines) snapping with a soap tolerance ofa) a disiance of5 units. and b) a dis·
tance of IO units. as shown by the snap circles. Note the radius and not the diameter
of lbese circles defines the snapping distance.
0
5 un,t radius
a··.,.~-~, ''
'·
....
10 unit ._,,.;·'"
rod1u.s
I I
Chapter 4: Maps and Data En~y 191
4.16 - Wbat are splines, and how are they used during digitizing?
4.18 - Contrast manual digitizii1g 10 the various forolS of scan digitizing_What are the
advantages and disadvanlages of each?
4.19 . Wbat is the "common fearure problem" when digitizing. and how might it be
overcome?
4.21 - What are control points., and where do they come from?
4.22 - Define an affine ll1lllsformation. iitcluding o,e fonn of the equation. Why is it
called a Ii.near transformation?
4.23 - Wbat is the root mean square error (R,IIISE), and how does it relate 10 a coordi-
nate 1:rans.fom1atiou?
4.24 - ls (he average positional error likely to be larger, smaller, or aboul equal to the
RMSE? \Vhy?
4.25 - Wby are higher-order (polynomial) projections to be avoided under most cir-
c:umstanc-es?
4.16 - Wbich of the foUowing tra1JSformations wiU likely have 01e smal.lest average
error at a set of independent test points?
a) affine. RMSE = 14.23 b) affine. RMSE = 9.8
c) 2nd-order polynomial, RMSE • 9. 7 d) 3rd-order polynomial RMSE • 6.45
4.27 . Wbich of the foUowing transformations will likely have the smalllest average
error at a set of independent re.st points?
a) 1~1-ordel' polynomial. RMSE = 5.3 b) affine. RMSE = 9.8
c) 2nd-order polynomial. R-M:SE • 4.9 d) Ist-order polynomial. RMSE • 9.9
Introduction
Broadly defined. there are 1wo common
ways to obtain geographic coordinates. T11e
fi.rs1uses remote data collecrion, primarily
from aerial and sa1elli1e cameras. Coordina1e
positions may I.be obtained to within a few
cen1ime1ers (inches) from properly collected.
carefuily processed images. Digitizing from
these sources is described in Chap1er 4. and
image S)-Stems and automated image
extrac1ion are described in Cbap1er 6.
We also commonly collect coordinates
using GNSS/GPS for field measurements.
tools described in this cbapier. We travel to a Ffaw·e !i- 1: A ONSS :.w! d.'l.l.:l col!cc1ion p:td,:.:1gc
feature and physically occupy a locaiioo to fit for lon_g-1eno ~ ng~ird tnckiu.g (courtesy Aleic
Jahn. mdwia uw,~nnty),
meas.ure unknown X. Y. and often Zcoordi·
na1es. Measuremem sysiems have become
quite powerful. incorporating satellite and GPS more spe<:ifically refers 10 a U.S.-based
laser 1echnologies. primarily Global Na,;ga· sate1Li1e naviga1ion system. 1be first devel·
tion Sa1ellite Sysiems (GNSS). as well as !ta· oped and deployed globally.
dilional grom1d surveying methods. Field Coordinate surveying is often used to
measurements :may be accurate to within complemem GNSS measuremen!S. Coordi·
millime1ers (1enU1s of inches). na1e surveying uses optical and electronic
GNSS are satellile-based technologies angle and dislance measurem.ents. some
tha1 give precise positional information. day already described in Chap1er 3. Because bolh
or night, in most wead1er and 1errain condi- GNSS and coordinate S\ir,·cying measure·
tions. GNSS lechnologies may help navigate mems are importam. they \\~UJ be covered in
and lt3Ck any moving objec1 Iha, can carry a Ibis chap1er.
receiver. Receivers shrink in size. weight.
and power requirements eacb year (Figure 5- GNSS Basics
1).
GNSS have had a pervasive impact in
GPS, for Global Positioning System, is 1he geographic infonnation sciences, and
someiimes used synonymously for GNSS. underly almosl all modem spatial data
194 GIS Fundamentals
collection, either ,directly during field sur- GNSS is the use,· segme.111. the GNSS receiv·
veys. or indirectly 10 locate and orie111 image ers.
data. A GNSS recefrer is a device d1at records
As of2022 there are four functioning data rrausmined by each satellite. and then
GNSS systems. a11d two regional systems. processes these data 1_0 otnain three-dimen-
The U.S. NAVSTAR Global Positioning sional coordinates (Figure 5-3). There JS a
System (GPS) was the first deployed_aud is wide 3rr3y of receivers and methods for
the most widely used system. There is an determining position. Receivers arc often
operational Russi:an system named <?LON- handheld de\~ces with screens and key-
AS$ with 24-hour coverage worldwide. The boards, or electronic components mo~ted
Chinese Compass. or BeiDou. is a third sat· on ws and tnicks. plaues. or 011,er obJecls.
eUite navigation systemand includes a con· The satellite and control seg_ments differ
stella1ion of 30 positioning satellites with for each GNSS. The NAVSTAR GPS
gloool covernge. A fourth system. Galileo. includes a constellation of satellites orbiling
bas been developed by a consonium of the Earth at au altitude of approximately
European govenunents and industries. with 20.000 km. Initial system design, included 21
a total of30 satellites in the constellation. active GPS satellites and three spares. dis-
There is a regional system by the IJldian ~ov- tributed among six offsel orbital planes.
enunent (IR.l'o!SS). with seven satelhtes giv- Every satellite orbits the Eanh twice daily.
ing covernge 10 south-central Asia. and eac.h sate!Lite is usu.illy above the flat
operational in 20 16. but semi-functional horizon for eighl or more hours ,each day.
because. of equipment raHures. Another Experimental aud succe~ive operational sat·
regional system. QZSS. covers Japan, east ellites have outlasted thetr design life. so
Asia. and the wes.1en, Pacific Ocean. In the there have typically been more than 24 salel-
following disct~sion we use GNSS asa lit~ in orbi1 siillult:moously. Between four to
generic tenn for all global systems. and use eight active satellites are typically visible
GPS to refer specifically to the U.S. from any unobstmc1ed viewing location on
NAVSTAR system. Eanb.
There are three main components. or GPS is controlled by a set of ground sta-
segments. of any <)NSS (Figure 5-2). The tions. These are used 10 observe. maintain.
first is llie satellite segmem. This is a con- and mauage satellites. coin1~mti~a1i~ns. aud
stellation of satellites orbiting the Earth aud reL,ted systems. There are five tracking sta-
transmitting positioning sigL'ils. The second tions in the GPS system. spread across the
component of any GNSS is a cawrol s~- planet. Data are gathered from a number of
ment. This includes lt3cking. comummca· sources by the s•11ions. including satellite
tions. data gathering. integration. analysis. healtl1and status from each GPS satellite,
and control faciliuies. The third pan of tracking infonnation from each tracking sta·
tioo. timing data from the U.S. Naval Obser-
sotelllte segment vatory. and surface data from the U.S.
Defense Mapping Agency. A Master Control
~ ~ ~ Station symhesizes infonnatiou and broad-
~
casrs navigation. liming. and 01her dat~ to
I each sa1eUi1e. The Master Control Stauon
also signals ea_ch satellite as _appropriate for
course corrections. changes m operation, or
ij ot.htr maintenance.
user segment control segment TheGLONASS system is aoother cur-
Fit,urr S-2: The three scg,nci.is tba1 co111prisc 4
rently operating GNSS. GLONASS was ini·
GNSS. 1ia1ed by the former Soviet Union in the
Chapter 5: GNSS 195
early 1970s. Sa1elli1es were first launched in geostationary and inclined o:rbiting satelliles.
1he early 1980s. and the system became The system is designed for both civilian and
f,mc1ional in 1be mid-1 990s. The GLONASS military use. ";1h subs1an1ially augmented
system was designed for military navigation. accuracies, up to a few cenlimete.rs within
1arge1ing. and tracking. and is operated by China. due 10 a network of rocal receivers.
the Russian Ministry of Defense. with con- The European Galileo system also
trol and trackjng stations similar to those for
implements satellite. couLro1, and user stg·
the NAVSTAR OPS system. mems. There are 30 sa1eUi1es in 1he com-
GLONASS was designed 10 include 21 plete Galileo constellation. Sa1ellites are
active satellites and three spares. New armJ1ged in ~iree orbital paths a1 a 54°
designs have been phased in as older satel- orbital inclina1ioo. with a sa.1eUi1e altitude
lites have expired. and system managers near 23.600 km above 1he Eanh. This satel-
have focused on maximizing coverage over lite constellation will provide better cover·
Russia. The GLONASS system is estab- age of high nonhern la1i111des than the U.S.
lished, ";1h a. published renovation and NAVSTAR OPS sys1em. 10 bener serve
maintenance plan. such that commercial nonhern Europe. Galileo wull be managed
manufacturers have developed dual OPS/ through 1wo comrol cemers in Europe and
GLONASS <".1pable receivers. 20 Galileo Sensor Stations 5pread through-
The Chinese Compass (BeiDou) system out the world to monitor, communicate witJ1,
cousi.s1s of30 sa1ellites with aneodant and relay infonna1ion amon_g sa1eUiies and
ground station infrastruclure. Eighteen satel- 1he control centers.
lites were latuiched in 2018. and 10
upgraded sa1elli1es are scheduled for launch
in 20L9. The conste11ation includes both
FlgmT 5,3: A handbdd ONSS ttc:ch·cr (left) and a GNSS rcec-i,·« in uac (ria}t1, cour-
,CJ)' Juniper S)'1tcms).
196 GIS Fundamentals
-------
-- jl-•-
fl tm•t S.7: Range mcasumneni. from mnhip1c ONSS sa1cUi1cs. Range 1mas~mcats uc combined to
nanowdown the position of a GNSS r«civcr. Ran;c rucMUtcmcnts from more than (our satellites maybe
ukd 10 iruptovc 1bc accllr.lcy of a metu;urcd poti1ion (l\d:1p1cd £ro111 Hum, 1989).
lect infom1ation on multiple GNSS signaJs noise relative to the mean signal strength are
simultaneously and use sophisticated models ignored. Multipath signals may :also be
to remove most of the ionospheric errors. screened by properly designed antem1as.
Ou.al.frequency receivers are dropping in most commonly those with a gr<>mtd plm,e, a
cost rapidly. and may soon be widely afford· metal disk uuder Ille autena. Mwltipath sig·
able. Atmospheric range errors are still nals are most commonly a problem in urbau
largely present alter dual-frequency analysis, settings that have an abundance of strong
so these are best removed by differential cor· comer reflectors. such as the sid:es of build·
rec-tion. described in Ille next section. ings and streets. This is often a large source
System operation and delays also adds of ran_ge error. particularly when collecting
to range error. Satellite tracking is imperfect. data without a specialized multi-path rejec,
and timing and ot.her signals are slowed ,ion antenna.
during transmission through lhe entire sys·
tem. Atomic clocks on Ille satellite may be in Satellite Geometry and Dilution
error. although these are typically small. of Precision
Many of these errors may be partially
removed in rigorous analytical post-process- The geometry of the GNSS satellite con-
ing. rarely applied due to the complexity of s1ellation is another faclor that a.ffecls posi-
the calculations and needed additional data. tional accuracy. Range errors create au area
Differential corre<tion. described later. also ofuncenainty perpendicular 10 the GNSS
removes much of the systemic range error. signal transmission clirec.tion.. Tbese areas of
uncenainty may be visualized as a set of
Receivers also introduce errors into nested spheres. with the tn•e position some-
GNSS positions. !Receiver clocks may con· where within the volume defined by the
tain biases. Signals 111.ay reflect off ofobjects intersection of these spheres (Figure 5-10).
prior to teatli.iJig 11\e a,ueilitll. These These areas of uncertainty intersect. and the
reflected. 11111/rip,rth signals have a longer. smaller the in1ersection area, the more accu-
erroneous range than direct CNSS sign.ils. rate the position fixes are likely to be. Sig-
Multipath signals often have lower power. nals from widely spaced satellites are
and so may be screened by setting a thresh- complementary because they result io a
old sigoal-to-noise ratio. Signals with high smaller area of unc.enainty. Signals from S3l·
horizon
ongle
atmosphere
FJgurt S..9: GNSS rccc:1,·c:rs often <i1eard si$:JU11S from satc:lli1c:s nca.r the: borizoo. As 1.his ituas.c shows, si3-
n:1.ls from satdlita high Abo\•c: 1be llOrizon {o) wilh high horizon 1111,glc:s (P) h.wc sll()ttcr ~th lcnt.1l1s lit die:
111mospbc:rc: than lo·v:..ui.glc satellites (b, ,vitb an~tc w). Alm0$J>bcric debys ~od hc:oc-c rnu.gc c:mm arC"
l.at&cr for"tc-Ui1« wi1b low horizon 11D.Rles. Typically. a "mad:-'' is set at approxiin11tcly IS dcg:re«. abo,·c
the borizou, :tnd satellites ru'C' iguottd i(dtcy arc below this limit.
Chapter 5: GNSS 201
... ..
.-. · -~~~~~~~
. ~·
\.\1\.
',. o~. ',.
• ~(Q·
~
Fi,i;urt S-10: Rdativc OPS sa1dlitc: pos.ilioo aff«u. positional a«urn.cy. Ra.:ugc uni:ertaintic, a.TC a,sociakd
wnb each nmse mc:Mure1ncm. Til-Cse ooinbinc: 10 fonn au area. of unccrtllimyat the: intcl'SC'etiou ofthe muic:
mc-a.flurcn,a 11.$,.
ellites in close proximity overlap over broad ofDOPs, including the Horizontal (HOOP),
areas. resulting in large areas of positional Vertical (VDOP). and Positional (POOP)
uncertainty. Widespread satellite oonsrella- Dilution of Predsion. The POOP is most
1ions provide more acc.ural'e GNSS position used and is the ratio of the volume of a te.tra·
measurements. hedron created by the four most widespread.
Satellite geomeuy is summarized in a observed satellites 10 the volume defined by
number calle<l rbe Dilution ofPrecision. or the ideal tetrahedron. This ideal tetrahedron
DOP (Figtire 5- 11 ). There are ,.,,rious kinds is formed by one satellite overhead and three
........ ..
···- ..-...............
. ... . . -·-··:;pi
,,,/
-·~···.... "' _
...... ,,...,.-"
. . . . .. >
~
~ ;z:::.:.":.:
· - · · -.......
- ~.! .,,_............ ~
_,·-. \ .~:::;aof
unce.rtolnty
fi&m't 5· 1.l: OPS $8.leUi1e distrib\uion aff«1s posi1ioua.l accuta(';)'. Closely spaocd satellites rcwh io
l:kJltr positiostt1l ffl'Ots 1h.'1" widely spaced 131ellites. Sn1elli1e a.come-try is summarized by POOP. with
lowu PDOPs indic:u-ing better $:i1.clhtc geometrics.
202 GIS Fundamentals
satellites spoced at l 20.degree intervals design have lowered this to a few 10 tens of
arotuid lhe horizon. This constellation is minutes. rather than hours as in ·,he recent
assigned a POOP of one. and closer group- past Fast. inexpensive, dual-frequency
ings of satellites have higher POOPs. Lower chipsets are becoming available lhat may
PDOPs are bener. Moot GNSS receivers provide 20 cm (eight inch). real-time accu-
review the almanac transmined by the GNSS racy. Another teclmique. differential correc·
satellites and aue111pt to obtain measure- lion, is the most reliable means of obtaining
ments lhat include the constellation wilh lhe 30 cm (one foot) accuracy. When combined,
lowest POOP. If this best constellation is 001 multi-frequency and differential correction
available, for example, some satellites are may yield accuracies of a few centimeters
1101 visible, successively poorer cons1e1Ja.. with occupatfon times measured! in minutes
tions are tested until the best available con· Differential correction is described in the
steUation is found. The receivers typically following section.
provide a measurement of POOP while data
are collected. and a ma.timum POOP ctiresh·
old may be specified, above which data are
1101 collected.
Range errors and DOPs combine 10
affec1 GNSS position accuracies. There are
many sources of range error. and these com·
bine 10 form rut o,·erall raoge uncenainty for
the measurement from each visible GNSS
sa1elli1e. If more precise coordiua1e loca1ious
are required. then the choices are to use
equipment that makes more precise rru1ge
me-asurements. and/or to collect data when
DOPs are low.
GNSS accumeies depend on lhe type of
receiver. aanosplteric and ionoopheric condi·
lions. the munber of range measurements.
the satellite constellation. and the algorilhms
used for position <1e1enuiua1ion (Figure S-
12). Current CIA code receivers typically
provide accuracies between 3 and 30 m for a
single (ix. Errors .larger lhan 100 m for a sin·
gle fix occur occasionally. Accuracies may
be improved subs.1antia1Jy. to bet,veen 2 and
15 m. when multiple fLxes are averaged. The
longer the data collection time, the greater
the accuracy. improvements come largely
from reducing the impact of rarer, large
em,rs. but a, erage accuracies are rarely
1
o) b)
12 P1oresslonal.grade 10 ·l P1oressionaH1rade
GNSS receiver, g GNSS receiver.
10 ., 1..... j.. post-pro«>ssed diff·
8 ..........•' - ... WAAS corroclion,
erenliat correction, ........ open site
8 open site 7
6
.... .: .:
··-························-
Is
.r
..
.
.
...•..........
....
..,...............
...
..
...
..
""'
!!
.r
5
:
··1··········;-·········r···
. . ... .. .
4 ·•·: ··>··········1··········0:·
4
3
.
• •I·
.. ....········"····
-········ ..
·)
2 .
.
. ...
2
I l h4n . I ·a················-
. ······~....
.. ··t··
0 0
o.oo 0.05 o.,o 0.15 0.20 0.2 0.4 0.6 0.8
Single Fix Error (m) Single Fix Error (m)
c) d)
8 Consumer· -
grade GNSS
receiver. 6 ................. i .-t' ............. ::'"j
6 .. ··-·':' •. WAAS correc- ;:
" ~
tion, open site
~ !.,;
a'& -{,
- g- J
~
~
~
~
....... ·······).········•·.
..... .,..........~- i .. 1. l
. . £ ii
··-·····
h: .. ···.
.
-
t8. .1" 8
.
O~::'.!.:~~~~~
O I 2 3 4 5
0
Fl&urt' 5-12: Obfcr.·cd OPS erTOr dis1ri\mliQ1ts for \·;lrious rec:eh·m ,uidcr OJ>cn 11..;• coud1ions (o
lhtough <:) au.cl meau crronindcr open sL.1• And dcuse deciduous fores, ~nopy (d). R«uhs show bighc:$1
a«utt1cies for a professional-grade GNSS rc~in::r (pnncJ o, TRL.\WLE 6H. ~t-proocu.cd carrier
ph.uc diff«nui:al correclion, < 20 ktn 10 ll bMC: stn.tion), a profcuion.,1·~ GNSS rt<ci\'cr with
WAAS rc..tl4imc correction (b), n11d • n iucxpcusi\'c consi11ncr gr.ic:k GNSS (c, Gann.in c•rcx). r,,fcan
0
errors Cor both open &l..) and under forest canopy arc $.bown in p;u>e) d for these three reccin:r'Vcou.figu.
1
rnriom, nnd for an iPbooc 5 and a similar tablet OPS. Ccuti:mctC1"·ic\'d MC'Ut'3cies arc a,11ila.blc with the
beat cquipmcu tutdcr optimal co11di1io111. but ku ccxpcnti\'t rccci\'cn ;-ttld ob1tn1t1cd 1ki.EI reduct
aceurncic,. In ,omc in!itrui.,ces, tbc indi, ;du.11 fix error is in-,,ortnol. for example.. wbco digitizing line,
or polygon boundaries. while awrngc enon may~ more unportanl wbcu collectin; poin1 features.
allowina: muhiplc fixes. Note that ffl'<>n de-crease as tc-cbnol0$)' improYct. bu1 1bc lu~ct-priccd rc«i,··
crs nuiy no t lx more cxpcn$i\'c o,·cr tbc life of lljn'Oj« I if IO\\'cr :.<:curacies rc<i.uirc 11gni:ficn.n1 mtenUIII
cdi1i.ng for ONSS collected d:&1.1 («1W1c5y Andy cob).
204 GIS Fundamentals
Differential Correcti on
The previous sections have focused on range measurements. and these range mea-
GNSS positjon measurements collected \\~th surements contain errors. Some of these
a single receiver.This operating mode is error.i oredue to uncenaincy in !he measured
known as autonomous GNSS positioning. travel times from the satelute to the receiver.
An alternative method. known as differe11tlal These combined trave.l time errors. also
positioning. employs two or more receivers. known as timing errors, are ofteu among the
Differential positioning measurements are largest sources of positional uncenainry.
used primarily to remove most of the range In differential correction. we use the
errors and ~ms gr,eally improve the accuracy known base s1a1ion position to es1.ima1e the
ofGNSS positions (Figure 5-12). However. timing errors and hence range errors. Each
differential posi1ioning is not always GNSS satellite broadcasts its position along
employe.d, because single receiver position. with the ranging signal. The "true" distance
ing is accurate enough for many applica- from a given saiellite to the base station can
tions. and differemial positioning requires be caJcuJated because the base station and
more time andlor greater expense.
satellite locations are known. Note tlte quali·
Differential GNSS positioning entails fying quote marks around the tn1e dis1ance.
establishing al least one independent base We cannot exactly define where the satellite
station receiver at a known coordinate JOC3· is. and the base s1a1ion coordinales have
tion (Figure 5-13). The rnie coordinate loca- some (usually small) level of un.cenainty
tion of the base station is typically associated with them. If we are very careful
detennined using high-accuracy surveying about surveying the location of our base sta·
methods. for exnoople. repeated astronomical tion. then the errors in the base-to-satellite
observations. highest-accuracy GNSS. or measuremem a.re almost always smaller than
precise ground su.rveys. as described in 1he range errors contained in our uncorrected
Chapter 3. tim.ing measurement.
We use the base station to estimate range The difference between the true dis-
measurement errors for each position fix. tance and GNSS-uieasured distance is used
Remember that GNSS is based ou a se1 of to estimate the timfog error for a given sate I·
lite at any given lime. The. timing errors
change each second, so they should be mea-
Differential Posrtiornng, sured frequently.
simultaneous GNSS
measurements at Timing correclions may be applied to
field -roving (unknown) 1he range measurements collected by a rov-
and base (known) ing receiver (Figure 5·14). These roving
sites receivers are used to measure GNSS posi·
tions at field locations with 1ulknown coordi-
nates. The timing error. and hence range
error. for each satellite observed! at a field
location is assumed to be the same as the
range error observed simultaneously at 1he
base sta1fon. We adjust the timing of each
satellite ,neasuremenl made by the rover.
then re-calculate the rover's position. This
adjustmenl usually reduces each range error
roV1ng receiver base site and substantially improves each position fix
taken with the roving field receivers.
Fla:ntt 5· 13: Di.ffcrcruial GNSS posilioninJ..
Chapter 5: GNSS 205
Base Roving
station receiver
true position corrected 0 ronge
position ~ ,,/ correction
GNSS measured / /v ronge error
\ \ GNSS measured
posrt,on e O position
Flgurt- s..1.-: Difrcttnti.al co1nctiou i, based ou mcasuriu3 GNSS tllni.ng: and nmic «rorJ 111 a b3~
s1n1-iou, 111:l(l «pplyins 1l1csc cm>ri u cCITT'C'Clious 10 $.imuhn.nc0111ly measured ro\'cr-po<titiom1.
The timing errors change across the sur· men ts are collected more than a few tens of
face ofdte Eardi. and tltis plac~ a restriction seconds apart. they do not c-0rrespond to the
on lhe use. of differential GNSS correction. same set of errors. and thus the difference at
Our roving receivers must be "near'" our lhe base station cannot be used 10 correct tlte
base staiion for differemial comction to rover data. Many systems allow data collec·
work. A subsrantial ponion of the range lion to be synchronized to a standard timing
error is due to atmospheric and ionospheric signal, thereby ensuring a good match \Vhen
interference with tile GNSS signal. Fom•· the error vectors are applie<II 10 correct the
nately, lbese conditions often va,y slowly roving receiver data.
with distance through 1he atmosphere, so Base stadon data and roving receiver
errors in one .location are likely to be similar data must be combined for correction. Data
to errors in a nearby location. There.fore. as are often stored and then do·wnloaded from
Jong as the ro,'e.r is relatively near the base both receivers. and combined on a computer.
station. wi1bi11 a few tens to hundreds of Sofnvare provided by most -GNSS system
kilometers. we may expect differential cor· vendors is then used 10 compute and apply
rection 1·0 improve our jX)Sition measure·
the differeniial correcrions 1-0 the position
menrs. fixes. This is known as post-processed dif·
DifferenUal correction requfres the ferential correction, as corrections are
paired receivers collect data from a similar applied after. or pos1. data c-01lec1io11 (Figure
set of satellites. We cannot fix a timing error 5·15. top).
we do not measure. so the two receivers Post-processed diJferell!ial positioning
must measure the same satellites at the same is appropriate for many proj;ects. Road loca·
time. Any four satellites providing accept· !i91ls m~y l>e digi!ize<! wi!JJ ~ GNSS receiver
able POOPs will suffice, although more sat· mounted to the 1op of a vehicle. Tite vehicle
ellites are better. is driven over the roads to be digitized. tlte
Successful differential correction al$0 rover data differential corrected, and then
requires near simultaneity in the base and exported as a data layer suit.able for GJS.
rover measureruems. Errors change rapidly
through time. lft be base and rover rueasure-
206 GIS Fundamentals
Post-processed
differential
GNSS COf•
rection t'Olltr and bo~ stohon
(Iota ore, tronsported ro o Flaurt< S..15.: P,o$t•pro·
computer, and the chfterennol «ucd IUld real-time dif.
corrtctlOl'ls from me bOse sto, (crential GNSS correction.
ho~ Gf't oppl~ ,o rht «:,.·tr CSOto
- - --'""'"'""
D - - -- -- -
bose sto1,on
Real-time
differential
GNSS cor-
rection
bose station
Chapter 5: GNSS 207
through vegetation and otherobstacles. or systenL For example, the U.S. Coast Guard
cell phone ne1works are also often used. The bas established a set of GPS radio beacons
base station collects a GNSS sig,ml and cal- in Nonh America that broadcast a standard-
culates range djstances. The error is c.alcu. ized correction signal (Figure s.16). A com.
lated for eaeb. rouge distance. The magnitude patible GPS receiver near these beacons can
or
and direction each error is passed 10 the use 1be signal ror differemial correc1ion.
radio transnli'tter. along with infom1ation on These GPS behcon ,.ecefrers typically have
the timing an~ satellite constellation used. an addi1iooal antenna and erectrooics for
This continuous s1ream of corrections is processing the beacon signal. They form pan
broadcast via the base station radio and of a National Oifferemial GPS system
an1eru1a. (NDGPS) under development through a col-
Roving GNSS receivers are out-fined labora1ion of tile U.S. Oepanments of Trans-
with a radio. <.ell phone. or other communi· portation. Homeland Securi•y. and others.
cation system. and any receiver within lhc NDGPS will support navigation and posi-
broadcast ran.ge of the base station may tioning in areas distant from me Coast Guard
receive the correction signal The roving network.
receil'er is simultaneously recording GNSS
data and calculating position fixes. Each Real-Time Kinematic and Virtual
position fix by the roving receiver is Reference Stations
matched to tile corresponding correction
from the base station. TI1e appropriate cor- The highest accuracy differential correc.
rection is then applied to eac.h fix and accu. 1io11 is provided with dual.frequency. carrier
rare field loeations are computed in real phase positioning. often used as real·time
time. ki11e111ade (RTK) GNSS. The amounl of ion-
ospheric delay is different for differen1 fre-
R~l-tifilt differential eomttion quencies. so by comparing signals. such as
requires a broadcasting base station: how- theGPS carriers LI and L2. the ionospheric
ever. eve[)' user is not requfred to establish a delays may be es1ima1ed and removed.
base staliou and complete commuoica1ioo While single-frequency positions collec1ed
• ..
Fiaun- S.16: 11te foc11tion of rndio bc;1con st~tioos (dot,) in 1hc cent~ U.S.. in April. lOJ S. Oist.;incCf
frun the u.c-lU'C&I bc-.a~u are ,bown in various shades ofgrny.
Chapter 5: GNSS 207
through vegetation and otherobstacles. or systenL For example, the U.S. Coast Guard
cell phone ne1works are also often used. The bas established a set of GPS radio beacons
base station collects a GNSS sig,ml and cal- in Nonh America that broadcast a standard-
culates range djstances. The error is c.alcu. ized correction signal (Figure s.16). A com.
lated for eaeb. rouge distance. The magnitude patible GPS receiver near these beacons can
or
and direction each error is passed 10 the use 1be signal ror differemial correc1ion.
radio transnli'tter. along with infom1ation on These GPS behcon ,.ecefrers typically have
the timing an~ satellite constellation used. an addi1iooal antenna and erectrooics for
This continuous s1ream of corrections is processing the beacon signal. They form pan
broadcast via the base station radio and of a National Oifferemial GPS system
an1eru1a. (NDGPS) under development through a col-
Roving GNSS receivers are out-fined labora1ion of tile U.S. Oepanments of Trans-
with a radio. <.ell phone. or other communi· portation. Homeland Securi•y. and others.
cation system. and any receiver within lhc NDGPS will support navigation and posi-
broadcast ran.ge of the base station may tioning in areas distant from me Coast Guard
receive the correction signal The roving network.
receil'er is simultaneously recording GNSS
data and calculating position fixes. Each Real-Time Kinematic and Virtual
position fix by the roving receiver is Reference Stations
matched to tile corresponding correction
from the base station. TI1e appropriate cor- The highest accuracy differential correc.
rection is then applied to eac.h fix and accu. 1io11 is provided with dual.frequency. carrier
rare field loeations are computed in real phase positioning. often used as real·time
time. ki11e111ade (RTK) GNSS. The amounl of ion-
ospheric delay is different for differen1 fre-
R~l-tifilt differential eomttion quencies. so by comparing signals. such as
requires a broadcasting base station: how- theGPS carriers LI and L2. the ionospheric
ever. eve[)' user is not requfred to establish a delays may be es1ima1ed and removed.
base staliou and complete commuoica1ioo While single-frequency positions collec1ed
• ..
Fiaun- S.16: 11te foc11tion of rndio bc;1con st~tioos (dot,) in 1hc cent~ U.S.. in April. lOJ S. Oist.;incCf
frun the u.c-lU'C&I bc-.a~u are ,bown in various shades ofgrny.
208 GIS Fundamentals
for periods of less than an hour are typically nath1es because almost all mobile phones
in error by 1ens of cen1ime1ers (a half-fool) may col1Jlect 10 tlte Imeme1 ,~a an NTRJP
or mor<, dual-frequency GNSS are often client applkation and 10 a GNSS receiver via
accurale to a few centimeters (an inch) or bluetooth. TI1is remove. the need for a radio,
bener. geni.ng a radio license as required for some
RTK is such a powerful technology that jurisdictions/radio frequencies. the comma-
many state and national govenunents are nication range is generally longer, and the:e
establishing a dense constellmion of dual- is les1 chance of radio interference. One
freq ueacy receivers in a Vrnual Rtference does need a mobile phone. coverage. and
S1a1io11 network (YRS). Stations are spaced often registration or credentials from the
in a ne1.work over some n..~ion such that a provider. but these are available in much of
roving receiver is never more than an accept· 1be world. Many GNSS receivers suppon
able distance from a base (Figure 5-17). The differential correction 1hrough an NTRJP
systems provide single- or dual-frequency protocol.
base station data broadcast in a standard way There are disadvantages 10 RTK GNSS.
over a given radio or cellphone signal, along The receivers are more expensive. althou~b
with OOse srn1ioo lnfonnation. A roving prices are dropping. TJ,e roving RTK
receivers may identify the closest or best receiver mus, be closer to the base station for
local receiver. and compare base signals to highest accuracies. typically within tens o:"
roving signals 10 obtain positions to wi1lrin a kilometers. This requires either a denser net-
few cen1ime1ers while in the field. work of base stations. or 1ba1 the RTK users
Networks oflen distribute correction sig· set up their own base slat ion. Finally. as with
nals using a Netwo,·k Tro11spor1 ofRIQ\1 ,ria all carrier phase positioning. saielli1e signals
/111e11,e, Pro1ocol (NTRJP). This requires m11s1 be continuously tracked for longer
pe.i iuds, ahltuugb mud~1J1 1ixeivt:'J!i have:.
less field equipment than radio-based alter-
reduced this time 10 a few 10 tens ofminutes.
than 3 111. an improvement over uncorrected systems. Some of these adopt RTK-like
C/Acode (Figure 5-12). WAAS is often techniques. Examples include Trimble Cen-
unavailable at extreme northern latinides. terPoint RTX. NavCom StarFire. and Veri-
where equatorial geostationary satellites are pos TerraStar. Typically. these systems are
often not visible. sold as a s.ubscription service, in which a
moothly or annual fee is paid to access tfle
real.time improved satellite positioning and
Precise Point Position Ing other data. These data are ofien broadcas1
Precise point positioning (PPP) is via a cellular modem or satellite radio.
another alternative to differen1ial correction.
This technique uses precise satellite, clock,
A Caution on Datums
and orbit measurements to solve for point
locations. These improved satellite ephe- Errors may easily be injected into GNSS
merides (orbit data) are usually only avail- data due to improper datum transfom1ations.
able a few hours 10 days after any GNSS One must ll>e cautious in using GNSS data.
observatioo. PPP bas the advantage of centi- either directly. or after applying a differential
meter.level, world\\•ide positioning without correction. bec.ause. the datum transforma 4
need of a base Station. Unfortunately. PPP tion used is often not transparent. and is
requires complex calculations. on loog. unin- often poorly documented. The U.S.
tem.1pted observations for hig.l1est accura. NAVSTAR GPS system provides a good.
cies. e.g .. several to tens o( hours of example of the confusion that may occur.
cominuous GNSS obse,vatioos. PPP also GPS satellite locations are reported in
have a few hours to few days delay in pro- the most cutrrent ITRF/WGS84 datum,
cessing because the acn,ally satellite posi- although they are often convened to other
tions must be re-eStimated post-facto. There dan,ms in software before being displayed to
are a number of free services that do PPP the user. P~actices may change in 1he U.S.
calculations, e.g.. the U.S NGS OPUS ser- with the adoption of the NATRF2022. bm
vice or the CNRS-PPP run by the Canadian now the OPS-measured ITRF/WGS84
government. as well as privately.run ser· datums are quite different from the datums
vices. e.g. a free option in Trimble's Center· used with most GIS data in the United
point RfX. States. for example. NAD83(20 1l). We mttSt
PPP is best used when long observation transfonn data collected in the WGS84
1imes are possible. few poims are required. dan,m 10 the NA0 83(2011 ).
there are relatively ,mobstrncted sky ,iews. As noted earlier, ignoring orselecting
and positions aren't neede.d within a day of the wrong <latum transformation wilJ intro-
collectioa They are most appropriate for duce error imo our newly-collected data.
measuring fixed assets. e.g. when locating a GNSS vendors typically provide an option to
f1Xed location for an RTK base station. or for repon data in a commonly used coordinate
monumenred points from whi<:h laser or systems and specific danim: for example. the
other measurements will be collected. PPP user may set the GNSS receiver to display
isn't appropriate for measuring most geo· UTM or Slate Plane coordinates in the
graphic fearures, or for most areas where NAD83(2011) datum. However. the GNSS
trees. temin. or buildings obs,truct much of softwares sometimes do not clearly idenrify
the sky. the datum transformation used. As noted in
Integrated systems of satellite obsel\'3· Chapter 3. early versions of the NADS3
tioos and corumunkations are available to dan1m that underlie the UTl'.•t system were
provide PPP.like accuracies in ne.ar real and remaim up to 2 m (6 feer) different from
time. wi1hout RTK connec,iou to a second the WGS84 damm. so you cannot assmne
grotmd-based station. and with shorter 1bey are the same. as is common practice.
observation periods the OPUS or similar You must carefully c.hoose the correct datum
210 GIS Fundamentals
GNSS Applications
Tracking. navigation. field digitizing. Field Digitiziting
and surveymg are the main appl.ications of
GNSS. Navigation is finding a way or route. Field digitizing is a primary applica1ion
and !tacking involves noting 1he l0<ation or of GNSS in GIS. Data may 'be recorded
objec.ts thr°':3gh time ..A common example is direc1ly in 1be field 10 update poim. line. or
t:rackmg delJvery vehicles in near re.al time. area loc.ations. Features are. visited or 1ra-
Large delivery companies would like 10 versed in the field, and an appropriate m1m-
J.a1ow where cheir vans are a1 all times. Vehi- ber or GNSS fixes collec1edl. GNSS
cles equipped with a GNSS receiver and a receivers have been carried in automobiles..
radio or phone link may repon back 10 a dis- on boa1s, bicycles, and helmets. or by hand
patch office every few seconds. Icons on a 10 cap111re the coordinate locmious or poims
digital map are used to represent vehicles, and boundaries (Figure 5-19).
and a quick glance can reveal which vehicle GNSS da1a are oflen more accurate than
is nearest a delivery or retrieval site. or da1a collected from the lligbes1.quality digi-
which driver overly frequents a donut shop. 1al images. For e.xample, RTK GNSS data
Navigation is a second common GNSS typically have accuracies bener than 5 cm.
application. GNSS receivers have been and often below 2 cm. while accuracies are
dev_eloped specifically for naviga1ion, witl1 often near 15 to 50 cm for the highest-reso-
d1g11al maps or compasses set into on-screen lution sa1elli1e images. and for national
displays (Figure 5-1 8). These GNSS receiv- aerial im~ge pr~ms. Oro11es often provide
ers and digital maps are extremely special- images with cenluneter-leve I resolutions. but
ized GIS sysrems. These sys1ems are useful accuracies are generally in 1he 30 cm range
when collecti.ng or verifying spatial data, or poorer. Precise differemial correction of
such as to navigate to the approximate vicin- carrier plmse GNSS data often yield centi-
ity or field measuremem plots. mel'er-leve.l accuracies. far bettertl1an can be
obtained from digitizing a~10os1 all images.
GNSS is oflen used 10 directly digi1ize
new control poims. Remember 1hat co111rol
points are used to correc1 and 1ransfom1
fiJl.ll'~ S-1.8: A OPS r«dyer developed for marine (left) ;mdaerial na,i gation (right. cour-
lCS)' Gamun COf'P. and Tnwbtc Lid.),.
212 GIS Fundamentals
image da1a or maps 10 real-world coordi- 1ion a viable al1ernative for mos, organiza-
nates. Aerial images may be available, and 1ions chat collec1 spacial dala.
the coordinales may be unknown for fea1ures Feamre digicizing wi1h GNSS often
visible on cbe aerial image. Coucrol poims involves 1be capmre ofbodt coordina1e and
may be dif!icul! or impossible 10 obcain auribu1e dam. Typically, 1he GNSS recei,•er
directly from surveys or from the informa· is activated and detects signals froll! a set of
cion ploned on existing lllllps. partic,~arly sa1ellites. A file is opened and J)Qsilion fixes
when graticule or gridlines are absent are logged at a sel rate, such as every two
GNSS offers a dic-ect method for measuring seconds_ A11ribu1e daca may also be entered.
cbe coordi.na1es for po1en1ial concrol poi.ncs either while 1he position fixes are being col-
represented on 1he image or map. Road lected. or before or afler positional data col-
intersections or other points may be identi· lec1ion. In some software. 1be posi1ioo fixes
fied and then visi1ed with a GNSS receiver. may be iagged or identified. For example. a
GNSS-measured control points are 1he specific comer may be tagged \vhile digitiz-
basis for almos1 all curreol projec1s tlia1 per- ing a line. Multiple feacures n,ay be col-
form analycical correc1ion of aerial imagery lected in one file and 1be idemities
(see Chapter 6). Most image data are not ini- maintained via attached attributes. Oa1a are
tially in a map coordina1e system. yel images processed as needed 10 improve accuracy.
are often panicularly useful for developing and convened 10 a formal compatible wiui
or updating spacial data. Aerial photographs 1he GIS sys1en1 in use. GNSS da-ia collection
contain detailed information. However. and data reduction tools often provide the
aerial photographs are subjec, 10 geomecric abilicy to edic. spLi1. or aggregace collected
dis1ortion. These .errors may be ru.ialycically data, for example. converting multiple fixes
correcled dirough sui1able methods (see into a single point average. These functions
Cbap1er 6). buc 1hese me1bods require sev- may be applied for all posi1ion fcxes in a file.
eral control points per image. or at leaSt per or for a subset of position fi~es embedded in
project, when multiple, overlapping aerial a GNSS file.
pho1ograpbs are used. GNSS significao1ly Large. field-pormble displays and
reduces 1he cos1 of control poin1 collection. advanced editing software may be combined
thereby making single- or mulliphoto correc-. with real-tjme differential correction 10
improve field digi1izing. Table1 compmers
Chapter 5: GNSS 213
Fiaurr- 5-21: Fututt$ ma)· Ix c11tettd 11nd edited i11 tbc field u,.iog II GPS receiver 1111d :ipproprintc , oft\\'tlrc
(eoort«y Trimble. LtcL).
214 GIS Fundamentals
Fi1u1•t S-12: All c~amplc of scn:ral 6cld-di#i.ii:tcd liocs, usiug: a profC;Ssio11,aJ-,grndc: GNSS r«<ivc:r in a
ttccp nwinc with dclliC fotcsl co,•ct. Sinilc fixes ,-ary .su~sl:tt11i:llly nbotu rbc: '!'K loclltions. Tbc s~13fe,foc
cJTOrs l)J"C lMgcr th~i :l\'<fflgc s Mid other S\1tlll113T)' t1atu hcs rq>0r1cd for I) rcc:cwcr. ca~cd by t<mlm ~od
, ·cgctatioo obsmKtioo. reducing POOP, o.nd imreasiug mu1ti·~tb i.igoffl..
Chapter 5: GNSS 215
gle-fix collection times rarely take longer rearrangeme11t of the satellite coustellation,
than a few seconds to 01inu1es when using such as from point c 10 point b u.1 Figure 5·
modem. professional-grade receivers and 23. However. on average. an obstnicted sky
when forest canopy is the primary sl-y results in a reduced constellation of GNSS
obstruc.tioa CoUection times will increase satellites and higher PDOPs when compared
correspondingly when multiple fixes are 10 fiat terrain. This problem is p.1nicularly
required per feature. vexing in urban settings because the horizon
Terrain may block sa1elli1es. a signifi- angles change subs1amially over sbon dis-
cant problem when the blocked satellites are tances. This makes it difficult to predict
greater than t5° above the local horizontal when GNSS satellite coverage will be. ade.-
plane. Sa1elli1es less than I 5°above the local quate. and thus plan data collection effons.
horizontal plane are of Limited use. even in Forest. terrain. and building effects may
open conditions, because they exhibit large occur together, further reducing accuracies
range errors due 10 aunospberic imerference. and decreasing efficiencies. This is a com-
GNSS receivers designed forGIS data col- mon occurrence in foresled, mo1u11ainous
lection typicalJy provide senings that auto. terrain. and in urban areas with both 1all
matically reject sa1elLi1es below a specified buildings and mature trees.
horizon angle. We can use equipment 10 improve
Terrain obsuiactions often rise above GNSS accuracy and efficiency. Dual fre.
15°. such as wheo moumains. hillslopes. quency receivers, describe earlier. are more
buildings. or canyon walls reduce the mun- accurate from single-frequency receivers.
ber of visible GNSS satellites (Figure 5-23). and so improve positioning. Adding RTK or
Terrain obstruction often reduces c0Uec:1ion another form of differential corr,ection
efficiencies and accuracies. Because the greatly improves accuracy. So does the use
GNSS sigMls do :not pass llilough soil. rock. of au anteillla with a ground plane to elimi-
wood. or concrete. any obstructed satellite nate multi-path signals. as described earlier.
crurnot be used for GNSS positioning. ln None of these technical fixes wi II improve
some instances. 3 short wait may resull in a GNSS coUectioo efficiency, that is. the num·
Figm'f' S--2J: OPS satcUitc signals may be blocked by terrain or built &trucnttc&. Th.is redoccli the «m·
s.tcU.atioo of a\'111ilflblc sa1dl.itcs. Ulci-c-as.in; error 111od rcduci!'.13 coll~ti:ou efficiency, Herc, satellites O
,md b nrc \'itiblc with con'tlponding loc:1.I horil(n, nnglcs of u nnd ~. The tigiul fn>in s111cllitc C is
blocked by local tcm.in.
Chapter 5: GNSS 217
,.t
,Any POOP. :;;
l: POOP< 8 ~ POOP< I.
i! 80 •. "' sote11r,es
80
".
i 80
!! 60 ;- ~
~ ~
l •o f •o
t ~
j ,o i 20 ) 20
l O
15 25 35 45
l O
J.5 25 35 45
§. 0 ' - - - -.......- -
15 Z5 35 •5
mcon honz°" devoflO" (deg) mean horirot1 elr:YOtioti (deg) IUOI\ h0t,2on dc-Jouon (deg)
Flem~ S-25: The pc:rccntagc of OPS positiou futcs twit~ su«csd\UJy eo11«:tcd d«rcascs in ,·alleys, or in
any other loca1iou ,,.,be:t'C lhc-:au;lc 10 1.bc horizon increases. Th.is n:i.,y be ofuc:1 somcwb11 by nlmow1n3
poorn (brgcr) POOPs, IIS shown in dtc lcAnlOSl \'S. rigl111i10Sl figures 11.bo\'c (n.,l.nptcd from Scrin.ti 11nd Flo-
m. 199$),
nal mengtb 1hresbolds. but 1his often comes collec1ion systems 10 coOlrol these multiple
a11he expense of decreased acc11rncy. trade-offs.
Sophisticated receivers allow multiple set· The availability of multi-path rejection
tings: a 1arge1 POOP. above which the antennas is a primary difference between
receiver wiUsearch for bener satellite con- GIS-grade receivers aod recrea1ionaJreceiv-
stellations. and a maximum POOP, above ers costing much less. Recreational receivers
which data collection will cease. This aUows are substantially less accura1e in, obstn,cted
the user to b.1lance the trade-off between terrain because they often place priority on
accuracy and efficiency. collecting a fi,. ac<epting some degrndacion
Some GNSS receivers allow adjust- in accuracy. If you are los1 in the woods you
ments in 1he threshold for acceptable signal would rather know where you are withi.n a
strength. For example, satellite signals that few me1ers than wait for a higher.accuracy
pass 1brougb a fores, canopy are weaker. signal or constellation. Recrea1ioual receiver
Including these weaker signals improves the tllresbolds for signal streng,h or POOP are
number and ofleJL the distribution of satel· often configured for highest eftkiency and
lites. !hereby increasing collec1ion efficiency thus lowest accuracy under obstructed con-
and perhaps accurncy. However. weak sig- ditions. and 1bese thresholds often may not
nals may also resuh front reflected or mul. be adjusted by the user. Irrespective of the
1ipatb traosmissioas. As described earlier, equipment, there is a 1rade.offbe~veen 1he
multipath signals have larger rnnge errors. accep1able signal strength and 1he introduc-
Lowering the threshold for acceptable signal tion of multipath errors. Setting the maxi-
strength is Likely co increase positiooal error. m,uu acceptable POOP higher or accep1able
as it increases the likeliho-Od of multipath signal strength thresholds lower will
measurements. However, some data are increase efficiency of collection. bul often at
often bener than ,ione, and lowering the the cost of increased error.
POOP tlireshold for collection is sometimes
the only way to collect data.
GNSS receivers specifically designed
for GIS data collection may use multiple
techniques to improve accuracy and effi.
ciency. Manufoc1\lrers have invested sub--
stantially in optimizing antenna design and
Chapter 5: GNSS 219
H = h-N
surface
..-
...-- orthome,r,c
•.
height
. h t ~ -- h IH
ellipsoidal t,e19 -- -----------9._eo;d
,,,
,,,,------- I N ;:;;;;ii,;;~~~-----,,,
ellipsoid ',,.
,,,
------ ,,
fiJttN' 5-?7: Ca1c.,.il.atioo of or1hometric height from cllip1,oidal bclgh1, Mo,1 ONSS $)'1tc:m, reporl ellip-
soidal heights. wbich must be coun:rtcd to onhomctric heights before most uses, \\~ may estimate gcoidal
bc-igbt in k\'cral 'Wll)'t. inchidin3 de\'clopcd *-co id.al models. ot ncntb)• 3C'Odctic «mtrol points.
Chapler 5: GNSS 221
If the reoei,·er does not support geoidal not constrained as much in the venical direc-
height estilllation. and if the user bas no tion. because sa1elti1es are not visible on the
access to computing or software required for opposire side of the Earth. Range uncenainry
a geoid model, one may estimate geoidal is not constrained for the ne.ar-vertical satel-
height by referencing a ne111ty comrol point. lites. and so venical measur.emeut errors are
(or example, ~n NGS control data sheer in larger. !J'nsing GPS (or precise height mea-
the United St.ates. These sheets usually pro. surements. occupation limes must be longer
vide geoidal lteighrs for listed points. For to oblain a heigh, accuracy similar 10 hori·
many regions of1he world, geoidal heigh1s zonral position.ii acctJracy.
do not vary rapidly across space, and the
spatial variability of geoidal heighrs may be
es1ima1ed by retrieving heights for several GNSS Tracking
nearby heigh1control points. The nearest. an GNSS ,racking of people. vehicles.
average. or some similar combination of packages, or animals is an innovative and
geoidal beigh1s may be used in equation growing application of GNSS. GNSS receiv-
(5.2) 10 calculate ortbometric height. given a ers are routinely placed on trucks. ships.
measurement of ordiometric height from the buses, boats, or other transport vehicles.
GNSS. Variauion in geoid beigltt among 1he These receivers are often part of systems 1hat
neart>y srations should give an estima1e of include inforns11ion on local conditions_
1he variability in local geoid heiglus. and the speed or trnvel, and perhaps the condition of
added local uncertaimy in estima1ing ortho- tl,e shipped equipment or cargo.
metric height using equation 5.2.
GNSS is also increasil\gly applied to
In additiou 10 geoidal heigh1uncertainty. track individual organisms. This is revolu-
vertical GNSS height measuremems are not tionizing animal movement .analysis because
as accw-ale as horizontal measurements, and of the frequency and density of poims 1bm
so require greater care. longer acquisition may becollec1ed (Figure 5,28). More posi·
times. and more stringem processing for a tion f1Xes can be collected in a momh using
given level of accuracy. GNSS signals are GNSS equipment da, n may be collected in a
Fijt'H't' S.ZS: A wildd.>ccst 61 with a ONSS crai.-ki.ug collar. 11ic antcuna is \'i&iblc as di<; white patch on
top of 1bc collar, and the power supply and data louiua: Jiousi.ua: i.Hisiblc al tl1t bon01:n of die collar
(cc>tu1c:i)' Gordon T. Cul. Lotck \Vi.rc:lc:H hl<.).
222 GIS Fundamentals
decade using alternative methods {figure 5, utes 10 every few days. and data may be peri-
29). odically downloaded via a mdio link.
Animal movement analysis has long Systems may be ser up with an automatic or
been based on obser\'llliou of recognizable radio-ac1iva1ed drop mechanism so thal data
individuals. Each time a known animal is may be downloaded and 1he receiver re,
seen.. the location is noted. TI1e number of
used. While only recenlly developed. GNSS-
posi1iou fixes is oflen low. bo,vever, because based animal tracking units are currently in
some animals are difficul1 to spot. elusive. or use oi, all cou1iueu1s in lhe s1udy of 1brea1,
live in areas of dense vegetation or varied e,1ed. endangered. or imponani species.
terrain. Early alternatives to direct human GNSS !racking for individual or flee1s
obsen<a1io11s were based prim,,rily on radio- of vehicles 1ypically involves a munber of
1eleme11y . Radiotelemetry involves the use subsyslems (Figure 5-30). GNSS receivers
of a transmitting and receiving radio unit 10 and radio transmitters must be placed on
de1en11.ine animal location. A transmitting each vehicle 10 record and 1ranSJ.ni1 posi1ion.
radio is attached to an animal. and a techni- Sa1elli1e or grotmd-based receiver neiworks
cian in the field wes a radio receiver lo collect and transmit positional and other data
de1ennine lbe posilion oflhe animal. Mea· 10 a compu1er runnu,g a tracking and man·
S\lfemeocs from several directions are com- agemen1 program 1ha1 may be used 10 dis-
bined, aud the approximate location of the play. analyze, and control vehicle
animal may be ploned. movemen1. lnformatiou or ins1n1c1ions may
GNSS animal tracking is a substantial be passed back 10 vehicles on 1he road.
improve111en1 over pre,1ious methods. GNSS GNSS..aided vehicle numag~men1 may
unilS are fn 10 animals. usually by a harness be combined wi1b 01her spa1ial da1a in a GIS
or collars (Figure 5-2$). The animals are framework to add inunense value 10 spatial
released, Mid Jl(>SitioMI iilfolilliltioa iliialyses. Vehicle location can be moni1orl!d
recorded by 1be GNSS receiver. Logging in real lime. and compared 10 delivery loca-
intenials are variable, from every few min- tions. Delivery planning may be optimized
..
• •lit •. "·"'·
..,
r. . .. j
·-·~
'
•
.'
.:.. :· • •··..,
e
~
. •,
.. •
·.,,,rr~'-1 ....... ,
..,_. ..
.. ... . .. • •••
• •• .. :&
~ .. -· ..~
t'
•• • I
,..... . .. .
. . ..--:......,,..,
• ,-...,, •
, .......
..
: . . ... ..
..... ...
.. "'
..• ..!f
•
,. . ~
-
~ ~'
.. ··)•... . .-. , ... ... .
.., .... :
.
. ~.
~ . . ' . . .!•·
~
~.:f.,. ~-
.. ~
. . ,. ··..... ~ . ~~":.fr'
. -~ -. ;¥~-i.ii¥
~~.rr
• •
FJa:ure S..29: GNSS tn1clciua: data for wildebeest io dtc Sun 13cti and Na:oronpo Cralct tt3iom of Tau-
:t2ni:1. Sn1'lict of 1nwd routes and lubittu Uk by migrtuorv rutim."11& lll'C suhlr:tntiillly intpro,'ed by GPS
da1a collcctio11 (coW1~)' $, Thirgood. A, Mosser-. and M. boruCT).
Chapter 5: GNSS 223
This is a vaJid assumption when t11e errors Plane surveys typically start at or are
inherent wilh ignoring lhe Earth's curvature traceable back 10 a larger survey network
aJe smalJ compared co the accuracy require- through Bench Marks (precisely located
menls oflhe survey. or to the. errors inherent points described in Chapter 3), or through
in the survey measuremems 1bemselves. The local or project-specific control poims estab-
distance error due 10 assumu1g a Oat rather lished from high-accuracy GNSS positions.
than curved surf.ice over lO km (6 mi) is These marks or control poi1us often se.rve as
0.72 cm (0.28 in). Therefore, plane surveys sraning and endi11g points of a survey. 10
are l)']lically res1ric1ed 10 distances under a allow accuracy verification.
few tens of kilometers. This restriction is Distance and angle measurements are
me1 in many surveys, and a subs1antial the primary field activities in plane survey-
majority ofU,e lines and points surveyed 10 ing. Distances are measured between rwo
date have been measured using plane S\Ir· sun·~, stations, which are points occupied
veying methods. Plane surveying is suffi. on the ground. The directioo is specified by
cien1 for mos'l s11bdivisions. public works. an angle between a standard direction. usu-
construction projects. and propeny s11rveys. ally norll1or soolh. and the direction of the
Hisrorica Uy. plane surveys have been surveyed line between !he 1wo s1a1ious (fig-
conducted wiJh optkal i11stnune1.11s similar ure 5-33). The distance is iru some standard
to those described for geodetic surveys. units. forexample. standard intemalional
These insrrumenr.s typically have angle meters.
gages in the horizomal and vertical planes There are lwo conunon ways of specify.
and an optica l sight. usually with some ing angles. The first uses the azimulh. An
degree of telescopic magnification. The azinnuh :mgle is measured bl a clockwise
i11stn11nents have various names. including. direction. rypica Uy relative •o grid or geo-
in increasing order of sophistic~tion and gmpllie uollli (Figure 5-33). Azinlutlis ,,ary
capabilities. • level. a 1ransi1. a theodolite. from O10 360 degrees. Nore that azinmrhs
and a 101al station (Figure 5-32). typically reference grid non:h. ahhougJ, they
may also be specified relath.ie to magnetic
nonb. so care should be taken in clarifying
which refereJtce is used.
Azimuth Angles
s . 180°
Fl&m'f' S-33: Augl« in n.-n:ys may be reported
Fiaurf' S.J 2: A $Ul'\'cyi1l3 i1ucnuue1ll for e-ollee-1· :u :u.iniu1b 11.1~~.1. inca, w-ed clocl.•wise ttlati\'e co
ing. e-oordina1c geometry dn1a. North. and r.mgi11g &om Oto 360.
226 GIS Fundamentals
azimuth involves noting the reference direc- know beonng, then: know be(lf"W'l9. the,t
oz • lSO•r"IUmtl"iC oz 180~r"lumer,c
tion (Nor$). and adding or subtracting from pert ot br. e 9 . port of br, e.9 ..
constants. depending on quadrant (Figure 5-
35). We often convert from bearings 10 azi-
muths when calculating positions from a
sequential set of distance and angle measure- s
meUls tluu form a.survey.
A gure 5-35: Cou\'ns:ion fonmi.las: &om be:nius
Many surveys are n·averses, a series of 10 112:imu1.b, by qundrnn1.
connected lines lba1 have a marked begin-
ning and ending point. Trave™'s iypically
start at a known control poinl. or start at a used to calculate the s1ation coordinates.
poim 1ha1 bas been refereuce<l 10 a known Subsequenl distance and angle measure-
control poin1. As described in 1he preceding men1s may be taken. and in mru used 10 caJ.
sections. the control points are often part ofa cula1e the coordinates of subsequem
geodetic eomrol network. or pM't of a suli- stations. Atraverse may be ope11, with a dif-
neiworl< established by a municipal sur- ferent beginning and eoding point or closed.
veyor. A distance and angle are me~1Sttred wi1h the traverse eveo1Ually connec1iug back
10 the starting location. Most of the millions
from the control point 10 the first survey Sill·
1ion. Coomi11are geome11y (COGO) may be of miles of property lines in North America
have been cs1ablisbed via plane surveys of
open and closed traverses.
Bearing Angles
Coordina1e geomell)•cousis1s of a start·
ing point (a station) and• list of direc1ions
N (bearings) and distnnces to subsequent sta·
,-~b~rlng • 1ions. The COGO defines a connec1e<I set of
N71"E points from ,be s1arting station 10 each sub-
sequent station. A sample COGO descrip-
1ion rouows:
·The starting point is a 1·indl iron rod
lhat is approximately 102.4 ft norlh and 43.1
ft west of lhe northeast quarter of !he south-
east quarter section of section 16 of Town-
ship 24 North, Range 16 East, of lhe 2nd
Principal Meridian. Starting from the said
poin~ !hence 102.7 llon a bearing north
beomlg • SJr W S 72.3 degrees east. lo a 1-inch iPOO pipe;
thence 429.6 ft on a bearing soulh. 64.3
fia:urt 5..J4; Oircctious may be lpCCifiecl AS degrees east to a 2-inch iron pipe ... •
bearings, with a North or South ref«C11<:c:. aod
tin E:utor\Ves1 tu'*-le.
Chapter 5: GNSS 227
Basic trigonometric functions are used designed for impon into various OIS soft·
to calcula1e 1be coordinaies for each survey ware systems.
station. These stations are located at 1he ver- COCO calculations are illustrated on the
1ices that defi.ne lines orareas of interest In left of Figure 5-36. S1arting from a known
the J"'SI. these dis1ance and bearing da1a
were manually ploned onto paper maps.
coordinate. "o, Yo· we measure a distance L
and an angle 0. We may 1hen calculate the
Most survey data are now transferred disumees in the>< and y directions to another
direc1Jy 10 spatial da1a fonnats from the sur- sei of coordinates. x1and y1• The coordinates
veying instrumeoi or associated software. of x1 and y1 are obtained by addition of the
Field measuremems may be direc1ly appropria1e 1rigonoruetric ftructious. COGO
entered and coordinate loca1ions derived in calcula1ions may then be repeated, using lhe
the GIS sof~vare. or 1he coordina1e calcula- x1 and y1 coordinate.s as dte .new starting
1ions may be perfonned in the surveying location for calculating the posi1ion of the
instrumem first. Many currem S\u·veying next mwerse station.
instruments cootain an integrated compmer The right side of Figure 5-36 shows a
and provide for digital data collection and sequence of measurements for a traverse.
storage. Coordinates may be 1agged with S1arting •• x,. y,. 1he dis1ance Aand bearing
attribute dara in the field. at the measure- angle. here 45°. are measured to station x.n,
ment location. These data are then dO\vn- y..,. The bearing and dis1ance are 1hen mea-
loaded directly from a coordinate measuring sured to the next s1ation. with coordioatesxn.
device to a computer. Specialized surveying yn· Distances and angles are measured for all
programs may be used forerror checking subsequent stations. Starting with the known
and other processing. Many of these survey-
ing packages will then 011tpu1 data in fonuats
coordinates at the starting station. xs· Ys·
!- d~ -
1
L
•, '
•
I
(I
!
x,. ~ ·
>ti• x ..• dx
~·Y,, •dv D
dx • L· sin (0)
dy - l·COS (0)
tneretore
x 1• Xo· l · sin (0)
Y1 = Y0 • L · COS(O)
F'lgntt s..36: Coordinllte geometry (COOO) i11ows 1.be c:1kub1iori of eoord.i1l3fe loc:11i0115 from open
(shown 111,,o,·e) <>r t:lowd lnwcnc.,. Di,tnnec. ruid angle mc.nwrc1ncnts arc. combined with 1rigo11omelric fOJ'·
ruulas to calculate coc1'din11.1cs.
228 GIS Fundamentals
coordinates for aU Ofher stations are c.alcu- GNSS is now used for most measure..
la1ed using COGO ronuulas. mems farther than a few hundred meters.
Assigning the proper signs to dx and dy COGO is more commonly applied in GIS
is imponan1 in COGO calculations. An when collecting data with a GNSS receiver
incorrect sign for any leg of the iraverse will in combination with a laser range finder.
propagate lhrougb all subsequent coordi- Laser rnngefinders emit a focused. cobere111
nates, causing each 10 be in error. The proper beam of light to calculate distance. The max-
sign is obtained when directions arc imwn range depends on the size and reflec-
expre.ssed as azimuths and a standard sel of 1ive propenies of the 1aageL but many
fommJas is used. moderately priced lasers are accurate up 10
several hundred meters. Electronic com..
The trigonometric sine and cosine func. passes must be periodically calibrated and
tions return the proper magnitude and direc- adjusted for proper magnetic declination. but
tion of d, and dy when applied 10 azimuth can be quite accurate. Rangefinders often
angles. lf1rnverse angles are provided as also have venical angle gages. because all
bearings, they are typically converted to azi- measurementsare as.smued para Uel 10 the
muths first. using the rules shown in Figure datum plane. and non-horizontal me.asure-
5-35. remembering to con\'en the measured meo1s must be adjusted. Careful. laser range-
angles 1·0 radian units if those are required by finders measuremeurs c..111 be accurate to a
the spreadsheet or computer language used centimeter overa I 00 m distance, improving
for calculations. Sine and cosine values are 1he efficiency of GNSS dalll collec1ion.
then calculated and multiplied by tlte tra-
Terrestrial. Lhree.dimensional lasers are
verse leg distance. resulting in d, and dyval·
ues of1he correct length and direction. rapidly becoming common, and allbougll
Examples for all four quadrants are shown in curremly used primarily for s1rucnire analy-
Figure 5-37. sis, Ibey may be used for CilS data entry.
N
AZ • 349 dx•L· s,n(Az)
dx•l·sin(A.z) •100· sin(7l/57.295579)
• l to ·s,n(349/57.295579) · 94.55
· -20.99 dy•L·cos(Az)
• l00·cos(7!/57.295579}
dv•L·cos(Az) ·32.56
•llO·cos(349/57 295579) Az,71°
· 107.98
dx•L·sin(Az)
d,·L·s,n(Az) •84·sin(!65/57 2955 79)
~
• 76·sln(!87 /57.295579) •21.74
· -926
dy•l ·cos(Az)
...' cl,•L·eos(Az)
· 84·cos(l65/57.295579)
• 76·cos(!87157.295579) · -81.!4
• •7543
Az•l87" Azi!65°
s
Fi11111' S..37: Exrunplc calc:ul4tion1 of d:t 1tnd dy \"llluu, g:i\'cn ll l UJ'\'C)' ditltlncc (t) nnd rut 2t;imu1h :U!.glc
(Az). Note thnl tbc 5inc .ind c(»"ine fuoctions return the correct sign a, wcll M magnitude, so th111 the d.,: alld
dy values~ positi\·c or ncgath·c. as appropriate. Also 1100: t.h.11 many , prcadsbec1s aud coding: applicaitons
default 10 Ddtlln uuits in ,inc .t1tid eosfoc hlilctio1u, hen« the. dM11ion of dca,cc 3:Ua}es by tbc. COllVC':l"Sion
fac1or S7.29SS79.
Chapter 5: GNSS 229
Suggested Reading
Abidin. H. (2002). Fundamentals of GPS signals and data. In Bossler, 1. (Ed.). ,\,fn11-
1ml of Geospotinl Science n11d Technology. London: Taylor and Francis.
Bobbe. T. ( I 992). Real-time differential GPS for aerial surveying and remote sensing.
GPS World. 4: 18-22.
De<:ken. C.J .. Bolstad. P. V. (1996). Forest canopy. terrain. and distance effects on
globaJ pos.itioning system point accuracy. Forest Scle.11ce. 62:317-3 21.
Dominy. N.J.. Duncan. B. (2002). GPS and GIS in an African rain forest: Applica-
tions lo tropical ecology and conservation. Consen·ariou Ecology. 5:537·549.
Dow. J.M.. Neilan. R.E.. Rizos. C. (2009). The International GNSS Service in a
changing landscape of Global Navigation Satellite Systems. Joumal ofGeodery,
83: 191 · l 98.
Owolatzl..")I. B.. Trengove, E.. Smnhers. H., Mcintyre. J.A.. Maninsou. N.A. (2006).
Linking the global positioning system (GPS) 10 a personal digital assistant (POA)
ro supyort ruberculosis control in South Africa: a pilot study. J111enraU011al Jou,...
uni o/Henhh GeogmpMcs. 5:34.
Fix. R.A.. BU11. T.P. (1995). Global Positioning Systems: an effective way to map a
small area or catchment. Ennlt S11rface Processes and La11dforms. 20:S 17-827.
Gao, J. , Liu, Y.S. (2001 ). Applications of remote sensing, GIS and GPS in glaciology:
a review. Progress in Physical Geogmphy. 25:520-540.
Gao, J. (2002). lntegration of GPS with remote sensing and GIS: Reality and prospect.
Photog,.ommetric Engilleering &. Remote Se11si11g. 68:447-453.
Haibo. H .. Jinlong. L.. Xu. J .. Guo. H.. Wang. A. (2014). Performance assessment of
single- and dual-frequency BeiDou/GPS single,epoch kinematic positioning. GPS
So/11tio11s. 18:393-403.
Jagadeesh. G.R., Srikanlhan, T.. Zhan3. X.D. (2004). A map matching method for
GPS based real·time vehicle locanon. Jo11,.,,al o/Nm;gatio11. 57:429440.
Johnso,~ C.E., Banon. C.C. (2004). \Vhere in the world are my field plots? Using
OPS effecrively in environmental field snidies. F10111h:n ;"11 Ecology & the £in,i-
m1m1•mt. 2:475-482.
Kennedy, M. (1996). The Global Posirio11i11g SJO!em n11d G/S. Ann Arbor: Aun Arbor
Press.
232 GIS Fundamentals
Mintsis. G.. Basbas. S.• Papaioannou. P.. Taxiltaris. C.. Tziavos. J.N. (2004). Applica-
tions ofGPS tec.bnology in the laud transportation system. E11ropea11 Jo11r11al of
Operario11a/ Research, 152:399-409.
Nresset, E.• Jonmeister, T. (2002). Assessing point accuracy of OOPS under forest
canopy before data acquisition. iu the field. and after processing. Sca11di1,a1ia11
Journal ofForest Resen1-ch, I 7:35 I -358.
Odolinski. R.. Teunisseu. P.J. (2016). Single-frequency. dual-GNSS versus dual-fre-
Q,uency, single-GNSS: a low-<:ost and higl1-grade re<:eivers GPS-BDS RTK analy-
sis. Journal of Geodesy, 90: 1255-1278.
Scrinzi. G.. Floris. A. (1998). Global Positioning Systems (OPS). uoa nu ova xealta nel
rilevamento forestale, Ani del Convegno "Nuovi orizzonti pe.r l'assestamento
forestale" 14-56.
Small, E.D., \Vilsou. J..S .. Kimball. A.J. (2007). Methodology for the re-location of
pem1anent plot markers using spatial analysis. Nonhe,·r, Journal ofApplied For·
esr,y, 24:30.36.
Thirgood, S .• Mosser. A.. Tham. s. . Hopcraft. G.. Mwangomo. E.• Mleogeya. T..
Kllewo. M., Fryxell, J., Smclau, A.R.E .. Bomer. M. (2004). Can parks protect
migratory ·ungulates? l11e case of the Serengeti wildebeest Animal Conserrn,;011.
7: l 13- I 20.
Toledo-Moreo. R.. Betaille, D.• Peyret, F.• Laneuri~ J. (2009). Fusing GNSS. dead-
reckoning. and enhanced maps for road vehicle Jane-level naviga1ion. /EEE Jom·-
ual ofSelected Topics in Sigual Processlug. 3:798-809.
Van Sickle. C. (2008) GPSfor Laud S,m-eyors, .ird edi1io11. Boca Raton: CRC Press.
Welch. R.• Remillard. M. Alber1s. J. (l 992). lmegration of OPS. remote sensing. and
GIS techniques for coastal resource management. Pl,orog,·ammem'c £11gineeri11g
aud Remote Seusing. 58: 1571-1578.
Wilson. J.P., SJlangrud. D.S., Nielsen, G.A., Jacobsen. J.S., Tyler. D.A. ( 1998). OPS
sampling uuensity and panem effects oo computed 1errain anributes. So;/ Science
Society of.America Journal. 62: 1410-1417.
Chapter 5: GNSS 233
Study Questions
5.1 - Describe the geueral components of GNSS. including the three co:nui1on seg-
ments and what they do.
5.2 - Whal is the basic principle behind GNSS positioning? Whal is a range measure-
ment. and how does it help you locate yourself?
5.3 - Describe 1he GNSS signals lha1 are broadcast. and 1he basic difference between
carrier and coded signals.
5.4 - Ho,v many sa1elLi1es must you measure 10 obtain a 1hree-dimeusio ual position
fix?
5.5 - \Vha1 are the main sources and relative magnitudes of uncertainty in GNSS
positioning?
5.6 - How accurate is GNSS positioning? (le sure you specify a range, and describe
under what conditions accuracies are a1 the high and low end of the range.
5.7 - Whal is a dilution of precision (DOP)? How does ii affect GNSS position mea-
surements?
5.8 - Which of the following figi1res depicts the Iowes1 and highcs1PDOPs. assuming
the obserr.rer is near the center of 1be drawn surface?
a) b)
H ,Q' "<I»
""·
- ......... n
- -
.................•
<l d)
,Q'
.ti'
- 'I>,
..........
.Q'
- -~
234 GIS Fundamentals
5.9 - What is the rank order. highest to lowesl~for accuracy of GNSS data given the
satelLite constellations in the following figures, assuming lhe observer is near the cen-
ter of lhe drawn surface?
b)
- -
<) ...,_ d)
,1;· ,Q' ~
....
•,p.
,Q'
- -
,lil'
-- ---
5. 10 - Describe tlie basic principle behind differeniial positioning.
5.11 • What are the differences between post-processed and real-time differential
positioning?
5.12 . What is lhe primruy source of error reduced with a dual-frequency GNSS
receiver?
5.13 . Place these in order of accuracy. assuming lhe best equipment and practices
applied to data collection:
a) dual frequency. real-time kinematic positioning.
b) precise point processing. and
c) post-proces~. dual frequency positioning.
5. 14 - How is GNSS accuracy affected by the local terrain horizon? How is it affected
by canopy cover or building obstmctions? Why does positional accuracy change as
these conditions change?
5. 15 - How are GNSS data accuracy and efficiency (points collec1ed per given time
interval) relat~.d when collecting data in obstructed environments? Why? How is this
controlled by field persollllcl?
Chapter 5: GNSS 235
5.17 - Wby are distance measurements devices and offsets used when collecting
GNSSda1a?
5.18 • Wba1 is COGO?
5.19 . Complete tl1e table below. calcula1ing missing elemen1s accordin.g to 1he for-
mulas presented in this chapter. Distances are in meters. 1he angle e in degrees.
i
.. 10
Yo
20
0
30
L
500
••
2500
d,
433.0
><,
2600
v,
.cSl.O
,oo 876.C
! L
•, 931
91
12
60
.,••
J0!2
t!J2( 1!11!1 7
S060
I
! (I 1540 lo.!8 85 496}
! 369 280 10 220
Xo, YoI
S.20 - Co1nplete the table below, calculating missing elements according to the for-
mulas presented in this chapter. Distances are in meters. 1he angle e in degrees.
..
' x,. Y~
:- d.. - - .
Y. 0 L •• d, ~ v
0 0 70 100 ••o 3<2 940 342
I••
!
L
,oo
150
10
IS 3S
0
80
25
·~
IS
66
88
130
200
20
12
336
14H
125 ft
881
-,--·--·--·-
Yo'
Xo,
5.21 - Fill in the missing cells. convening points between awnuths and. bearings.
Point I 2 3 4 5 6 7
S.22 - Fill in the missing cells. converting poinls between azimuths and bearings.
Point 1 2 3 4 5 6 7
5.23 - Compfe·1e 1he 1able below for a 1raverse wi1b the listed distances and bearings.
given as azimuth degrees (drawing a rough sketch may help with the calculations).
Note that many spreadsheet. calculator. and trigonometric fimc1ious require .input in
radians (approximately 57.2958 degrees = 1 radian).
Stort,ng point PO. X • 10.128 3, Y • 6.096.4
5.24 - Complete the table below for a traverse with the listed distances and bearings.
given as azimuth degrees (drawing a rough sketch may help wilh the cakulations) .
No1e that many spreadsheet, calculator, and trigo11ometric fimc1ions require input in
radians (appro:ximately 57.2958 degrees= I radian).
5.25 . Complete the table for points 2 through 5. which have the listed GNSS-deter-
mined laliirudes. longitudes. and eWpsoidal heigbt.s. The NOS data sheets for control
locations near the measured points are listed. and you may use these ro look up
appropriate geoidal data.
Poinr I 2 3 4 5
(Oo'ICI~{ PIO SFi.tlt PIO E'O!tJQ PJOCF"601• PIOOW0320 ?lORDtQtO
NGS k- 1 ~'1-'"" "" ·-ron N At.e RMl ()Q
"""""""" 'Nl VN
Long. 11.ide 68' 19' 19· 83"' IT 552" 88° 34' 40 ~- 109°29' 439· 123'50· 310"
5.26 - Complete the table for points 2 through 5. which have the listed GNSS-deter-
mined lar;rudes, longitudes. and ellipsoidal heights. The NOS data sheets for control
locations near the measured poin1S are listed. and you may use these to look up
appropriate geoidal data.
Point
(Qf"(l nta'f'!{
-·-s --·-,
I
fl10 004877.
,.• --·• MIIW\
2
PIO Aao•60
AUS Al)9.\ TX
3
PI00\1?10
s,a Cofd. (A
__ .,.,,
4
PlO PX0445
5
PIOCl.6000
CNIW<nh
Lo11tude: 45°05· 454" 30' 17 29.7" 33"24" 159" 44~00' 338" 47" ,9· ss r
Longrtude 93'00 17.9' 97' 41 34 3' 118° 24 54 2· 109' 30" 20 2· 119' 59 216'
OMhomietr1e
Height Cm) 280.15
238 GIS Fundamentals
239
fla,urt 6, 1: lim3cs arc a ,-atuablc: soinc: o(spatilli dtlta.. TI1c upper ima.l< o!thc lbaian Pc:niru·.o]a illustrates
thc: bro:id·IU'c::1 c:o,<cn;e;c pro\•idc:d b)' t111c:1ti1c: d.'\111. The: lowtt ll1123"c: nurdowntown B11redoi:ia illtmrn.11:1 die:
high .ipalii,I dc:t11il ;n.ubblc, bc:rc wi1h iodh·Ldwd l!l\llomobile5 rewln<l froOJ space: (co1.1t1C:i)' NASA. top.
aud Maxar. bonom).
Chapter 6: Aerial and Satelltte Images 241
Basic Principles
The mosc conunon fonns of remote drops off at lligber wavelengths. Some
sensing are based on reflected elec1romag- reg.ions of the electromagne1ic spectrum are.
ne1ic energy. When energy from the sun named: X-rays have wavelengths of approxi-
strikes an object. a portion of the energy is mately 0.0001 µru. visible ligbt is between
reflected. Different materials reflect differ- 0.4 and 0.7 J.1111, and near-infrared light is
ent amounts of incoming energy. in different between 0.7 and I.I µm.
wavelengths. and U1is differential reflectance Our eyes perceive light u1 the visible
gives objects their color. portion of the spectmm and are. blind to
SunLigbt is the principal energy form shorter and longer wavelengchs. We assign
detected u1 remote sensing for G!S. Ligbt three base colors: blue. from abo1110.410 0.5
energy is characterized by its 11nve/eng1!,, µm. green from 0.5 10 0.6 µm. and red from
the discance between peaks in cbe eleccro- 0.6 co 0. 7 µm . Other colors are often
magnetic stre-am. Each color has a discioc- described as a mixnire of these three colors
1ive wavelength. for example, we perceh1e at varying levels of brightness. For example.
lighc wich wavelengths betwetn 0.4 and 0.5 an equal mixnire of bh1e. green. and red light
micrometers (µm) as blue. Light emined by at a high incensity is perceh·ed as •·,vb.ice"
the sun is composed of several different light. Other colors are produced with other
wavelengths, and the full range of wave- mixes: for example. equal !'arts red and
lengths is called the electromag,1edc spec- green Iigbc are perceived as yellow. The spe-
trum. We graph the Sun's light ene.rgy cific combination of wavelengths and their
(vertical axis) across wavelengti,s (horizon- relative intensities produce all the colors vis.
tal axis) co represem tbe spectrum (Figure 6- ible to the human eye.
2). Energy from the s,m increases rapidly to Electromagnetic energy striking an
a maximum between 0.4 and 0. 7 µm. and objecc is reflected. absoroedl. or transruined.
wovelength (!lf'nl 10
1
J 10
1
10 10' tO
• 10
l ll llllll I I I I I
Fignr,r 6-2: tlc~rom0,t11c1ic C1Ktt)' is cmiu«I by d1e Slm :ind t~11sruit1cd dll'Ough the a1mospbcrc (upper
graph), Solar radiatioo lJ r,aitially a'bsort>ed as it pimes throu,&li I.he nn05pbcrc. lbi.s remits in \'anablc sur-
face radi11tiou in 1bc ,isib c and infrared (IR) \\"1'11\'clcnitb rc;1ou, (lower ~pb).
242 GIS Fundamentals
Mos.t solid objects absorb or reflect incident reducing energy reaching the Earth 1s surface
electromagnetic energy and tnUISJUit none. in ponions of the electromagnetic spectrum
Liquidwater aud atmospheric gasses are the (Figure 6·2).
most common nat11ral materials 1hat transmit Most aerial images and sateJlites use
light energy as weU as absorb and reflect it. p1,ssire remote serrsirrg, in thal 11ley use
Natural objects appear to be the color energy generated by the sun and reflected off
they most re Dec 1: for example. green leaves of the 1arge1 objects. The images from tliese
absorb 1UOre red • nd blue light and reflect passive systems may be affected by atmo·
more green light. , ve sense these differences spheric conditions in mulliple ways. Figure
across a range ohvavelengths 10 distinguish 6·4 illus1rn1es the many paths by which
among objectS. Wltile we see only visible energy reaches a remote sensing device.
wavelengths. these differences also extend Only one oftbe energy paths is useful. that
into other pans of the electromagnetic spec- of surface reflection fromground features.
trum (Figt~e 6·3). Jndividual leaves of many The other paths provide little or no infonna-
plant species appear 10 be similarshades of tion al.>out the 1arge1 objects. Most passive
green: however. s.ome reflect a higher pro- sys1ems are not useful during cloudy or
ponion of infrared light. and thus appear 10 extremely hazy periods because nearly all
have a different "',color'' when ,~ewed at the energy is scattered or absorl>ed. There
infrared wavelengths. are some passive systems usefiJU at night.
Energy transrninance through the atmo- e.g.. emission microwave sensing, bu1 most
sphere is most closely tied to the amount of passive systems rely on the sun's energy. so
water vapor in the ait Water vapor absorbs are useless at night.
energy in several ponions of the spectrum. Actfre systems generate an energy signal
and higher atmospheric water conlent results and detect the energy ren1med. D ifferences
in lower trnoSitlitlMCe. Cailioa dio:<ide. in llie qmmtity and direction of the rellli'iled
other gases. and paniculates such as dust energy are used 10 identify feamres and their
also contribute 10 atmospheric absorption. properlie.s. Radar (radio detection and rang-
60 ,.. ____ _
I
~50 I ...-··-' \.-··-·" -..- ··-··-··- ..............dry
_··-soil
~
"""'"'
'-
20 ...," -
/ I\
\J
I \.
• - '
vegetot,on
.... ....... ,
,,
--.
10
/I woter
0.4 0.6 0.8 1.0 12 L4 L6 LS 2.0 22 2.4
wavelength ( rim)
fleure 6-J: Spcc:11al fC'0cctan((' Cut\'CS for some comn1on subst1U1ccs. The propor1io11 of iuromi.i13
m:li:uio111h2t 1& n:Otttcd \'ru'ia 2crou w1wdcngd11 (21i:Lprcd from LiJJeu11d ~nd Kiefer. 1999).
Chapter 6: Aerial and Satelltte Images 243
sensor
~ ~ atmospheric
absorption
~
cloud reflecllOn
end absorption
surface
emission
S\lr'fOCe
surface
reflection absorption
fiau~ 6-4: Encl'J)' pa1.bway5 from source to r.enwr. Li~t and othtt clcctroruagrn:tic: energy mi,,y be
absorbed. tntrmruttcd. or rcRe¢tcd by 1hc atmospbc-re. L1i;tbt rdloct«t 6-otn the surface and tnumnincd to
the .k11SOT ia lded 10 crc:lllc al'l inugc. Tilc U1ugc: may be dc:p':1.ded b)' :umo~bcric sc:mc:riil! due to water
vnpor, dust, nuokc. nnd other c:onstitna1ts. incoming or rc:R«tc:d cnert)' U'-"Y be scmtered,
ing) is a common active remOle sensing sys· shades from black to white. or are '"Stacked''
tem. Radar focuses a beam of energy composite color images. with specific wave-
through an amteona. and tben records the leogth regions set 10 color blue, green. and
reflected energy. These signals are. swept red outputs (Figure 6-5). Color images are
across the lalldscape and the rerurns assem- most colUll\on. as we usually collect at leas1
bled to produce a radar image. Radar images three spectral bands. often four. and with
are usually monochromaric (in shades of some scanners up to hundreds of bands.
gray). These images may be collected day or Each band records the amount of energy in a
night. and most radar systems penetrate specific set of wavelength. e.g.. a near-infra-
clouds bec.ause water vapor does not absorb red band may record tl1e energy returned
the relatively long radar wavelengths. between 0.9 and 1.0 µm.
LiOAR systems (or lidar, /ig/,1 de1ec1io11 Each indi,; dual band m.ay be thought of
a11d ranging) are also conwon. They use as averaging the energy returned O\'er a
lasers. typically in the visible or near-infra- range of wavelengU1s. Bands may vary in
red portion of the spectum, to measure a dis- limits, e.g.. one c.amera may record a green
tance from the scanner to a larget. and I.hen band Uiat senses energy from 0.6 to 0.68 µm,
calculate heights. Heights measured across while another camera may sense energy 0\ et 1
~!l i1f~~ !ll!!Y lx ~Qmbin~ !Q Hf~!e ahfight !ht r.mge ffQm Q,61 tQ Q,67 11m, Dam for~
surface and DEM. LiDAR is described in band may be conceived as stored in a grid,
detail in a later section. representing the reflectance for each cell. or
Images we view eitherdisplay a sing.le pixel. over the area imaged.
range of wavelengths and appear in gray
244 GIS Fundamentals
We must assign eac,h recorded bru1d 10 a provides an image simiJar to the colors
color on our displayed image. Images. bolh observed by lhe human eye.
on a monitor or a hardcopy. are created by We may also assign different band com·
mixing primary colors, typically blue-green. biaations or different orders. Many aerial or
red. We assign one layer recorded for one satellite sca1mers sample in four or more
band 10 the green color. and variation in bands. including an infrared band in addilion
intensity sensed in 1he band translates into to the three visible bands. The Sentinel satel-
the range of dark 10 light greens for this lite scanner. described later in tllis chapter.
layer. We do the same for red and blue lay- collects thirteen bands. We must choose
ers. assigning an input band to each. We then which of tJ,ese bands "ill be assigued
combine lhe bands. tliat is. the colors for respectively 10 tbe blue. green. and red dis·
each corresponding pixel. to create a com- play colors. Different band selections USU·
posite. multi.band image. ally result in different displayed images.
Tn,e color images are most common. with abnom~,1 colors for fearures. e.g.. green
and for these we assign the red-sensed band vegera1ion may appear purple or red.
to the red color on ouput. the green.sensed depending on the band combinations used.
band to the green output color. and dle blue- We rypically choose the bands that best help
sensed bond to the blue output color. This us identify imponam features.
__....,v·
J ----~--. n\.'l1r blu.c, peen. Of red color, :md
' mixmg 1he colon. to fonn ~ new color
(or each cell. Scatu1<"rs with mo~ than
I ~......- d'lrtt b11nd1 allow mo" co1nbin111ions
//, I ofl»nds and color au ig,uncQlf, wbid1
often aids in idcuti.fyiog Uoportanl fca-
:o I 'v
, /_ woter
rur«.
01
;
.,. :;s. • .,,.
..- . ....
i • '11
-
•
Chapter 6: Aerial and Satelltte Images 245
Aerial Images
Images taken from airborne cameras are
a prin:t,1ry source of geographic data. Aerial
photography ,quickly followed tl1e invemiou
of portable cameras in tile mid-I 9tl> c:enrury.
and became a practical reality with the
development of dependable airplanes in the
early 20th century (Figure 6-6). Ph01ogM111·
meh)'. the science of measuring geomeny
and mnpping from images, was well devel·
oped by the eruly 1930s. and there have ooeu
continuous refinemenis since. Aerial images Flg,urt- 6-7: AdUl1 pl101o ~bi lkrc often tllknt
underpin most large.area maps and surveys from spcci.,li.:r¢d nircr:if1, $1.lCh .:ai this low 3hitude
in most countries. Digital mapping cameras airplauc:. or &om bdicoptcn or higher-flying.
became conu,1on in the 21st cennny. largely J~craircraft (courtesy Sc-.ttbitd Ltd.).
supplanting a.erial cameras. and are often
carried aboard aircraft optimized for aerial suremenls are combined \\1ith ground sur-
surveys (Figure 6-7). Aerial images are rou- veys. In a second major application. image
tinely used in urban planning and manage. imerpreta1ion may be used ro categorize or
meat construction. engineering. agricullure. assign attributes to surface features. Images
forestry, wildlife management. and other are often used for landcover and infrostruC·
mapping applications. mre mapping. and to assess the extent of fire.
flood. or other damage. Finally, images are
Althougb there arc hundreds of applica-
tions for aerial image.s. most in support of
often used as abackdrop for maps of ocher
feamres. as when photographs are used as a
GIS may be placed into three main catego- backgrow1d layer on web maps or for soil
ries. First aerial images are often used as a survey maps produced by tl1e U.S. National
basis for mapping. 10 identify and outline Resource Conservation Service.
objec'ls. Measurements on images offer a
rapid aud accurate way to obtain geographic
coordinates. particularly \\1len image mea- Aerial Mapping Systems
There are aerial camera systems and
platfonns specifically designed for inapping,
The camera and components are built 10
minimize geometric distortion and maximize
image quality. Mapping cameras have fea.
nires to reduce image bh~ d\1e to aircraft
motion. enhancing image qualily. They
maintain or record orientation angles. so dis·
1onions crui be removed. These camera sys-
tems are precisely made, sophisticated,
highly specialized. and expensive, and most
often 11sed wben large,area. higJ1-resolu1io11.
accurnle images are required.
Modero aerial cameras are typically
Fiiur,r 6.-6: Acri3l stlf'\·cys began ,bor1ly 3ftCT mounted inside an aircraft. pointing 1brough
the dc,:dopmeot of n:liable airplanes: and por1a· an underside boy (Figure 6-8). The camera
blc. fl.bu-based e11m«a1 (cour1es:y Cauadi.itn
Go,•c-nm,e:nt Pll.()IOgr.ti>hic .-\.rcbi\ 'CS).
moum and aircraft control systems are
designed to maintain the camera optical axis
246 GIS Fundamentals
as near vertical as possible. Aircraft naviga- direction of the incoming image. and it is
1ion and comrol sys1ems are specialized 10 precisely orien1e<l 10 intersec11he sensor in a
support aerial pho1ograpby, with precise perpendicular direc1ion. Digj1al ·sensors are
posi1ioning and n.ight control. connec.ted to electronic- storage so that suc-
Aerial cameras sp«ializw for ipa1ial cessive images m.,y be sa,oed. Images are
data collection are large. expensive~and recorded a1a surface calle<l 1he camera's
complex. bu1 in principle they are similar 10 /0<:11/ plm,e. ideally perpendicular 10 1he
simple cameras. A simple camera consislS of optical axis. The 1ime. alli1ude. and other
a lens and a body (Figure 6-9). The Jens is condi1ions or infom1a1ion regarding 1be pho·
1ypicaJJy made of several iudi,~dual glass 1ographs or mapping projecl may be
elements. with a diaph,·ngm or othtr mecha- recorded by lhe camera, often as an elec-
nism to control the amount of ligh1 reaching 1tonic ltroder on digi1al image files.
the sensing media. dte digi1al sensor (or in Base scale and e:ae111 are important
1imes past film). These sensors have a char· anribmes of rem01ely sensed data. Image
acteristic dimension. sd. and for digital sen- scale. as in map scale, is defined as the rela-
sors, a pixel size, that when combined with tive distance on the image to the correspond-
1be flying heigh1 (H). and focal Jeng,11 (h), ing dis1ance on 1he ground. For example. I
de1ennine 1he groond resolu1iou and imaged inch on a I: 15.840-scale pho1og.raph corre·
are.a. An exposure control. such as a slumer sponds to 15.840 inches on the Earth's sur·
wi1hin 1be lens. se1s 1be lengll, of 1ime 1be face. As shown in Figure 6-9. image scale
sensing elemem is expose<l 10 ligh1. Cameras will be h/H. 1he ra1io of focal Jeug1h 10 fly.
also have an oprico/ nx,S, defined by the sen- ing height
sor orientuion. Toe op1icaJ axis is the cen1ral
Fizurt ~8; An e'JUlltlple of 1he 6QPb.i.nicated ~ ·,tcm (upper left) for controlling digital inagc collection.
bc:rc with a Lc:ica Gcosystc1m ADS40 digital aerial camcr,: (lower riah.1), These systems rcwrd *Od display
fliaJu p:1ths rutd Cllffi«a s111tio1u in rcal time. aud ffi3)' be used to plnn. execute, and 1n01ll1<>t" iin113c data col·
lcc1ion (courtesy L<ie.a Gcosyt.lcn\S).
Chapter 6: Aerial and Satelltte Images 24 7
Im.age ex.tent is the area covered by an lion will have a cell sile ofO.osn,000,
image. an<I depends on 1he physical size of which is 7.1 x 10·6 m. or 7.13 µi11
the sensing area or elemem (S<l in Figure 6- The realized or ground resolution on
9). lhe camera focal length (h). and the fly.
aerial images may be approxirua1ely calcu-
ing heigh1 ( H), according 10:
lated from equation (6, I), subsrinuing cell
dimension for sensor dimension. sd. In our
example, if the caruern bas a 10 cm (0. 1 m)
gd,sd " H/h (6.1) focal length. and is flown ar 3.000 m. the
ground resolution is:
o) b)
n-type slticon
( p· type silicon 110
,..__,....
• I E
-~ -i!
ooa
L
:, ':' 0.
v c
L
0
.."' ~
~04
-
0
5
-
~02
0
g o +-~~~~~~~~~~~
may be place<l in fronl of the lens, for exam. cameras, the.re is a vast ardtive of past aerial
pie. for protection or to reduce haze. images collected on film. Large-fonnat
Digital cameras typicaJly have a com- films. 240 mm (9 in) on a side. were most
puter control s~,ienL used to specify the common for mapping. Most film.based pho·
loca1ion. 1imi11g. and exJ)-OSurc; record GPS togn1phs from mapping can1eras have been
and aircraft altitude and orientation informa- converted to digital (omis. Their cbaracter-
stics and use are well described in the litera.
1ion: provide data transfer and storage: and
allow 1be opera1or 10 monitor progress and ture and are 1101 described here.
image quality during data collection.
Digital cameras may have several fea- Lens and Camera Distortion
tures to improve data quality. For e.xample. The crunera and lens in any system may
digital cameras may employ elec1ronic be a signiticam source of geometric error in
image motioo compensation. combining aerial images. The perfect lens--c-amera-
infomiation collecl'ed across several rows of detector sys1em wo,tld exac1ly project the
CCD pc,els. This may lead to sharper viewing geometry of the target onto the
images while reducing the likelihood or image recording surface. either film orCCD.
camera malfunction due 10 fewer moving The relative locations of features on the
parts. In addition. digital data may be image in a perfect camera system would be
recorded in long, continuous strips. easing exactly the same as 1he relative locations on
1he produc1ion of image mosaics. any viewing plane that is in front oftbe lens.
While nearly au presen1 and future Real camera systenis are nor perfect and may
aerial images will be collec1ed with digital distort the image. For example, tl1e light
from a point may be bent slightly when trav-
eling through 1he lens. or optical axis/focal
plane mis-alignments may s:kew the im,,ge.
CCD sensor t riplet both causing image distortion.
The mapping camera systenis typically
!blue carried on large aircraft are -engineered to
minimize systematic errors. Lenses are
designed and precisely manufactured so th.1t
image.distorlion is minimized. Camera parts
are optimized to ensure a fojtbful rendition
of image geometry. Sensors and motmts are
painstakingly aligned 10 reduce perspective
angle distortion. This optinrization leads to
Trichroid extremely high geornenic fidelity in 1he
prism '---&.Ul-0>/ camera/lens systent Thus, camera and lens
distortions in mapping cameras are typically
much smaller than other errors. for example.
pointing or terrain errors. discussed in the
nexl section. or digititing errors when con.
vcning the image data to forms useful in a
GIS.
Light reflected
from the ground
Fltm't 6-.12: Di.a;i.1al ct1rocras oftco us,: a prism or
o.d1a- mccb11.ni.s:m 10. scp3:l"llc and direct lig:bt to
appropriate CCO uown (11dn.1>t.cd from Lcic-.11
Ocosystcrus).
250 GIS Fundamentals
'- ~
fi&nrt' 6-ll: An r:x.amplc of rndial lcn, di&iortio11. common in omt leuliCs.. aod qtiitc lnrgc in lmscs
not desigued for aerial image acquisitiou. The 1d't-1nos.1 fi~ shows a perf«t lens. while the center
and ri~1 t'.i~ show spbctical or "bruTCl"di.s.t01tioii ou1wMd and Uiward. rcspccti\·cly. LcnSC"i mttSI
be c2rcfu.lly cntflcd 10 n11nimiu tbisd"h)r1ion, 11.nd thencalibr:ucd IO 1h.1.1 rcn~ming di,1011iom uuy be
mJ)C)\'cd I.W'O\l.gb sobscqi.,cul Ull.ilgc procc$$ing,
projec1ed onto a common. 1wo-dimensiona.l opernlors. However. some aspects of the dis·
plane (Figure 6- 15, right). Objecis above or 1onion may 1101 be controlled. and no camera
below the plane are venically projecied system is perfect, so 1bere is some geomeb'ic
down or up o:nto the horizontal plane. Thus. distortion in every uncorrecled aerial image.
1he top and oonom of a building should be The real question becomes. "is !he dis1ortiou
"flanened downward" 01110 the same (oot- and geometric error below acceptable limits.
prin1 in the datum plane. ln our ideal data given the iniended use of the spatial data1··
set. the tops of all buildings would be visi- This ques1iou is not unique to aerial in,ages:
ble. b111 none of the sides. it applies equally well to sarellite images.
Unfonunately. most aerial or satelli1e spatial data derived from GNSS and tradi-
images provide a non-oribographic perspec- 1ional ground swveys. or any other data.
rlre l"iew (Figure 6-15. left). Perspec1ive The larges, distortion in aerial images
views give a geometrically distorted image taken wi1h mapping c-ame.ras comes from
of the Eanh's surface. OiStonion affec1S the two sources: terrain variatfo,n and camera
relative positions of objects.. and uncorrected tilt. particularly when using an aerial mair
data derived from aerial im,1ges may not ping camera or when prope.rly calibrating
directly overlay data in an accunne and correcting non-n.spping cameras. e.g ..
onhograpbic map.The amou111 of distonion \\itb drone/UAV in~1ges. Atmospheric bend-
in aerial images may be reduced by selecting ing is relatively minorunder most condi1ions
1he appropri~te camera. lens. Oying heigb1. when collec1ing aerial imag<?S. bm may still
and cype of aircraft. Distonion may also be be unacceptable. particularly when the high-
controlled by collecting images underproper est-quality data are required! Esiablished
weather concLi1ions during periods of low me1hods should be used to reduce the typi-
wind and by employing skilled pilots and cally dominant till and terrain erroni. and for
[I
•
datum plane
~ -#-f----'tr's--~ / \--.-.-~~---,--,-
f'ii:,1.1rt- 6-15: Pmpc,cti\'c (Jcll) and ortbo(!raph.ic (rig)1t) \ i C'\,' f, Orthogrnpbic ,icws projcC"I nt ri$ht rmglcs to
the danun plane, a"i if ,ic-wing; from an in6m1c bC1gh1. Pcnp«ti\'c ,icws project from a fi,ccd newing loca·
tion on10 tltc a dntum pfauc with soroc a.pp:ltt111 lcrut due to ,·icwing: nniic.
252 GIS Fundamentals
non-mapping cameras. the previously ence datum surface al a righl angle at the
described lens and can1era dis1onion. nadir location.
The locations for poinls Aand Bare
Terrain and Tiit Distortion In shown 011 the ground surface. Tl,e corre·
Aerial Images sponding loca1ious for 1hese poims occur a,
A"and s· on the reference datum surface.
Terrain variation. defuied as differences These locations are projected outo the imag.
in elevation within the image area, is often i11g sensor or fdm. as they would appear in a
the large.s.t source of geometric distortion in photograph taken over this varied 1errain. ln
aerial images. Terrain variation causes relief a real camera. dte sensor is bebi11d dte lens:
displaceme111. defined as the horizootal however. it is easier 10 visualize the dis-
shifts in object positions on images due to placemen1by showing the seosor in front of
differences in ete,iation. the lens, and the geometry is the same. Note
Figure 6- l6 iUustrates the basic princi~ that the points o and bare displaced from
pies of reliefdisplacement. The figure shows their reference surface locations. o' and b'.
the image geomeuy over an area with su~ The point o is displaced radially outward rel-
stantial differences in terrain. The reference ative ,o o·. be<:ause lhe eleva1ion at Ais
surface (dalllm pf;ane) in this example is cho· higher tlmn the reference surface. The dis-
sen 10 be at die elevation of the nadir poim placement ofb is inward rela1ive to b'.
directly below the camera, Non 1he grow1d, because Bis lower than the reference datum.
and imaged at non the photograph. The Nole that any points that ha,~e-elevations
can1era station P is die location of the cam- exactly equal 10 the elevation of the refer·
era at the time of the photograph. We are ence damm will not be displaced. because
assuming a vertical pbo<ograpb, meaning the the reference and ground surfaces coinc.ide
optical axis of the lens points renically at thO!t pOifilS.
below the amera and in1ersects 1he refer-
Vertical Tilted
Side
view
R€SUlti~
distortion
Fl(ui-t 6, 18:: [sn11.~ dis1ortion <:3Ukd by a tilt in 1.bc <:a1ucr:1 optical uis rcfath-c 10 die ~round 1tufacc. llic
penipcc11,-c distonion. i;.bow11 at the: bonom right. rc:suhs fi'oin c:.hru1gc:s in 1hc: ,-iewing dist1Lncc: acrou 1bc
pbo1ograph.
254 GIS Fundamentals
' . ,,
.
Yp
,·
! y
J(
••
•• •• • • •
····,.............
-·
)(..
•••. rQIOIO'\
c:tkXl1 lhf; /j
· -·-.-· c
:
' ?
.•"f.. • /_.,,..
.···
...... !! ....~ .••
....·····----j ······-.... . ··········.·.......
. ••··' i ···••••
' j
.:
•.••·'y /
/
:
:
·······-..... ..!.:•.·•'.~·
!
...········ "
FJaurt ~19: Im11$,Crih ,1u:i;ks 11te often "-pceificd by tot:atiou, a.boul tbe X a.'tis (a113k ro). the Yaxia
i,nglc ~). and ihc Z n.,,;it (~ogle ~').
Chapter 6: Aerial and Satelltte Images 255
from vertical aerial images ,,1ien only apply· graphs cle.arly show thal tilt alone introduc.es
ing an affine uansformation (see Chapter 4). large errors. and terrain furtlter degrades
a simple proeedure used 10 register data lay- geometric accuracy iu aerial images 10 1be
ers. These error plots mimic digitizing from point where d1ey are inappropriate sources
partially corrected aerial images. common in for GIS data. Some form of correction. usu-
less-sopbistic:ated processing packages. or ally based on stereo coverage and projection
when GNSS data are sub-optimal. Note first geometry, is required
that there is zero error across all scales when
the ground is flat and there is no rill (bonom
line, top panel in Fignre 6-20). With tilt, Stereo Photographic Coverage
errors increase as image scale decreases, as As noted above. relief displacemenl in
you move from left 10 right in both panels. venical aerial images adds a radial displace-
Error also increases as tilt or terrain increase. mem that depends on terrain heights. The
Geometric errors can be quite large. larger the terrain differences. the Ja~er the
even for venical images over moderate ter· relief displacement. If two overlapping pho-
rain (Figure 6-20. lower panel). These tographs are taken. called a ~te•-eopnir. these
o._..,,~----,-:-·---~..,~------;.,~-------·--~-.,o_-_-_-·. ,-;-,-·-------;,..,-
Photo scate (1,0005)
.. . . . . -·-·
0
'--,,o,--'"'
,s--:20"'"'°-~2""'"'°-"'a"'o--:J-,s--,-,o-
Photo scare (1.000s)
Fi1111'f 6-20: Tem,in 111ld till effects on me:1;u porition:-.1 envr whco digi1izing from u21<'Qn"Ccted aerial
iwag-e,. OlS1:orti.oo in('~SC:s: when tih and temiu increase. ~ud u photo seak d«reascs. (from Bols:tacL
1992).
256 GIS Fundamentals
photographs may be used 1oge.1her 10 deter- ferent. Points are shifted by different
mine the relative ,elevation differences and amounts. and the magnitude of the sttift
remove distonion. depends on the distance from the observer
A srereomodel is a three-dimensional (orc.amera) to the objects. This shift in posi-
1ion with a shift in \~ewing Joca1ion is by
perception of1errain or other objecti tha1 we deCmi1ion the parallax. and is the basis of
see when viewing a stereopair. As each eye
looks at a different. adjacent photograph depth perception.
from the overlapping stereopair. we observe Many mapping projects collect stereo
a set of relative spatial shifts in objects, and photographic coverage. in whicb sequeolial
our brain may convert these to a perception photographs i.n a flight line overlap. called
of depth. e11dlap. and adjacem fligb1lines overlap.
Stereomodels are \~Sible in stereopairs call.ed sidelap (Figure 6-22). Stereo photo-
due to poml/ar. a shift in relief displacemeo1 graphs typically have near 65% -end lap from
due to a shift in observer loca1ioo. Figure 6- one image to the oex1 in a line. and 25%
21 illustrates parallax. The block (closer 10 sidelap for adjacent Hoes. Some digital cam-
the viewing locations) appears 10 shift less eras collec1 da1a in continuous s•rips and so
than the sphere wlteo the viewing location is only collect sidelap. Orone-based c0Uec1ions
changed from the left 10 the right side oflhe often have nearly 100% end lap or side lap.
objects. The displacement of any given point depending on lhe project and method for
is different on 1be left vs. the right views data collec1io11.
because the relative viewing geometry is dif-
• ••
'
••
fli::u1·e- 6· 21: An iUuitnitioo o! pamllax. tbc app,rnu rdati,·e ,hifi in die pos-ition or obj«1s wi.th a ,hifl in
lhc vicwc:r's position. Obj«1s that tttc fort.bet ll\\'llY (sphcn:,. nbo\'c) aP!)('tir 10 s.hi.ft more wbe11 ;a ,-icwe:r
cb3ug« posi1iou. This 1$ 1he basit ofstereo dcpd1 J)CTC<ption.
Chapter 6: Aerial and Satelltte Images 257
oenot
Image t
.... . .. .. .. ..
. -•.· ..... .. .. . . . . - . . ·. . . . . ....-. ... ·.. ...... .. .- ...... .•
•
. . . . . .. .;
. .. . . . . . - .. . . . . . -.. .... . ...-.
• •
. . • • •
. . • •
____.__ .... ____. ...a.. • ~·· J.& •
.
.
?.. .
:
..:
-
l
.: .
: ..
:
:
:
It-II' :
' sidel3D
••~
.. : .E'
.•... ~Ot
,.
Fli;:utt 6-22:: Aerial im1111,cs oftc:o o,·crlap to allow thrcc-dimcosiooal meas.UttUlClll:S aud the c:orrcc1ion of
teltcf dispbee11lC'.11t S~nni:al ii'n11get aloo3 a fiiWu p111b lltc: t"kc:n at i.mcr.·11.Js i-uch 1Juit d1c:n: is a11an:11 of
O\'a lnp, d)ow11 in tbc side ,·iew ln the top figure:. Flightfom n1oiy be $))need so th:111 tbe edges of '1icccui,·e
rows overlap. creating siddap (bottom !cit), Titc grouped photos ~rcmc a pbotom0$aic (bottom right),
258 GIS Fundamentals
A - p ---1
s h
\ I yB
..:r---"-f-"'---------:~ __,__
/ N
- D --j
-----P-----
1
Fiau~ 6. 23: Rdi:cf dispbccmcru mA.y be calcufatcd b.ucd Cll g.comc1ric mc:uurcnmu. Sil'l'lil:tlr 1ntl11glcs
$-N-C and o-n-C rc1ntc bcigbt, imd distances in lbe pbotogrnj>h nnd on the ground. We u,Ul'llly know Oy-
i.ug b¢iglu. H, and ~nu mca1,urc d And p on die pbotoppb.
Chapter 6: Aerial and Satelltte Images 259
Fla:u1·t 6. 24: Au c:nmpk of distortion rcmon1I wbcit CMl1fl3: an onboimat;c. A n~rly sttaij.bl pipeline
rigbt-of-wiy sp:ms wicmttcicd (!d i) a1id Con'Cctcd (rigtu) images, (rom 1bc: lowcnigh11et upper left in c:3Ch
i.m..gc. The p;11h *,Ppenn bcot in the im.agc 01) tbc lc:11 ai. it 11l~1c:ly c:liml.>s rid1,cs nnd dcscmcls into \'al·
le)'&, Using c:quat100, <kKribed in this s«ti.ou. tb«e di,tonioos may be reruo\·c:d. resulting in •be
ortboaraphic una;c on then.in. s.bo\\~ the: neatly straight pipeline tr.ajc:c1ory (c:our1c:.sy USGS).
fi_j:UN' 6-27: A fused image/LiOAR d11«-dimcosiooal dot.t s« for 1hc Matterhorn. iu Switzcrla1»d. co1Jec1cd
with a UAV. Point ckositics arc- sclcsct«I based oo the time :1.n1ilab)c aod area to be Sllltlp!cd. ao,d not fixed. as
with nniiy s:ucllilc :.od iu sou1c c~ses ad'i:i.l systctnS (courttiy scwcFly Lui.).
Fiaun 6-2.9: Photo interpretation i, the proccu of ide1uifyina fcatuT«, ou au llnaic. Photo interpretation
in , uppor1 ofOIS !)'piclllly iu\'oh·c:s dig:i1izing 1hc poiuts. liuc:i. or polyt0111 for c11.1cgol'ic:,of inttttSI frolU
a ~c:orcfCRn«d digitn.l or hardcopy im~gc. In the ex.,mplc llbo\'C, 1bc: bound.vies between diftcrn11 ngc:•
tallou typcs luwc b«n i<kuti6cd based on tbc: tone and tc::<.tu:c: ~ordcd in lbc linage, The aJTOW n1 the
lower ritbt shows au ''ioclusiou are:a". 001 ddinc:accd because it i, smaller than tbc: nllninuun inappinj
w1i1.
Chapter 6: Aerial and Satelltte Images 265
Most GJS software allow you to change 6-3 that some features reflect a few percent
the lxlse color assigned ro each image band. of the incoming radiation while orher fea.
and manipulate the ius,ge histograms 10 rures reflect more the 70% in those same
enhanc.e image display (Fignre 6-30). We bands. Differences may be obscured if dis.
may optimize the various image his1ogram play thresholds are 001 adjusted. reducing
tliresholds 10 best reveal the (eatures we the in(omiation that can be e.xtracted from
wish to identify. Analysts typically adjust at images. Detailed de.scriptions of more
least the upper and lower thresholds for each sophisticated image eohaucements can be
band so that the sensc<I energy spans the found in most introductory rem0<e sensing
entire range of brightnesses that a computer textbooks.
monitor can c!lisplay. Remember from Figure
Satellite Images
The previous sections described the small: but conve~ely. satellite C-OSIS for
basic principles of remote sensing and the small are.as ,,~th sateUite-s may be quite b.igb.
specifics of aerial image collec1io11 and cor- Satellite images may N?quire specialized
ree,tion. In many respects. satcUite images image processing software. Acquisition of
are similar to aerial images when used in aerial images may be more flexible.because
GIS. We aim to collect infomiatioo on the a pilot can fly or a drone sent up on shon
location and characteristics of feamres. notice. Many aerial images have bener effec-
However. there ac-e imponant differences tive resolution than satellite ima,ges. Many
between photographic and satellite,based of these disadvantages of using satellite
systems used for image collection. and these images diminish as more. higher-rcsolmion.
differen.ces affec1 1he ch.1racteris1ics and poimable scanners are placed in ortlit.
hence uses of satellite images.
Satellite systems have se\'eral advan- Basic Principles of Sate:llite
tages relative to aerial imaging systems. Sal- Image Scanners
ellites offer a ve,y high perspective. which
significantly reduces terrain-caused distor- Sc.anoe.rs operate by poiotin,g the detec-
tion. Equation (6.6) shows the terrain dis- to~ at the area to be imaged. Each detector
placement (d) on an image is inve~ely bas an ;11s1a111auecus field ofl'iew. or CFOV.
related to the flying height (H). Satellites that corresponds to 1he size of the area
have large values for H, often 600 km (370 viewed by each detector (Figure 6-31).
mi) or more above the Earth's surface, so Although the IFOV may not be square and a
relief displacements arc correspondingly raster cell typically is. this IFOV may be
small. Because satellites are flying above the thought of as approximately equal io ihe ras-
atmosphere. their pointing direction is very ter cell size for the acquired ima.ge.
precise. alld so till erro~ may be controlled. The scanner builds a two.dimensionaJ
There are additional trade.offs in satel- image of the surface by pointing a detector
lite vs. aerial platforms. Satellite in1ages typ· or detectors at each cell and recording the
icalJy cover larger areas. so if1he area of reflected energy. Data are typically collected
interest is large, costs per unit area may be in the across,track direction. perpendicular
High-Resolution Satellite Sys- short revisil times, 011 the order of one to a
tems few days.
There is a large and growing number of Spectral range. price. 3\'llilability, reli-
high-resolmion satellite systems. here rntber ability, flexibility. and ease of use may
arbitrarily defined as those with a resolution be(ome more impor1an1 ractors dn sele(1ing
finer than 4 m. This is me resolution long be.tween aerial images and high-resolution
available on the largest-scale aerial photo- s.11elli1e images. Sa1elLite data are anractive
graphs. and used for fine-scale mapping of when collec1ing data for larger areas. or
detailed features such as sidewalks. houses, where it is unwise or unsafe to operate air-
roads. individual 1rees. and small-area lru1d- craft. or because data for large areas may be
~cape change. Co:nunercial systems provid· geometrically corrected for less cost and
mg 30 cm resolutiou are in operation (Figure time. Aerial images may be preferred when
6-34). witl1 higher-resolu1ion sys1ems in 1be resohuions of a few to tens of centimeters
offing. This detail blurs the distinction are needed. or for smaller areas. under uar-
between sa1elli1e and pho10,based images. rower acquisition windows. or willl instru-
ment dusters not possible from space. Aerial
Image.s from high-resolution satellite images will not be completely replaced by
systems may provide a suitable source for satellites. but they may well be pushed
spatial dam in a n\lmber of settings. They are t~wards the finest resohllions and county-
typically required by ci1ies and businesses s,zed or smaller collections.
for fine-scale asse.t management. for exam-
ple. in urban tree inventories. conscn1ction As of early 2022, there are several oper-
monitoring. or stonn d.1mage assessment. ~tional satelHt~ sys1ems capable of global
Nearly all tl1e sy<lems have poin1able oplics image acqms1uonat I mresottuion or bener.
or satellite orientation control. resulting in with some as low as 15 cm. T11ese s.1tellites
and 1ela100 systems Me predomi.nautly con!·
Fi.&"!"' 6-:1-': A 0.3 11\ rcso!u1:ion imllJc ~fLltc lul.@:~lic ~inc in Wc11cm .4.rutrn.liil, dn tlOlmrn1ing 1.bc
<.ktm I,l\'ulablc from Ille lugbcs1-ruolu11011 u tdhtc unagmg $ystctru (courtesy Digit.,) Olobc.),
Chapter 6: Aerial and Satelltte Images 269
mercial enterprises, funded and operated by has a higher radiometric depth and spatial
businesses. \Ve will describe examples. but accuracy and more frequent revisit schedule
given tbe number of current systems. and than the preceding WorldView-3 ao<I World-
rate of development and launch of new sys. View~4 satellites. which car.ry a similar set
tenis. tlte description is incomplete. In addi· of panchromatic and visible/infrared bands.
tion. there are several recemiy Off-nadir resolution is poorer than 30 cm.
decommissioned high.resolution systems for but sat'ellites in the Worklview series can
which archive data are available and still image the e111ire globe at be,ner tl1au 50 cm
useful. e.g .. tbe lkonos satellite that operated resolution daily. Nadir images are collected
front 1999 througlt early 2015 (Figure 6-35). at .approximately I0:30 a.mM local time. a
and the Quickbird system. operntional from common characteris1ic of these polar orbit·
2001 througli early 2015. ing. sun-synchronous systems.
The ,vorldView Legion series are an Anotl1er set of high.resolution images
example of tine current generation of high· come from the Sare/lite Po11r l'Obse,,·atiou
resolution sarellites. There is a planoed con- de In Ttn1' (SPOT). version·s SPOT-6 and
slellation of six satellites. with a 30 cm pan· SPOT-7. These are au evohllion of a set of
chromatic and 1.16 meter multispecUlll mid-resolution satellites. SPOT· I through
modes. Multi$pectral bru1ds cover the ,isible -5.described in the nexi section. The higb-
and infrared portions of the spectnun. With resolu1ion salellites carry a 1 .5 m panchro·
pointable op1;cs. the system will provide up matic ru1d 6 m resolutioo mul1ispecU11l scan-
to 15 images ,of any one location each day. It ner. the latter with four bands spanning the
flau1•t 6,3S: AO.Sm ttso1utioo fusioo inl3gc o(Vcoi«. ltal>· from the Jkono-2 satellite. NOie tll'C Gtaud
Can:\] through dtc <:cuter of iiltuid, ~ud 00l'l1 wt1kc:t \'itiblc iu duuutcls irowid die pcri1u-ctcr(cowtcsy ESA
& Digital Olobc).
270 GIS Fundamentals
Fli:un 6,,36: A SPOT-6 im11gc ofBora-BONL dCttl011Str:u-ing: t11c lli~b re10lu1io11 O\'er II rcUltiwl~· la'lc
2rc:1. nl'nilnble fr.om tbc ~tcm (courtc:ty SPOT ln'll\gc Coip.).
Chapter 6: Aerial and Satelltte Images 271
Flgun 6..37: Cameron Peak fift: iu central Colorado. 1:11pn1ttd ;113 mc1t.t re-solution &om :fl Do,•c- satdl.ilG
i111hc: Pl.1U1c1 c-ons.1dL,1ion. Hundrcch or s:11c:lli1c, :i.llow image rcpc.:u time.I ofl1ourt or lcu. allowint
sp.t«-bascd fire monitoring i.n near real imc- (courtesy Pl1111ct Inc:,),
were described ht the pre\'ious section infrared range with a 100 m resolution. The
because they offer high resolution images as system bas a 185 km swalh width a, nadir
well as mid-resolution. The mid-resohnioo and a repeat interval of 16 days. When com-
sensor provided visible through mid-infrared bined with the similar Landsat-S mission.
bonds at IO to 20 m. SPOT data are rou- 1here will be an 8 day recum interval.
tinely used in a 111.unber or resource manage- Landsat-9 uses an instmmeol calle.d the
ment. urban planning. and other Operorio11al land lmager-1 (OLl·2) to col-
applications. lect non-thermal bands. The spocific bands
There is a lafl!e archive of early SPOT used were selected to be compatible with
images. useful for time series anaJysis and previous Landsat missions. and 'to improve
change detection. but the lates1 SPOT satel- cloud detection and aerosol/auuospheric
lites are nearing their designed lifespan. and hazean.,lysis. The Oll-2 also increases !he
at this time the.re is no plan for a replace- bit deptlt, or da1a \\idlh. from 12 to 14 bits
mem. over Landsat-8. gh<ing a broader and more
sensitive response. aod clearer. more
detailed images.
Landsat
Because Landsa1 was the first Earth-
The Landsat-'9 satellite is the Jmest in observing satellite system and it has oper-
the longesl running series of mid-resolution ated nearlycon1inuously since 1'972. ihere is
imaging sa,ellites . Landsai-9 collects a 15 m an image repository spanning fi"e decades.
resolution pancbr,omatic band. 8 mul1ispeo- The majorily of these images (Figure 6-38)
tral bands al 30 m in the visible, near-infra- are available free of charge to artyooe wilh
red. and mid-infrared portions of lhe an internet connection. allowing long-tenn
spec1n1m. and two bands in the thermal
Fij11N' 6-.lS: An c:"(l'lt!Ji>lc of ri L:i.ndsril•S im:igc, sliowing die Miu~~:!f.pi Rh·er Dch3. Mid,n:solu1io11 s:il•
cllu" are partii.ul.arly approprialc for rcgioun.l or otbc,r laxgc-3"11 )'$&, (courtc~)' NASA).
Chapter 6: Aerial and Satelltte Images 273
monitoring and analysis. Landsal·9 images data. Landsat is the basis of many statewide
are processtd and added to this archi\'e. typi- and national land cover mapping projects,
cally within a few days of collection. result- and it has been used 10 assess forest health.
ing in an inexpensive source of broad-scale urban growth, and water quality in lake and
images. This Jong time s.ries is particularly coastal areas. Landsat is particularly appro-
appropriate for change analysis. provided priate for change detection. and much work
the differences between legacy and new data has established methods for radiometric cor-
resolutions aod fonua1s are addressed. reetion through time and across sensors. so
Landsa1 satellites have carried a rnJ1ge of that the 1imeseriesofins1ges maybe used 10
sensors. staning with a Multi-Spectra] Scan· map urban growth, vegetalion change, and
uer a, 80 m resolution (MSS). a n,ematic trajeetories in water quality.
Mapper (TM) or Enhanced Thematic Map-
per (ETM+). with 30 m resolutions, and now Sentinel
tile OLl-2 wit11 an up 10 l S m resolution.
Band wavelengths bave varied. but gene.rally n,e Sentinel system. launched by the
included \1isilhle and infrared portions of the European Space Agency (ESA). is com-
spectrum. Tik! satellites have bad a 16 10 18 prised of six missions. including atmo-
day remrn inten,al. spheric. oceanic. and land resources (Figtire
6-39). Each mission consists of two satel-
Landsat is used in many projects world· lites. typically in offset orbi•s to provide
wide because of the breadd1 of radiometric ma'(imum coverage and frequent repeat
bands, lbe-large scan area far individual observations.
images. the Jong data record. and no-cost
Fii,1111' 6-39: Tilc- Sco1ind ,a1c-lli1c- sys1c-u.1 pTO\'idcs glob.,1 ni.id-t'C".$0lution &it:. o.c-ross ;i bro~ spec-.tr:il ~ ngc-
(('ourtesy ESA),
27 4 GIS Fundamentals
fla:un 6--40: Au iinagc iUustratin; the bl'OM atcn covc111ic :ind infotm:uioti ce.11e1u A\'lllill'lblc from the
VIIR.$ s)'ftan, TI1is compo1itc i.nugc shows IWO C)'cloncs lhal sw~pt ncroi;~ 1io11hcm A11$'lrnlia in Lite Fcbru•
ary. 20.20. as well" diff¢ttuccs fo ocean co)Of associ11tcd ,,i tb pbytoplaul..100 blooms. M.d l,10(1:5urfacc color
aff~1cd by ie>Olon· and ,·c.acuition conditions (courtesy NASA).
Chapter 6: Aerial and Satelltte Images 275
lion satellites in several ways. Bands are tai- have the 250 m resolution, one each in lhe
lored 10 provide enltaoced infonnation in red and infrared portions of 1he light spec-
specific windows. wi1h improved vegetation. 1mm. These data were somewhat unique
ocean color a11d productivity. land and ocean when introduced in that the resolution was
temperature. and cloud. fire. smoke. and sea finer lhan ~1e I km resolution of previous
icede1ec1ion. VIJRS data are freely available daily coverage satellite data. but substan-
via U.S. NOAA portals. tially coarser lhan Lands.1t. S POT. and mod-
MODIS is a NASA research system erate-resolu1ion satellites (F igure 6-41 ).
ne.aring the end ofits Jifo 1hat collects data at
a range of resolutions and wavebands, from Other Systems
visible through lhennal infrared bands. ll is
described here because ii was lhe first A number of olher systems provide data
global, coarse-resolu1ion system designed for GIS. perhaps chief among them various
for both land and water imaging. and radar-based satellite systems. Radar wave-
because there is an a extensive archive past lengths are much longer lhan optical remote
of images. Resolutions vary from 250 m 10 I sensing systems. from approximately one to
km. with a repeat frequency of e\'ery one to tens of centimeters, and may be used day or
two days for 1be entire Eanh ·s surface a1 I nigltl, 1brough most weather conditions.
km resolutiom. Thiny·six bands are collected Radar images are panchromatic. because
when operare-d in 1he I km mode. ranging they provide infom1ation on the strength of
from 0.4 µm ·10 14.4 µm . Only two bands tl1e reOecled ene,gy al one wavelength.
Fiaurc 6-41: .A MODI$ lSO m resolution ixmgc ofnortbcm lt.1lv and Switzcrlaod. n-c a,o\,·-co,-crcd
Alps cross thro.1gh 11.c center ohbis itnaj-::. ll0f1h of the Po Rh·¢r ,-alley in Italy. Small cumutus: clouds
arc "isiblc, a~ is tutbidity in the Mcdi.tcmnc:tn Sea nnd ,•rui11.1ion in bod CO\'c t (couttcsy :NASA).
276 GIS Fundamentals
Radar systems have been successfully used cialized. each is a potentially useful source
for topographic mapping and some Jand- of data for GJS. Some may introduce
cover mapping. p:articularly \\iien large dif- entirely new tecllnologies. while others
ferences in surface texture occur. such as replace orprovide incremental upgrades to
between water and land. or fores1 and existing systems. Given ~,e rapi<I pace of
recentiy clearcut areas. Operational sy1tems development. one is well served by keeping
include the ER.S-ll. operated by the Euro- abreast of space imaging ne,vs.
pean Space Agency: the JER.S-1, by the Satellite data often requires specialized
National Space Development Agency of processin,g before use io a GIS. but there are
Japan; and the Radarsat system. developed ,veb-arehives designed 10 ease &eeess. Data
and managed by t:he Canadian Space are provided that are corrected for auno-
Agency. spheric or sensor effects:. to remove distor·
There are several other airborne. and sat· tion, and converted to standard map
ellite remote sensing systems that are opera- projectious. Google Earth E11giue is one
tional or under developmen1. with the paper such archive. with a comprehensive stack of
by Ustin and Middleton listed at the end of satellite and derived data available through
this chapter providing a useful smmnary for several ~tandard progra1mning i1llerfaces.
environmental applications. and similar ref-
erences found in a web search by applic.atioo
area. Although some systems are quite spe-
Flautt 6·42: A raclariiuagc oftbc Tcidc ,·okano on Tcncri.fc Isl.aod. Spain. Radar i.in11;es arc useful for
mcailll'ina dC'\-atioll 31ld &urf.1.cc ccxt\ltc, 1ttid in11y be collcc1cd lhtougb clouds 11nd :u n,gbt (counc,y
NASA).
Chapter 6: Aerial and Satelltte Images 277
,_
fores• -ond
/ ~:c(
-
posture
N
I
I
"c
0
J) ~water
Spectral
domain
I
bond 1
• s f p p p r \ r r f f
s ' w p p p ' r f f f I
$ w w w P r
s w w w
• w w w
w ' w •
WW
•
,
'
p p '
• f p
.r
'
' ' ' '
r
p p p
r f
'
r
r '
' p' '•
f f
f I
' ''
r
I
'
Classified imo ge
t = forest
p = posture
w • water
r • row crop
w " w p p p
' p
w w w w w w ' '
PIP p p p s • sand
'
w
w
"w w .. w
w w w w w
• p p p r
w w p f r
r
r
w w w w w r r f I( r r r
f'ia u1~ 6...J : LnndcoYer Mid land uw cl~ssificatioo i.s a couunoo nppllcnlion of sa1cDi1c i1u.11gcs. The
&p«ttal rdlcctancc panems of ~ch cover type arc used to assign a unique laodco\'C1"da.u 10 each cell
TIN:se daltl tUIL)' tbni be inlpOltcd lllto fl OlS 3S II rastu&t11 layer.
278 GIS Fundamentals
'• -
Fizun 6-+I: Corine d111a. an e~ample of larJe-arca 1:wdco,·er data dt:rh·ed from satellite image&. These
data inctudc lbe arc,a surround.in; Lac Lanan. dtc ere.seem-shaped fc.amrc iu the cnuc:r. and dcpi« We
relatively high rdOluLion ,wailabte. ill this i,manec iu a continental scale.
Chapter 6: Aerial and Satelltte Images 279
~ilometer. As. the size of the study area the pixel resolution - you can't measure
mcreases. 1be cos1s of using plane or UAV- what youcau11 see. However. accuracies are
acqu~ed images may increase. Multi-image often much poorer than image pixel sizes.
mosaics are often needed, raising costs., until and it is sometimes a grave .mistake to
a1some point cos1s often SU!JlaSS satelli1e assume image resolution and accuracies are
images for th,e same area. equal. This is panicularly 1me Cor UAV sys-
Third. sa1elli1e scanners DlllY provide a tems., wbeJe the GNSS systems used in
broader spectral range and narrower bands drone positioning and for ground coordinate
relative to aer~1I images. As noted earlier, reference poin1s may not be 10 as high a sian-
sa1e1Ji1e scanners may de1ec1well beyond tlie dard as wllh plane.-based systems. Precise
visible and near-infrared sp«mnns 1bm are ort110..correc1ion requires se,,e.ral advanced
more-conuno.n in aerial scanners. If imporl- 1echniq11es. including 1en-aio and 1ilt
removal. lensdiston'ion removal, and ortbo·
ant fea1ures are bes! de1ec1ed using 1hese
pon1ons of 1he specinun. 1hen sa1elli1e da1a registra1ion. Professional-grnde UAV sys·
are preferrecL Broad-spec1n11n scanners are rems are usually op1imized for ltigb accura-
available for aerial and UAV sys1ems, bu1 cies. Less expensive.. semi-professioual or
1hese are rarer and 1end 10 lose !he cos, hobbyis1UAV systems are common. and
advaniage when compared 10 satelli1e sys- typically no, designed for accuracy. While
tems. There are often tradeoffs between the available a1 low cos1. 1hey often provide dis-
size of the area 10 be imaged. resolu1ion. and 1oned data. A system should be chosen
spec1ral bands used when selecling a sys1em. which provides bo1h the rtSolution and accu·
racy required.
Finally. accuracy mus, be considered.
Accuracy gei,erally can·1be much finer tl1an
280 GIS Fundamentals
Airborne LiDAR
A 111nnber of laser-ooS((!. Liglu dctectio11 Commercial LiDAR mapping systems
and ranging systems (LiOAR) are becoming are relatively new and have been used pri-
common. Lasers nre poimed at the Eanh's marily for collecting surface daUI from air.
surface from an a~rial or sa,ellite platform. craft and sa1elli1cs. As no1ed earlier. 1hree-
pulses of laser light emitted, and the dimensiooal LiOAR surveying from tripods
reflected energy is recorded (Figure 6-45). or ground vehicles is growing. but we won't
Like radar. laser systems are active because expand 011 them here.
they pro,ade the energy that is sensed. Aerial LiDAR collection sys1ems lypi-
Unlike rndar. lasers have limited ability to cally consis1 of a downward poi1>1ing
pene1ra1e clouds. smoke. or haze. LiDAR. a precision GNSS to record 1he
LiDAR systems have been used primar- plane's posi1io11 to a very high accuracy. and
ily 10 gather data about topography. vegeta- an orientation sensing system 10 measure the
tion. and water qrs11ity. Laser pulses reflec1 angle of1he LiDAR pulse relatiw 10 the ver-
back from the camopy and the ground. and tical direction. UOAR energy pulses a.re
1he strength and liming of 1he rentm is uS((! directed downward. Some eoergy from each
to estima1e gro1111d heigh~ canopy heigh,. pulse is reflcc1ed from vege1a1iou. building$.
and other canopy characteristics (Figure 6- or other features above the ground, but under
45). LiDAR signals over wa1er also typically mos, condi1ions. many signals reach 1be
result in multiple renll'ns. including wa1er ground and rentm 10 the airborne laserpla1-
surface height and various depths. so lasers fom,1. The. time. i111en•al between1laser pulse
may be used to measure water clarity and emission and the ground ren,m may be used
nearshorc water deplh. to calculate aircraft heigh• above the terrain.
Flying height is known from 1be GNSS and
the terrain elevation c.alculated for each
pulse. Pulses may be sem several thousand
=
times a second. scanning back a11d forth
across the landscape. so a trace of ground
heights may be measured from every few
loser centimeters to a few meters along the ground
first lose~ return
..,.,,.,. return pu se (Figure 6-46).
.,
0
Disa·e.1e.-,-etun1 LiDAR is most common.
'3 wherein the system records specific values
.,v
Q,
---vege:totlon for each laser p11lse downward. Typically,
.£
signal 1he firs, ren,rn from a pulse. lasr remrn. and
.,
0 perhaps one to several intem)ediate returns
are recorded. lr!m,form liDAR collects a
.§ ground
~
Fii=11~ 6-46: 1..iOAA $11mpling p111tm1. each do1 rq,rcsn11, a LiDA.R. return. Sc:11.n lines o,·crl11p, n::rohlng i:u
an ,1ue,:en disuibu1lon of returns ~t YCI}' fine "ales. Results arc usually swninarized for ra.stcr «ta or areas
that span iap, ..
(2 in) or better are currently possible when racy, over large are.as. than previously
LiDAR is combined with precise GNSS Md possible.
aircraft orientatioll measu.ren1ems. TI1ese LiDAR data have also been widely used
projects report d1e "average" point density, 10 estimate vegetation characteristics,
bu1 LiDAR re1urns are typically c0Uoc1ed in
induding tree heigh1, fores1 densily, fores1
swalhs across 1he landscape. wi1h individual wood amounts. m1ders1ory density, growth
scan lines discernible when \~ewed at large rates, and forest type (Figure 6-49). A large
scales. Projects are planned and flown such number of points reach ,ne grou.nd in all but
1hat an appropriate amount of overlap exists the densest forests. and the ground vs.
belween adjacent scans and adjacent night locally higbeSI canopy re1ums usually give
pa1hs, bo1h 10 avoid gaps in coverage and an esthmne of i:ree height 1bat is as accurate
areas wilb an una«epmbly low sampling as traditional manual measurements. The
densil)'. proportion ofliDAR returns is strongly
Processing extracts 1be mos, relevam related 10 canopy deosiiy. and to tree and
ren1m for the desired product. for example. forest wood mass. Crown shape can be-
the maximum first retum in a gi\'en square determined from dense LiDAR dala, which
area may be eximcted and assigned 10 a ras- iu rum helps separate forest 1ypes. LiDAR
ier cell when calcula1ing tree heigh1. or a also can provide da,a on understo,y plant
mean or minimum value when extracting density. even in full-canopied foresls.
ground heigb1s. Different processing of 1he because at high sampling densities LiDAR
LiDAR poim cloud will result in differem has proven useful a1 passing through small
extracted value.s.. canopy gaps.
Horizomal and veriical errors less 1han a There is often a choice be1ween LiDAR·
few centimeters a.re attainable. allowing the based and image-based elevation measure-
use of aiiliome la~eMto meMUi'e building ment.!. pattitulMly when tlie Mea to be ~m-
beigh1 (Figure 6-48). floodplain location and veyed is relatively small. on the order of tens
extent, and slope :and derived terrain charac- 10 hundreds of hectares (or acres). Small-for-
teristics, at much higher densi1y and accu- mat cameras. flown on drones aud properly
Flautt 6-48: An ex.ample of Li.DAR data Rod depiction o(buildu:11; hcighu. This: image $.hows: tower Mauhat·
tan. New York. iii late 200l (court«y NASA).
Chapter 6: Aerial and Satelltte Images 283
. ~;
... ...-
Fis urr 6--49: .AIJ urunelc of II LiDA_R. poll,, doiid, bcrc ~ IIWAth thra.1gh 3 forcllcd am. clispfaycd O\·e:r :\
tcmaio mode). f ;.u:h pornI rq,rcsc11b a LiOAA rct1.1ru. sbowi.11g rctlXrlH from the c.anopy 11.nd tallu trees.
sub-eauopy bnnchc:$ aud shrubs. aud deus c g:rouud rrnflis in cauopy aaps.
corrected and processed, may pro,1ide eleva- various versions. up to 1.4 when this book
lion dam with a few centimeter resolution edition was wrinen. The coo"ention defines
and accuracies. which rue also possible with the stancL,rd LiDAR exchange file with a .las
LiDAR. Li.OAR scanners are more expen- file extension. forexample, mylidar.las.
sive and heavier than small cantera systems, Also note that tbere are competing. non-
an<I if there is no need for sub-canopy infor- standard. compressed formats defined by
mation, then photo-based collection systems some providers. for example, ESRI suppons
may provide 'the needed eleva1ioo data at their own "optimized" LAS fonnat. and
lower costs. As the area sampled increases. Rapidlasso has specified a different com-
small camera drone systems rapidly become pressed fonnat, with the .la~ extension. used
unfavorable due to large photo and data pro- by the USGS National Map. Fonnats should
cessing volumes. an<I sbon drone night at a minh1mm be openly defined. with all
times between the need to replace power users having access to the file and storage
supplies. resulting in long times for larger specifications. and the ability to write inde-
area coverage. pendent code 10 read and write the files.
LiDAR data are often delivered in a
standard LAS fonnat. maintained by the
American Society of Photogranunetry and Image Sources
Remote Sensing (ASPRS). The standard National. state. provincial. or local gov•
defmes the file structure. con1em, storage errunents are common sources of aerial
order. naming. codes. and all other infonna- images. These photographs .are often pro-
tion so that amy user may be able to access. vided at a reduced cost. For example. 1.he
process, and distribute LiDAR data in a stan· National Agric.ulture Imagery Program
dard way. The standard bas evolved through (NAIP) provides coverage of orncb of the
284 GIS Fundamentals
lower 48 United Stales on an annual basis. graphic maps currently in use. Camera tilt
Images are usually c0Uec1ed in true color, and terrain variation may cause large errors
but color infrared images may also be on aerial images: however. methods have
acquired, typically at a resolution of Lmeter been developed for tJ1e removal of these
or bener. Pho1ogr,ipbs are usually collec1ed errors. Terrain-caused in,age displacemem is
during mid-growlng season. The NAJP pro- ,he basis (or siereo pho1ographic de1em1ina-
gram is coordinated lhrough !he USOA 1ion of elevations.
Fam1 Services Administration. and so the Sa1elli1e images are available from a
images are some1imes referred to as FSA or range of sources and fora number ofspecific
FSA-NAIP pholo_graphs. Online and hard- purposes. Landsa1. lhe firs, land rem01e
copy indexes are :available 10 aid in ideo1ify. sensing sys1em. bas been in operarion for
ing appropriate image mosaics. nearly 30 years. and has demonsrraied !he
Aerial images may also be purchased u1ili1y of sa1elli1e images. SPOT, AVHRR.
from 01her goveromeni agencies or from pri- lkooos. and 01ber sa1elli1e systems bavebeen
vate organizaiioos. The U.S. Geological Sur- developed llm1 provide a range of spatial.
vey (USGS) and U.S. Fores ISenice (USFS) spectral. and temporal resolutions.
rou1i11ely 1ake aetial images for s~ialized Aerial and sa1elli1e images often mus, be
purposes. The USFS uses aerial in>1ges 10 interpreted to provide usefi.d spa.1ial informa-
map forest type and condi1ion. and often 1io11. Aerial images are 1ypically in1erpre1ed
requires images a, a higher spa1ial resohuion manually. An analys1 identifies reanires
and different time of year 1han 1hose pro- based on their shape, size, texture. location.
vided by NAIP. The USGS uses aerial color. and brigh1ness. and draws boundaries
images in 1be developmeo1 of digi1al or1bo- or locatious. ei1ber on a hardcopy overlay. or
pho1ographs and maps. These organizations on a scanned image. Satellite images are
are also excellent source$ of hiilorical aerial oftea in1ef])1e1ed using auiouiati!<I or semi-
images. Many governmem agencies contrib· automated methods. Classificari-011 is a com-
u1e 10 a narional archive of aerial images. mon interpretation technique that involves
some of which may be accessed via the specifying spectral and perhaps ·spa1ial cbar-
internet. ac1eris1ics conunon 10 each feanire 1ype.
The choice of pho1ogrnphs or sa1elli1e
Summary imagery depends on 1be needs a11d budge1s
Aerial and sa1elu1e images are valuable of1he user. Aerial images often provide
sources of spa1ial daia. Pbo1os and images more detail are less expensive, and are eas·
provide large-area coverage, geometric Uy and inexpensively in1erpre1ecl for small
accuracy. an<I a permaneru record of spa1ial areas. Satellite images cover large an.v.1s in a
and anribu1e da1a. ru1d 1echniques have been uniform manner. ru1d sense energy across a
well developed for their use as a cL11a source. broader range of wavele11g1bs.
Remo1e sensing is based on differences LiDAR dala are becoming a widespread
among feanires in the amount of reflected source of spa1ial da1a. Oiscre1e-re1um
elec1romagnetic energy. Chemical or elec- LiDAR arc prevalen1. providing X. Y. and Z
ironic sensors record 1be runoun1 of energy coordinates for ground and abo,,.e-ground
reOec1ed from objeclS. Reflec1ance differ- feature returns. Mos1 new. high·Iesolu1iou
ences are 1he basi:s for images. which may in digiral elcva1ion models are based 011
tum be interpreied to provide infom,alion on LiDAR dara. and building and forest fea-
the type and location of important features. tures are routinely extracted from LiDAR.
Smh?\\~dc acquisitions are becoming com-
Aerial images are a primary source of mon. and system resolution and collec'lion
coordina1e and attribure da1a. Camera-based frequency are likely 10 improve 1hrough
mapping systems are well developed. and 1ime.
are 1be basis for mos, large-scale 1opo-
Chapter 6: Aerial and Satelltte Images 285
Suggested Reading
Atkinson. r.. Tm~. N. ( 19?9), ,1<1,·1111,~ iu RemQ!e Se11si11g aud GIS tlua/y#,, N~w
York: Wiley.
Benoit, B.. Ultre-Guerard, P. (20 14). 1l1e NCES Earth observation program. (;eosci-
eucer 011d Remote Sensing i\faga:iue, 3:4 l- 50.
Bolstad. P. V. (.1992). Geometric errors in uarural resource GIS data: tlie effects of tilt
and terrain on aerial photographs. Forest Scie11ce. 38:367-380.
Brock. J.C.. Purkis. S.J. (2009). The emerging role of lidar rem01e sensing in coastal
research aud resource management. Journal ofCoastal Resea,.ch. 53: 1-S.
Broderick, O.E.. Frey. K.E., Rogan, J., Alexander. RO.. Zimov. N.$. (20 15}. Esti-
mating uppersoiJ horizon carbon stocks in a permafrost watershed of Nonheast
Siberia by integrating field measureme111s with Laudsat-5 TM and WorldView-2
satellite daia. G/Scie11ce & Remote Se11si11g. 52: 131- 157.
Campbell. J.B. (201 I). /t11rod11crio11 ro RemoreSe11si11g (Slit ed.). New York: Guil-
ford.
Congress. S.S.C. Puppala. AJ .. Lundberg. C.L. (20 IS). Total system error an.alysis of
UAV-CRP technology for monitoring transportation infrastmcmre assets. E:11gi-
11eeri11g Geology, 247: 104-116.
Dial. G.. Bowen. H.. Gerlach. F.. Grodecki. J.. Oleszczuk. R. (2003). IKONOS satel-
lite. imagery, and products. Remote Sens;ug ofEm·iro11me111, 14:23-36.
Gillin. C.P., Cody. I'.. Bailey, S.W.• McGuire, K.J .. Prisley. $.P. (2015). Evaluation of
Lidar..cterived DEMs through terrain analysis and field comparison. Phorog,.om·
metric £11gi11eeri11g and Remote Sensing. SJ :387-396.
Goetz. S.J.. Wright. R.K.. Smith. A.J.. Zinecker. E.. Schaub. E. (2003). IKONOS
imagery for resource management: Tree cover, impervious surfaces .. and riparian
buffer analysis in the mid·Atlan1ic region. Remote Sensiug ofEm'il'o11me111.
8: 195-208.
Lefsk-y. M.A.. Cohen. W.8 .. Parker. G.G.. Harding. D.J. (2002). LiDAR remote sens-
ing for ecosystem studies. 8foscie11ce. 52: 19-30.
Lillesand. T.M .. Kiefer, R. W.. Chipman. J. (2015). Re11101e Se11si11g and Image l11ter·
preralio11 (7th ed.). New York: Wiley.
Mora, B., Tsendbazar, N.£., Herold, M., Arino, 0 . (1014). Global land cover map-
ping: Current status and future trends. Remote Sensiug and D;giral Image Pro·
cessi11g. 18:11-30.
Olmansoo. L.G.. Brezonik. P.L .. Bauer.. M.E. (2014). Geospatial and te.mporal analy-
sis of a 20-year record of Landsat-based water clarity in Minnesota· s 10.000
lakes . .Journal of the American Jfo1er Resoun:es Associa1fo11, 50:748-761.
POugmacher. D.. Cohen. W.B. Keuuedy. R.E.. Yang. Z. (20 14). Using Landsat·
derived disturbance and recovery history and tidar 10 map forest biomass dynam-
ics. Re.more Se11sf11g of £1wironme111. 15 I : 124--137.
Richard.s. J.A.. Jia, X. (2005). Remote Sensing Digi1a/ Image Analysis. New York:
Springer.
Ryan. R.. Baldbridge. 8 .. Schowengerdt, R.A .. Choi, T.. Helder, D.L.. Slonski. S.
(2003). [KONOS spatial resolution and image interpretability characterization.
Remore Sensing of£11\'iroumem. 16:37- 52.
Schowengerdt. R.A. (2006). Re11101e Sensing: Models and Methods for Image Pro·
cessiug (3rd ed.). New York: Academic Press.
288 GIS Fundamentals
Tigges, J .. Lakes, T., Hostert, P. (20 13). Urban vegetation classification: Benefits of
mullitempoml RapidEye satellite data. Remote Sensing of Environmem. 136:66-
75.
Ustin, S.L.. M; ddleton. E.M. (2021 ). Current and near-term advances in tarlb obser-
vation for ecological applications. Ecological Processes. 10: 1-57
Warner. W. (1 990). Accuracy and small-format surveys: 11,e influence of scale and
object definition on photo measurements. nc Journal. 1:24-28.
Woodcock. C.E.. Strahl er, A.H. ( 1987). The fuctor of scale in remote sensing. Remote
Sensing of£111'ironmem, 21 :31 1- 332.
Yan. L.. Roy. D.P. (2014). Automated crop field extraction from multi-temporal Web
enabled Laudsat dam. Remote Sensing ofEm·fronme111. 144:42-64.
Yang. C.. Everitt. J.H.. Bradford. J.M. (2006). Comparison of QuickBird satellite
imagery and airborne imagery for mapping grain sorghum yield. Precisio11 Agri-
c11/111re, 7:J:3-44.
Chapter 6: Aerial and Satelltte Images 289
Study Questions
6.2 - What is the electromagnetic spectrum. and what are the prb1ciple wavelength
regions?
6.3 - Define a spectral reflectance curve. Draw typical curves for vegetation and soil
through 1be visible and infrared portions of the specm1m.
6.4 - Describe the stmcture and proper1ies of digital sensors in digital aerial cameras.
6.S - What are the basic componems of a camera used for taking aerial photographs?
6.6 -A camera has a 2.4 cm square digital sensor. at a flying height of 100 meters.
and a focal length of35 mnt What is 11\e ground distance. in meters. of one side of
the area covered?
6.7 - A camera has a 32 cm square digital sensor, at a flying height of 8-0 meters. and
a focal length of24 nun. What is the ground distance. in meters. of one side of the
area covered?
6.8 - You are planning a drone mission. and know your camera has a 10 µm sensor
cell size. and a 34 mm focal length. What flying height should you set if you want a 5
cm ground resolution?
6.9 - You are planning a drone mission. and know your camera has a l61nn seusor
cell size, and a 28 mm focal leng1h. Whal flying height should you set if you want a 3
cm ground resolution?
6.10 - Describe the mos1 commonly used camera band combinations for aerial pbo-
1ograpby. and their relative advantages.
6.11 - \Vital are the major sources of geome1ric distortion in aerial images. and why?
What are other. usuaUy minor. sources of geometric distonion in aerial images?
6.12 - What are typical magnitudes of geometric errors in uncorrected a.erial images?
How migh1 these be reduced?
290 GIS Fundamentals
6.13 - Identify the type of distortion in the view on imaged square grids below:
o) c)
I
'
'
I
I I
6.14 . Jden1ify the ,ype of distortion in the view on imaged square grids below:
o) - b) c)
~- r
t 1 l
0.
\... ~
'-- ).,/
6.15 . A tall building is recorded on two vertical aerial photographs. the first photo-
graph at a nolllioal scale of I:20.000. tl>e second photograph at a nominal sc.ale of
1:40.000. n,e building is near the edge of both photographs. and terrain is level
throughout the photograph. Which image wi II show a larger displacement. d. as
shown in Figure 6-23?
6.19 . What is the displacement. d. in millimeters. shown in figllfe Figure 6-23. given
the following conditions: p = SO nun. h=40 meters. H= t .000 meters (there aire 1.000
mm to one me·ter).
6.20 . Whal is the displacement. d. i11 millime1ers. shown in figi,rc Figure 6·23. given
the following conditions: p = SO mnL h=15 meters. H= 300 meters (there are 1.000
nun to one meler).
Chapter 6: Aerial and Satelltte Images 291
6.ll - Why do the buildings lean in different directions il1 lhe ilnages below?
6.22 - Wbat is photointerpretation. and what are the main image characteristics used
during interpretation?
6.23 - How are images from s.1tellite scanners different from photograp·hs? How are
they similar?
6.24 - Wby is relief displacemem usually less for satellite systems than for aerial
camera images.
6.26 - Wbat are three criteria used in selecting the type of images for spatial data
development?
292 GIS Fundamentals
293
7 Digital Data
Introduction
Many sp•1ial data cuiremly exist in digi· Data are increasingly collected in digi1al
ral fomls. Roads. political boundaries. water fom1ats. GNSS. laser meas11remen1s. andsat-
bodies, land cover, soils. elevation. and a ellite scanners aU provide primary dala in
host of Other fean~cs have been mapped and digiial fonns. They are direcily u·ansferable
convene<! to digital spatial data for much of ro other digit,! devices and GIS sysrems.
the world. Because. these data are otie.n dis· where they may be further processed. Direcl
tributed at low or no cost. these existing digi· digital collec1ion should reduce iranscriplion
tal data are often the easiest. quickest. and errors aod help maintain so,1rce and process-
least expensive source for much spatial dala ing history.
(Figure 7-1).
boundary
.v-~ ~
elevation
digital graphic
Fl,gurr 7-1: EXJin\ple.s of free digial cb1i1 :w:...illlblc 2t tt n1.ngc oftbcmct, c:ctcn1s.. 1md ~ales. Vcctor(ldi),
nstcr(middlc), iu-a gcou-ferenc:cd diiitn.1grnpliic d.atn (rigl11) nrc sbown for Kau.,i. H;rwaii. USA.
294 GIS Fundamentals
Digital data are developed by govern- erect in response to each pan, zoom, or other
ments because these data help provide basic change in display.
public services s11d1as safety, health. trnns- There are many differences among
ponation. water, and energy. Spatial data aid WMS, WFS. atl<I WCS. but in broad strokes.
disaster pl3Dning. national defense. 3Dd
a WMS is for serving canograpbic data for
infrastmcn1re development and mainte- producing and displaying maps. while WFS
nance. Many nalional. regional. and local (\'ector) 3Dd IVCS (primarily raster) deliver
governments have realized that once these data and me1adaia in ways 1ha1 ease spatial
data have been convened 10 digital fomia1s processing and analysis. Details of the dif-
for use with.in govemmem. they may also be ferences are specified in standards docu-
quite valuable for ust outside govenuneat. mented by the Open Geospa1ial Consonimn.
Business. non-profit. education. and science. W\\w.opengeospat•1l.orgldocslis. Most data
may draw benefir from the digital spatial through services are eurren1Jy pro,sded as
data, as these orga.ni1.a1ions benefited in WMS. witb few systems supponing and
prior times from govemmem-produced using 1he editing/analysis functions available
paper maps. Some data couunonly available through IVFS and WCS.
throughout the United States and the world
are described in this chapter. Web services are bener and worse than
local da1a storage. Web sen1ices save space
on 1be local hard dri"e. and only the ponion
Map Services vs. Locally Stored of imeres1 from a large data se1 need be
Data acce.ssed. The most up.to-date iofonnation
We must distinguish between da~11ba1 provided 10 a wide set of users. Mony differ·
are available for transfer 10, storage on. and em kinds of data may be joined wgether
manipulation in a local computer (locally more e.as.ily. as accessing Web services typi-
stored). from those data that are available as cally ~uires a few mouse dicks. Howevet.
Web services. inc luding Web Mapping Ser- you may often not manipulate or change
vice (WMS). Web Fearure Services (WFS), WMS dara in any subs1antia1 way. and some
and Web Coverage Services (WCS). Digital kinds of analysis may 001 be supponed or
data were first dis-tributed on physical media. allowed. In these cases. local copies of the
then via the lnremel, bul lypica.lly as elec- data 1n.1y be required. or WFS or WCS
tronic files 1ha1 were copied on10 a local developed. Map sen•ices may also require a
storage de\rice for use. You mai11tained a fast and reliable io1eme1 conoec1ion. panicu-
copy on your device. and man.ipulated those larly for large ras1er da1a sets.
locally store<I data. A WMS eliminates the For the remainder of rbis chap1er. we
need for a local copy. will primarily focus on data ava;Jable for
A Web service is a standard way of serv, download. as lhese have the fewest barriers
ing geographic da1a over 1be Internet. GIS 10 use in analysis. Thtough time. many of
software access data via an lnternet connec- these da1a may be offered via Web services.
tion and display thl.'$e data on a local and software \\~ll ease use of Web-served
machine. al1ho11gb they are "sen·ed" from data in analysis.
some remote computing system. Image data This chapter inrroduces an alphabet
are most often sen1ed. bu! vectordata may soup of acronyms. many of which are famil-
also be pro\'ided. usually in 1he fom, of a iar 10 G!S pmc1i1ioners. We include a list a1
georeferenced map backdrop. The d.1ta don ·c the end of thischapter as a refer,ence.
reside on the locaJ bard disk and are deliv-
Chapter 7: Digital Data 295
-- ~-.-·-
·-· ,-
~
.·-
..... -
Flcurf' 7-2: Di,'Cm sds of alobaJ sp:ui:al d.:ua ll:l'C" M'llilablc, :alt11ou3b chc!'C' ~ oo one cal1n1 clcllnn3bo,lk.
Tbc Vni1cd N'nti01l$.. u,,1ion:a1 go\'cnu;i)cnl.s., flltd cpcci:.li.tcdeompil1111ions by lhc:mc itrc good .ou~c-s.. :is
e,tcmplificd by the Natural .E.arth project, hltp://ww,,·.d,11dcd«lief.cooifuonirallfwd¢.,:,htnal.
296 GIS Fundamentals
globol spalial dala sets, due to !heir leader- ical s1a1ions in 173 c.ountries. These da1a are
ship in 1he devef()pment and application of compiled. quality checked. and proc~sed 10
satellite images at a range of spatial scales. create gridded daia sets for nom,al precipita-
NASA operates DistributedActive Archive tion. Data sets of annual anomaljes, the num-
Ceu1ers (DAACs). cen1ral no<les for daia ber of gages. and sysiematic error are also
access. in addition to project-specific sites provided. This was an expensive and time-
for data distribution. e.g .. Landsat satellite consuming undertaking due to lhe numberof
and derived data (Figure 7-3). Global raster differe111 melhods used to collect and repon
data sets include elevation. land use. ec-0sys- precipitation. Considerable time was spent
teru type. and a numberof measures of vege- reconciling data collection melh.ods and
tation productivity, phenology. sttuc1ure. and resulls. A more complete description of
health. tltese data is found a1 lmp:/lgpcc.dwd.de.
University centers or ad hoc collabora- There is a similar effon for undersea topog-
tions are other rich sources of global data. raphy in the Global Multi-Resolution Topog-
One ex.1mple is the Center for lntemational raphy symhesis (GMRT). combining new
Earth Science lnfom1ation Net,,•ork. admin· ocean multi-be.am sonar data wi1b previous
istered by lhe Earth Institute at Columbia ocean depth data to continuously update
University (\\~vw.ciesin.org). 11seeks 10 ocean topography.
provide global data to better address envi- Globa.l data may also derive from a spe-
romnental problems. Another example is cific space-based mission or pla1fom1. e.g..
the Global Fores, Watch a11he University of Tropical Rainfall Measuring Mission
Maryland (http://glad.umd.edufprojectsl (TRMM) or 1he Shunle Radar Topography
gloool-fores1-wa1ch). Mission (SRTM). TRMM involved satellite-
Global spatial data sets are often orga- based measurements of rainfall in tropical
tti2ed ai'<ltlud a llierue. For example. the Max and sub-lropic~I regions from 1997 10 2015.
Planck Jnsrin11e ofEnviro,unent and Clima1e It substantially increased our catalog of
in Germany h,1s led an effort 10 create a grid- observations in previously spars.ely-mea-
ded daia set of historical global precipi1a1ioo sured regions. and the data are accessible as
by combining data from 40.000 meteorolog- rasters of various metrics. including mean
Fltntt 7~3: G.Jo~I fOl'd:1 CIUlopy CO\'U d«l\'cd from NASA Land:Ult s11tcllitcs (courtesy NASA).
Chapter 7: Digital Data 297
4,
- Boundaries
-.., - - Roads
1-- • Coast
--
. )
•
Forest
POI
Flgurt 7~.t: Au c:trunplc orOpcuStrttcMap data forao flf<a Ul Galicia. ln llonb\,·cllctn Spain. Voluur«r
colL'tbonuora <:rc3te: lmc 2nd 11.nributc <bta rcprc«mi,ig imporrnn1 (c:tllltt faycrs, ,u l1igb dct:\lL
---
. --
--·\
( -·-- •
data (Figure 7-6). The stan<lardization of
weights and measures is a p.rimary ftmction
-·--·-
. of most national governmentS. and spatial
----
-- data may be viewed as measurements or
• • ---·..
·--
,.,,___.. ,...
Fia:urt 7-7 : Dig.ital dc,1ttioo mO<k1, (DE..'1.s) arc a~·a,ll.ablc tii ,11~0tl$_r«0lutio1_, fllld -co,:craJ: ar<~'·i
for mosl o( the world. This fi~ ocat Mono Lake 1ll cas.tcm C~bf~1a was ~u«d usu~ . t:i a, ,u ·
t1blc &oil~ 1hc USGS. A uring ofLwa flows is 11.pp:tmlf on 1.bc ngh1 s1dc of the unagc. ~id glX1..'ll bl:c1
nnd ,.,,Ile)'$ to the left,
302 GIS Fundamentals
ing the number and density of measured much of the United States. Currently. the
elevations. Accurote elevation data may be USGS delivers these DEMsas pru, of the 30
collected over broad areas wi1b the appropri- Elevation Program (3DEP). through the
ate seleclion of aerial mapping technologies. USGS National Map portal. Data are avail-
From the 1950s tm1il the Ja1e 1990s. mos1 able at a 30-meterresolution for.almost all of
elevation data were compiled using precise the United States. and I 0-meter resolution
mapping aerial c.ameras, complemented by for the lower 48 states and Hawaii, and al I
op1ical ground surveys. Since the Jaie 1990s. and 3 m for a large and expandi11g area. The
LiDAR mapping lrns been combined wi1h underlying LiDAR data are also a,•ailable
GNSS to more accurately and rapidly map for download. Note that global 30 m SRTM
eleva1iou. These various GNSS and survey and Aster data, described in the global data
me1hods are discussed in Chapter 5. and sec-tion of this chapter. are available for the
aerial images and LiDAR in Chapter 6. United States. but are general.ly inferior to
Laser-based elevation mapping and DEM the JO m or bener resolution data where both
generation are now common. and wilJ be exist. There is a special 5 m radar-based data
used for the fores,eeable future to create the set restricted to Alaska.
highest-resolution DEMs. GJS users should be cautious because
While LiDAR coverage extends each there are several versions of DEM data for
year. many areas :have not been flown. many areas. and they should generally use
Where LiDAR DEMs are not ye1available the more current. higher accuracy. or higher-
in 1he United Slates. most DEMs have been resolution data. The existence of various ele-
developed using metric aerial photographs. vaton data sets, covering various time peri-
Pho1ognunmeuy has been used since tlte ods. does provide the opportunity 10 monitor
I 930s 10 map lines of a cons1an1 eleva1ion change through 1in1e. for example. broad-
(contours) and spot heights, and data have scale mining modifications in K-emucky
been developed for much of tlte Earth's sur- (Figure 7-8). The most cnrre111 USGS data
face using these techniques. are best accessed through the USGS Earth
DEMs wi1h J.•. t 0-. and 30-meter hori- Explorer or National Map portals.
zontal sampling frequency are available for
f'ia111•t 7-8:: The c,ctcut of ruouniaiutop mnon1I slrip mining in eastern Keotuck)' is e-\;deo1 in this ~pllt-
ison ofolder NED data (]cft) aud ycat 2000 SRTM clc\'ation data (ri~t), Rid~ctopi att li¢lt colo~d. \'alley
bonon-.s d.trk colored. Mines 11ppc::1.t M :irut ofunifonn 10!1< on ndicto~. .Mine 11ifcS l.:tt:idcd o, abo\'c,
h.a\'C c:<paodccl $\~antUllly, while l.tirgc DCW mines have aho b,ceu developed (b).
Chapter 7: Digital Data 303
f
I
Fiaurr- 7-9: An c~:.imp1c of mb-~sio cbtn obtnim:d from 1bc Nntioml Hych.gmpby O;uasc1. A number of
fc:,l\n types are rq,rcse11tcd. incfod-ing catduncots. w.itcr flowliues... and lakes. Tabu.Jardftt:1 arc pro,i<kd
on ctti..matcd :1,·craic flow. catcbme:ot llt('a. aud other impo1111nt attn1n1t«.
304 GIS Fundamentals
20 km
13 mi
Flgutt 7-10: An u .:uttplc of i:tdtcd HUC dm.in11g:c ltre.M for 11 portion of du: C11pc Fc:11.t River i 1:.; N011h Caro,.
lin.,. Higher l:llJC <.kt:igm,1,lqr-$ uc5t within lower du-igu11tors, s,o HUC-12 nrc~s lclfC coot;,ined wil.hio HUC-$
tmit5.
providing a nes1«1 set or drainage areas for characteristics are available at Imp://
areas larger tban Che HUC-Scatchments. cfpub. epa.gov/surf/lncatefondex .c fm.
HUC-8 ca1cbmeot.s may in turn be split imo There are od,er improved hydrologic
smaller HUC-10 and HUC-12 catchmems. data, called NHDPlus and NHDPlus HR.
this last size typically 1he smallest delinea- Managers oeed consistent elevation. stream.
tions widely available. and watershed boundary dala seu at high
The United States EPA also provides resolution 10 soh e m..1ny water resource sci·
1
data on wa1ers and watersheds of various cnce and planning problems. Tb.e original
types and formats.. organized to correspond NHO produced in the early 2000s focused
10 the HUC data at some levels. EPA River on hydrographic data from I:100.000 scale.
Reach Files organize data in a series of ver- Subsequent work has focused oo improving
sions, from RF I through RF3 data. RF3 d.1ta the accuracy, consistency. and tools to sup.
are designed 10 provide a nationally consis, pon NHDPlus data. a11d witli subsequent
tent hydrographic database that records versions using improved digital elevation
geography and assigns unique identifiers 10 data and enforcing consistency with other
all surface ,vater features. It aJlows the data sources.
hydro logic ordering or reaches so that larger
There is an emetging system for storing.
rivers and seg.meots may be accurarely finding. and retrieving hydrologic data asso-
defuied, along wilh river connectedness and
ciated with the CUAHSI project
Oow direction. RF3 da1a also record the (",vw.cuahsi.otg). CUAJ-ISI is a Na1ional
locations and characteristics of additional Science Foundation funded proj-ect involv-
elements, i11cludimg gages. dams. and other
ing mo!\' ~J~n 1,Q yoivenitie$ !9 iupp.on
hydrologic feaiures.
hydrologic science and education. CUAHS
Ri"cr reach d.ata are precursors to NHD HIS is an internet-based system for sharing
data. and so contain much of the same base hydrologic dala via a Web se1'' ice. As noted
infonnation. Rf files are available for mos1 earlier, a Web service is a sel of protocols
ofthe comiguous United States. Tab1tlarda1a that allow communkatioo amo11g computer
on water chemistry and other watershed programs over the internet. In G-IS these
Chapter 7: Digital Data 305
Web services are most often used to stream- ers do normaintain their conneclion through
line daia sharing and access. The CUAHSI wa1er bodies. Despi1e 1hese limila1ions. they
HIS is designed 10 aid 1be in1egra1iou and are often used because 1bey may be more
sharing of disparale hydrologic data, such as appropriate for statewide or regional analy-
s1ream gauge. precipilatiou. river locaiiou. sis involving only the nu\i.n s1e.ms or larger
basin 1opography or oilier basin charac1eris- rivers in a region.
tics. lntegra1ed. consistent. continent to
There are orher bydrologic da1a sets worldwide hydrography data are also avail·
available for the United States. more gener- able from tlie ''Namral Eartb" data projecls
alized. and mos1 often used for analysis or (www.naturn.learthdata.com). These da1a are
display over larger areas. The USGS pro- in1ended for use in cariograpby. and no1 pri·
duced digital. n.11ionwide cL1ta sets based on marily for analysis. as they have been gener-
paper I: 100,000, I:250,000, I: 1,000.000, alized and made consiste.nt primarily for
and smaUer-scale maps (Figure 7-11). and display ralher lhan geographic accuracy.
these are available from various state. Different hydrographic daia are offered. lat·
national. local, and private sourc.es. These ge1ed al a range of small scales. for regional
data show larger rivers and a Lim.iced sec of through global mapping. A limiled sei of
attributes for each river. most importantly anribu1es is avai.lable. including reach or
river names. These data are also nol hydro- river names and cartographic widths.
logically co111inuous. in thal many of1he riv-
Flgurr- 7,11: AJI cxaitiplc of USGS lcg:11.cy I: 1.000.000 1lydtologie cb1.t1, wi1b the n.,iioo11.I d11.1a Kt
(inKI, right) 1md ll portion in cn,c-cnJlrl:ll C.1Jifomi1t dq,ic1i.ng 1bc lc\'cl of dctn.il in thew d.111t.
306 GIS Fundamentals
High-Resolution Digital Images scanned and are available from the United
States EROS Data Center. Image sets
Digital itnages are available from a include the historical black and white aerial
range of sources. including na1ional. state, photographs. nation-wide programs of the
and county governments. or from private 1980s and 1990s. higb-resolutio.n coostal
co11trac1ors. sa1elli1e imaging companies. images. radar. and other special collections.
and resellers. High-resohuion digital image
data are typically collected every five 10 ten The High-Resolution Onbophotographs
years by the USGS. in jl'1rtnerships with (HROs) series are among the highest resolu-
states or 01her goverrunent agencies. Image tion. wide.Ly available image data sets. The
archives go back to the 1950s for most of the HRO images are collected and distributed
Unired States. with state and coumywide through the USGS with currem coverage for
coverage as far back as the 1930s. Nation- about l/3 of the U.S. These data are often
wide coverage was completed in the l 980s collected at 0.3 m (I ft) resolution, and at
and 1990s through the NHAP and then times up 10 IOcm (4 in) resolution. Because
NAPP programs. at scales in the l :40.000 to they are onhophorographs. object base Joca-
t :60.000 range. These fom1ed a primary 1.ions ha\'e been correc1ed for tilt and terrain
basis for digital orthopboto quads. or DOQs, distortion at ground height. To,vers. build-
the first onhographic. high-resolution digital ings. bridges. and other tall objects often
images with narional coverage. Most other appear to lilt, as these structure heights
images from before the early 2000s we.re above grotuid are not corrected on the
mm-based. although some have been images (Figure 7-12). These images are use-
f iJlll'f' 7-11 An.c~mp)e: of a tligb-R~lut~on Ortbophotogrupb (l;IRO). dis.triboted b)' the U.S. Ge:olog-
1C11l SUl'\'c y. ludi\'ulual pcno11Hat1 be 1dcntifie:d oc:ir lbe 011.w<>lc m the upper kft pa,.1 o(lbc .l$1.aud. as
we:11 :1..s 1hc ttll.l\lC a nd sb:«low to dte: low« rigbL
Chapter 7: Digital Data 307
Flznrt 7-13: An c:u.mplc of a historical aerial photottrnpb from the 1940$ (left) :u>d ZOOS (riglu). for an
att:i in ctit-ecntnil M.inncSO(:l. E:uiy dc:,·dopmcnt wai rcstric1ed 10 i:ic-:rtt the 1.llkc alonj tile cop ml\tiin of
the 1940 phoJO, while b)' l00$ 1hc ;m.:3 had bcc:o(iic co,inplctcly sub,:z-b,init cd. Pl1010,~hs tU:l)' be used 10
map c,incut in.&astroctutc and rcso~cs. :wd llmr c.hangc through tlllK.
308 GIS Fundamentals
Images are most often collected as nalu· National L and Cover Data
rat color. digi1al 11erial photographs.
allhough some1imes infrared bands are also While land cover is impor1rui1when
collected. NA£P images are onhorectified managing many spatially distributed
and provided at I· and 2-meter ground reso- resources, data on land cover are quite
lu1ions. with corresponding horizontal accu- expensive to obtain over large areas. These
racies at 5 to JO nt Oala are typically data are often scarce. at low categorical or
provided in au NAD83 UTM coordi11a1e sys- spatial resolution. and rarely available over
tem corresponding 10 the image area. broad areas. While individual states. coun-
ties. metropolitan areas. or private landhold-
NAIP images are most useful as a base ers have developed detailed land cover
for digitizing. panicularly when infonna1ion maps. there have been few national effons co
on vegerarioo type or condition is important map laud C0\1er in a consistent manner.
(Figure7-t4). Leaf-on NAIP images are There are four consistent national data sets
more useful for ,napping vegetation. because available, based on satellite dat• from a con-
differences are most often expressed in the sisteut set ofcategories. and a legacy da1a set
color. brightness. and texture of foliage. from the 1970s and early 1980s.
While the uarural color images typically
used for NA.IP im<1ges are inferior to infrared The National Laud Cover Database
image.s., substantial infonnation on vegeta. (NLCD) is the most recent and detailed
1ion can be collected. and sometimes 1he source of national land cover information.
NAIP images include an infrared baud. This. NLCD versions are produced in a coopera-
plus the annual image colfec1ion cycle, make tive effon by a number of United States fed-
these images a valuable source of spatial eral government agencies, under rhe Multi-
data. Resolution Laud Characteristics (MRLC)
Ftgutt 1 -1.a: An dtll1nple NA(P i1n:1gc $:bowing wcdmw:ls (Ai lll~s {8). fores, (C). :ind rc11idnt1i:1l "rc:\S
(0). The5C im.1g« arc ~t1icularly nKfiil fen- b.odco,·er and land use aw:Hmci11s ruid dc1«ti13g fiuc-dc1ail
changes at ruumal time $ICps..
Chapter 7: Digital Data 309
Consortium. so these are sometimes referred accuracy and categorical deiail possible
to as MRLC dara. The consoniwn's goal is a through spec1111l data alone. These data
consi.s1ent. current land cover data record for include digital elevation. slope. aspect.
the oontenniaous United States. NLCD data Bureau of Census popula1io.n and housing
were created in 1992. and then with an density data. USGS LULC data. National
expended set of categories approximately Wetlands Lnventory data, attd STATSGO
e.very two to three years from 200 I through soils data.
2019. Plans are 10 continue production into Data are processed in a tmifonn manner
the funire (Figure 7-15). within each state or region. and a n.1tional set
NLCD land cover classifications are of categories and protocols is followed. All
based primarily on 30 m Landsat Thematic classifications were subjected to a standard·
Mapper data. NLCD 1992 land cover was ii.ed accuracy assessment, a.od reported and
assigned to one of 2J classes. full coverage delivered in a standard fom,at. Accuracy
is obtained from adjacent or overlapping. assessmenis were based on NAPP or other
cloud-free Landsat inmges. Multiple dates 111ediWlt· to high-resolu1ion aerial photo·
are often acqwired in order to improve accu· graphs or. in later versions. with higlHesolu·
racy and categorical detail through pheno- 1ion satellite data. Areas were stratified
logically driven changes. For example. based on 1he images. and sampling units
evergreen fore.sis are more easily distin- defined. Photointerpre1a1io1u of land cover
guished from deciduous forests when bath were assumed true. and coinpared 10 NLCD
leaf-off and leaf-on inmges are used. 0~1er classifica1ion assignmen1s. Errors were
spatial data sets are used to improve the noted and reported using standard methods.
Fi;urr 7-15: A 1:imc series of N~tiou3,J L,,nd Co,-cr O~L, {11-'LCO) for ~u n.rc-:i north of Las Vcg:i.1., l\'V.
NLCD c:-atc:-gorizcs bnd c:over ioto 2J classes. aud ~ pro,;&:d in a JO.meter r.\ster «II fonnat, In this
scrin:. incrcasi.u; u~niz.atioo is sbo\\1l in dtc scpreadil1a toucs ofdarker red. with •iric-ulflll'al ar~s: iu
lig:httt IOllC:S of gray {COW'1c$)' ~iRLC),
310 GIS Fundamentals
NLCD 200 I analysis was refined to canopy cover, and an Urban lmper\lious
yield morecateg<>ries. higher accuraey. and a product shows the percem of rainfall-imper·
more uniform classification. Landsat data vious surface for each classified urban pixel.
from three periods. digital elevalion dala, important in estimating runoff a11d flood
population density, road locations. NLCD hazard.
1992. and city liglns data were used {Table
7-1). and previous data recoded to create a
consis1en1 time series. The base data were NASSCDL
also used to estilu,ate perc;ent impervious sur- The National Agricultural s·tatistics Ser-
face. and tree canopy density. NLCD 2006 vice (NASS) produces yearly Cropland Data
and 201 t again refined methods. incorporat- Layer {CDL) da1a, land cover maps that
ing information on previous classifications. foctJS on distinguishing major crop types and
maintaining categories bm improving. accu- ro1ations (Figure 7-16). These dma are cre-
racy and uniformity. ated from a combination of exis•ing land
The MRLC p:rogram also producesother cover data for nonagricultural lands, multi·
regional to nationwide data streams. The date images from mid-resolmion satellites
Rangeland Condi"lion. Monit<>ring. Assess- such as Landsat and Resourcesal· I. coarse
ment. and Projectfon data (RCMAP) provide resolution but higher frequency MOD IS data
percent cover for each 30 m pixel of shrub, for pbenological discrimination. and various
bare ground. sagebrush. and annual herba· vec-tor d.1ta layers to improve classification
ceous vegetation for western U.S. range- accuracy.
lands. Tree canopy data show U.$.-wide tree
Tahir 7-1: "NLCD 2011 land cover classes. Classes have varied sliglnly 1!,rougJ1
vers ions.
Water Shrub/and
11 open water 51 dwarf shrub
12 perennial ice/snow 52 shrub/scrub
CDL land cover classific.ati-On is based While NASS-COL data are the most up-
on exieusive ·field surveys conducted by the 10-date and accur.ne land cover classification
United States Depanmen1 of Agriculture. for agriculmral lauds. 1bey have limi1a1ions.
Fields are vis,ited, airphotos obtained, and Classification for oonagricu ltural lands are
fields. farms. and regions classified by domi- not as rigorously grom,d tru lhed as agricul-
nan1 crop types and rotations. Observed crop mral data. and depend on ol<ler NLCD clas-
types are compared to spectral data from sat· sifications. Land cover is classified only for
e1Ji1es, and a dassifica1ion algorithm devel- coun1ies with agriculture, although this is a
oped. Classification methods have changed surprisingly large proponion of the country.
since 2002. the yearnationwide data became The 30 to 60 mcell size is quite good for
available anm,aUy. Class assigrunen1 accura- such a large-area classification. but still 100
cies are generally between 85 and 95% for small for field-level assessments. and is bet-
agricultural crops. ter suited to fann.fevel and larger analyses.
COL dam is produced annually for most
regions. allowing analysis of 1rends in pL1nt·
ing. crop rota'lions. and harvest Data may be
d0\\~1loaded for regional. statewide. or sub-
srate areas. in standard fonna1s and coordi·
nate systems.
--
tOro.t«tQ'/ ~~
Ag""1,,1lt1,1,-I
---
--
o -
. ....
-
.....
w.,. W'II.IISo,b Ott ~
Cl""'"'-
--
- or.~~
- o..,s.,,.c;,..
-----
n -.
o--
- °'""",...,.,,,.
--
Hon~lieu.lltnl
---
D oi.•'"-~
--
~-
-,-~
--
FJaurf' 7. 16: A11 example ofNASS Croptnnd D!'lt:l L:'1.}'tt i11f0111.Li.1i011 on npiculNr.11 fand cover, bac f<n II
regioo io eastern K.amas. TIJc p.dtcm is dominated by row crops. chiefly soybcom and c()ru, tbc rectaug11·
lu blocks tba1 iocrc:asc io ,izc toward the oortb~slem portion of the figure (councsy USDA).
312 GIS Fundamentals
·...--
· ~UNI.._.,
-. .
-~~'-·
0,_._.. v....
·--
E--
.._.._~,_..
·----
~,....
___
,....._..................
• .._,_.......,w_
.._,_
·--
.-- . ...
U,.W,,,
•o-·-
"-"-...._
,_,...,.__
attn I ....
fignN> 1- 17: An c;,1-'lmplc ofC..CAP d111a. \\-ith n more dset11,ilcd set oflnnd c(W« da$$c1 1h20 other
$0lll"CC$ ";th oatiouwidc co,·crage.
Chapter 7: Digital Data 313
- •
~
•
"
,I;_
• , ,,?
j~
~~ ••
•
•
- ... 'I
,.;tC.:
. ,_!...
•
......
l!'N'hlwllO
.. ·.~. ..........
•
Slvubltw,b.t«ou,
Mbf)Cf htn»(~
.,.,Mb:tcl T1hlvitl1b
Ul#.Pood
F'iau"' 7.18: An a.imple of n111ioii;.11 wc1l~nclt in\'critory (NWI) ci.t1a. Digi1.i1 NWI ~t.'l arc ~wiil:ablc for 1nos1
of the Uuiied States. and prodde information on the 1oention lIDd ~bllfllCteriJries of ,,·etla.nds.
Spring photographs typically record times of and a modifier indicating lhe wetland is
highest water tables and are most likely to intermittently exposed (G).
record ephemeral we1lands. There is sub- The mi11i111u111 mhppi11g 1111/1 (MMU) is
stantial year.10-year variabili1y in surface the target size of the smallest feature cap-
wat·er levels. and hence there may be sub- rnred. Feamres smaller than the MMU are
s1an1ial wetland omission when pho1ograpbs not recorded i.n these data. l\'WI das, typi-
are acquired duri.ng a dry year. cally specify MMUs of between 0.5 and 2
NWI data provide infonnation on wet- ha. Mi.ills vary by vegetation ,ype. film
land type through a hierarchical classifica- source. region. and time period. MMUs are
tion scheme. ,vi1h modifiers. Wetlands are typicaJJy largest in forested areas and s.maJl-
categorized 3$ pan of a lacus1rine (lake). pal- est in agriculmral or de,-eloped areas.
ustrine (pond). or riverine system. Subsys- because it is more difficult to detect many
tem designators then specify further forested wetlands. MMUs also tend to be
anribu1es. to record if the "~tland is peren- larger on smaller-scale pbo10graphs. The
nial. interminent.. linoral. or deep water. Fur- tv!MU, scale, and other characteristics of the
ther class and subclass designalors and wetlands dala are available in map-specific
modifiers pro,1de additional information on metadata.
wetland cb,iracteristics. A shorthand desig- NWI da1a do not exhaus.tively defme the
nator is often used to specify the wetland location of we1lands in an area. Because of
class. A wetland may be designated the photo scales and Jllethods used. many
LIUB2G. as systelll= lacus1ru1e (L). sub- wetlands are not included. Stannory wetland
system= linmetic (J ). cJass = unconsoli- definition typic~lly includes not only surface
dated bottom (UB), with subclass- san<I (2),
314 GIS Fundamentals
water, but also characterislic vegetation or lhe coolinental United States, developed
e,idence on the surface or i,1 1be soils tba1 from smaJJ.scale maps. NATSGO da1a have
indicates a period of san1ration. Since this limi1ed use for mos, regional or more
saturation may be transient or the evidence delailed analyses, and will no! b,e further dis-
may 0011"' visible on aerial pbo1ograpbs. cussed here. S1a1e Soil Geograpl,ic
many weliands m.ay be omined from 1he (STATSGO) daia are imermediaie in scale
NWJ. Nonelheless. NW! dala are an effec- and resolulion. and Soil SUf\•ey Geographic
1ive 1001 for idelll ifyu1g lhe Joca1ioo and (SSURGO) da1a pro~de lhe mos, spa1ial and
ex1em oflarge wellands. 1be 1ypc of we1land. ca1egorical de1ail.
and for directing funher, more detailed
SSURGO daia are inlended for use by
ground surveys. land owners. farmers. and planners at the
large fann 10 counly level. SSURGO maps
Digital Soils !Data indicate the geographic location and extent
of the soil map uni1s within the soil survey
The Narum! Resources Conservation area (Figure 7-19). Soil map unilS iypically
Semce (NRCS) of1he Uui1ed S1a1es Oepan- correspond to general grouping. called
me-nt of Agriculture bas developed three dig- phases. of de1ailed soil mapping 1ypes.
i1al soils da1a sei.s. These da1a se1s differ in These de.1ailed lll3J)ping iypes are called soil
1he scale of lhe so,irce maps or da1a. and 1h11s series. There are approximately 18,000 soil
in the spatial detajl and extenl of coverage. series in 1be Uni1ed S1a1es. and several
The Na1ional Soil Geography (NATSGO) phases for lllOSI series. so !here are po1en-
da1a se1 is a higWy generalized soils map for
F'izm~ 7.J9; An ex11,1nplc of SSVRGO digi1~1 $0il'-da.1~ iw\\ibblc &oru the NRCS. Each poly~on rcpre$Cnlf
a soil mapping uni, of rcfati.\'cly wiifonn '°il propeni«.
Chapter 7: Digital Data 315
tially a large number of map units. Only a MUIR data report a range of values for each
small subset of series is likely 10 occur in a soil property. Ranges are deienuined from
mapped area, typically fewer than a few representative field-collected samples for
hundred soil series or series phases. A few 10 e.ach map unit or from data collected from
thousands of dis1inc1polygons may occur. similar map m1i1s. Santpies are analyzed
SSURGO data are developed from a using standardized chemical and physical
combination of field and pho10-based mea- methods.
surements. Trained soil surveyors cooduc1 a STATSGO digital soil maps are also
series of field transec1s in an area to deter- available, at asnmller scale and over broader
mine relationships among soil mapping units areas than SSURGO soil da la. STATSGO
and terrain. v.ege1aiion. and land use. Aerial data are 1ypically created by generalizing
photographs at scales of 1:12.000 10 SSURGO data. STASTGO map units are
l :40.000 are used in 1he field to aid in loca- larger. more generalized. and do not ueces·
tion and navigation through the landsca~. sarily follow 1he same boundaries as
Soil map unit boundaries are then interpreted SSURGO map units. ln addition, STATSGO
onto aerial photographs or corresponding polygons contain from one to over 20 differ.
orthopb0togropbs or maps. Typical pb0to ent SSURGO detailed map units. Each
scales are I: l 5.840. l:20.000. or I:24.000. STATSGO map nnit may be made up of
These maps are then digitized in a manner thousands of lhese more detailed SSURGO
that does not appreciably affect positional polygons. and many different SSURGO map
accuracy. Soi I surveys are often conducted unit types can be represen1ed within a
on a comlly basis, so county mosaics of STATSGO polygon. STATS•GO data provide
SSURGO daca are conunoa SSURGO dara infonna1ion on some of this variability. Data
are reported 10 have position.ii accuracy no and properties on multiple componems are
worse than IJ m (43 ft) for approximately preserved for ead1STATSGO map 1mit.
90% of the well-<lefiued points when
SSURGO daia are compiled at I:24.000
scale. Digital Floodplain Data
SSURGO data are Jinked 10 a Map Uni1 Floods cause billions of dollars in dam-
Interpretatioos Record (MUIR) auribuie age each year iu lhe United S1a1es: losses
database (Figure 7-20). Key fields are pro. could be rech1ced wi1b the effective applica4
vided wilh lb~ SSURGO daia. including a lion of GIS. A first slep is the mapping of
unique identifier roost often related to a soil flood-prone areas. TJ,e Federal Emergency
map unit,. known as the map unit identifier Management Agency (FEMAJ develops and
(muid). Tables in 1he MUIR database are disseminates flood hazard maps. commonly
linked via the muid. and oiher key fields. known as floodplain maps (:Figure 7-21 ).
Most tables contain the muid field, so a link These maps locale the b0t1Lt@ry of areas
may be creaied between 1be muid ,.,,lue for a ,,~th a I% or higher annual ,chance of flood-
polygon and 1be nmid value in anoihertable. ing. commonly known as 100-year flood-
such as the Compyld table (Figure 7-20). plain maps.
This creaies au expanded 1able 1hai may be FEMA occasionally updates these
further linked through cropname. classcode. floodplain maps. The objectives are 10
or other key fields. These kinds of lab le develop nmps of flood hazard ,ia an
structures and linkages are discussed in improved process, with better input dala. in a
Chapter 8. unifonn digilal fonnaf. and 1o integrale map
Variables include an extensive se1 of soil creation ioto ongoing local and s1ate govern-
physical and <hemical pro~rties. Data are ment mapping and planning efforts. Updates
reported for water capacity. soil pH. salinity. are particularly important given Ille changes
deplb 10 bedrock. building sui1abili1y, and in precipitation intensity and willt sea 1ewl
mos1 appropriate crops or other uses. Most rise due to changing climates.
316 GIS Fundamentals
r •
--... le:]
- -
'1 LOOKUP TABLES
MAP UNIT TABLES AREA TABLES
~~" ..., s..... -=
..... ..."""''...
«.&l:Ulll
ll$t>Q,d
....
K~ """"' ........
....,.,
"""""" """l,tle.!c,c:,
S1st04
_......
,1i:1~
"'.,._
,1,tod
(flfy{.Ol.k ,
CIO,cOOC
...
-"""" =- ~- J
""""" _,,
nv.c;:,nc lonc!fmo
,...,,,... .,_.
""''"''
""""
..,.,,.
"""""""'
.,,_,,
=l~~I
~tll(I
""""'' .........
"""""
.......
, 1c1u,
«ltt.>t
<dd<'<
""'"""
""""'
J
"""'
""""
""""""'
,COl';!Of\
toll(l1'9
ollwtfOm
•
-=- _
...... -..... -~ 1
LAYER
TABLE
"""'
rI .Cl)N.t>ON.JIT TABlES
'"""
"""
""""" """"
""" ..... ......
...... , '"'""'
-
""""" "''"'"
""""" '""'""'
'"""' ""'"""'
I.......
COl'l;)IIOmc ffll(11 SUI Code
"""'' .....
"""""'
lCl ··-..... ~ -·
...ooqllod
''°'' _. .......,..
"""
f'IC>l-¢1 $lTlfb
...,,..
...,,...
..... -·
"'""
MQ,U,1 I
........
'"""""'., """""
Mopped urnt of o
eeeil cloy soil f'..1lwl
......... =
"""'
PCl"lt GIW
-
..........,...
...... ......
""''""' "lc:rllpct
-~·
R'°""
"""
""
nrone
043CeC
M.oound ·.--.:muld
mulcl 0.t3CeC '~=~~·~0.C3CeC
l~d r YlduMs com
crop~me
munome Cefes com .I Ct'OpnO!'M; yldunrts bvshel/ocre
lrryld 80
~
--._.,muid 0-43CeC Toxcioss
closcode.- 124f loscode 124'
reaction. acid
f1J.Urt 1·l0: The database scbcma a~sociatcd with the SSURGO digit,I soils data. \ ariab!" dcmibinJ
soil chnnctmstic~ ate provided~ a ut of rclaiable fables. Reys ll1 e~h Ulb]e. s.bowu i.u bold,. e!~wide
n<:«st 10 i1ems of intc:ru1. Codes provided with the dii,it:iJ gcopnphic (L,rn., e.g., the muid, pro,:, dc ~ link to
tb'*' data t.ablc,. TilC rcl.atiou of a m.a~d 10il polygou to attribute cbta l5 $.bown in the c.-uruple 111 the bot·
tom. The ,nuid l$ rcla1cd from the MAP UNIT nnd COMP tables. wbicb in mm an: med 10 acc~s otbtt
\':uinbles tbtough ::.dditionn.l l:e)'S..
Chapter 7: Digital Data 317
Fl; urt 7-.lt: An cxatnplc o(FEMA Ooodpfain d:ua fot:. ~ 3ior.1 ne11r Motgan, Geo'1lJ'. USA.
Floodplaafl maps are used for a number revising maps are defined as part of the data
of pUi'pOS<!S. eliief among them setting Oood evaluation pr(>(ess.
insurance rates. Over l 9.000 communities
participate in the National Flood Insurance
Program (NFIP). This suppons federnl gov- Climate, Geology, and other
ernment 10 guarantee Oood insurance for Environmental Data
conm)wtities with floodplain management Other spatial environmental data sets are
ordinances. Ordinances reduce flooding available. including clima1e,, water chemis-
risks for redevelopmem and new construc- try, energy resources. Here we provide
tion. thereby Jeducing losses. examples, but there are many others.
Digital floodplain m•ps •re produced to TI1e National Climatic Data Center
define regions within a IOO-year floodplain. (NCDC) maintains historical climate records
Boundary accuracy may be challenged. usu- for the United States. aod provides their data
ally by individual landowners, businesses. or through a Web portal
municipalities. and the proposed adjust- (https# w\\,v.ncei.nooa.gov/maps-and-geo-
ments evaluated and included in the flood- spatial-products). Recording srn1ions may be
plain map if they improve accuracy. selected by various criter~,. including geog-
Maps are most often produced by coop- rnpby. measured variables, or length of
erating teclmical panuers (CTP). Expertise, record. Climate data ba"e beeo con,•ened to
trnining. an<l ,demons1rnted capabilities are spatial fields. and are distril>uted through the
required of eiich CTP. Best technical prac. PRISM initiative (prism.oregons1a1e.eou.
tices for digital floodplain data development see Figure 7-22).
are defined by FEMA. and training is offered Mineral resources da1a are available
to teach best available methods and increase from ct,e United States ~ological Survey. at
dalO quality. Protocols for verifying and hnp://mrdata.usgs.gov/. These data iuclude
maps of basic nalional geology, as well as
318 GIS Fundamentals
di--· p•- -4
ferem record types. A'°llec1ion of records
repons the localion and attribute infom1ation
about a set of census features, including the
location, shape. addresses and other census
auributes for a county. Ther.e is an identifier
based on the United States Federal lnfonna·
tiou Processing Standards code (FlPS) that is
used to identify the file and record type.
Census data are distributed as ESRI
Geodatabases and as shapefiles for \larious
geographic unils. In addition. specialized
Flgurt 744: TIGER dat11. pro\'idc 1opoJOllc11J software Jl"Ckages are available to ingest
encoding of ~ior., li.ucs (c.bains). n.ud polygons TIGER/Line and related ceusus and other
(U.S. Depl. of Comm~e),
federal government data files to data layers
in specific GIS formats. Dara are collected
on business activities. health. crime. among
TIGER/Line files contain infonnation to other topics. These dala may be extracled in
identify s1ree1 address labels. Starting and a customized manner.
ending address numbers are recorded corre-
sponding to sianing and ending nodes (Fig- Many United S1a1es government data
ure 7-25).Addresses may then be assigned are provided with codes compatible with
within the address range. The system does Unite<I States Census Burem, data. For
not allow specific addresses 10 be assigned example, the U.S. Department of Transpor-
to specific buildings. Howe\'er. it does tation, and the U.S. Center for Disease Con·
restrict the addresses on a city block 10 a li.111- trol deliver with codes needed 10 link
-
101-119 Oak Ave U;FT
• 100·118 RIGHT •
sro,r NO<Ht E"'1,,,_
Fisur" 7-25: TlOER do.la pro,·ide address nui,ges for line <cha.in) segments.. The" ranges in.a)' be dis·
tributcd acrou lhc line. a.hina, appro:cirua1c building. locanoos on utrcct (1J.S. Dept. of Conuncrec).
320 GIS Fundamentals
Traffic Deaths
per 100.000
CJ No Report
0 <10
- 10to20
- 20to30
- Over30
Fia.urf" 7 26: Trnffic fa1:a1ily <bin. sbowing ~,·en'lge uurobn-of de.11th$ per J00,000 pc1$0US o\'cr J99i-2006.
4
dc:ri,·ed &om data Rportc:d by the: U.S. Ccnlc:1'$ for Oisca~ Control. ni.esc: data are published "•itb links to
U.S. Ce1m1s rcCO$;uizc:d _-cov,i.pbics. as 11tc much odlc:rda1a c:olle¢ted by the: U.S. federal a,o,·e:111mc:lll.
Hc:re, F«~J ln(01m1uion Procctting Sra1i,brd {FrPS) c<>dc:s were: published ,,itb I.be: CDC' d.ull, :utd used
to link to CO\lllf)' b,o,,1ndnry files. Note the gcncrnUy high dctitb rntc:$ in some: ,outhcm ;md inle:rior w«tc:m
cotmtiC$. and th.it lNc:w Yod. driw:r5., counter to reputation. appc.ir to be amonj: lb<: &afe5't ou tbc: roads.
fiau~ 7-27: Oata, a,iiilablc for dowulo!ld may be in a UOJMargct da'-' mode1. be~ ,·cc1or polygom• !hat we
wish to use in nstcr processing. A \'e,ctor 1nodcl bolds no info1'lll3tiou for areas ouisidc of polygoui. R..'lS-tcr
1nodcls ddi11c ,-alud C'\"Cl')'Whctt iu tbc:it cxtau. "No ~UI" \'alucs nu~· be :.ssi311cd for,~ w:iddincd
rcgioos. which m3y c-onsimi.u fortbc-r processing. until they tire c::bn.ugcd.
Suggested Reading
Broome. F.R•. Mcixlet. 0 .8 . (l990). The TIGER d:,tab.'l&e stmccurt. Ctwrogn,pli,r ,md Get.r
grapluc lnforn1atio,r Systems. 17:39-47.
Di Lu'bo. M.. A.mold. J.G .• Srinivasan.. R. (2004). lntegruiou of SSURGO 013pS and soil
parameters within a gfi3graphic information system and nonpoint source pollution model
system. Jo1m1nl o/So,1 aJtd muer co,,serwmo,r. 59: 123- 133.
D,inca.n. O.T•. Aldscad1. J.. Whalen. J.. Melly. S.J .. Gonmaker. S.L. (2011). Valida1iou of 1he
Walk Scor<.'" for estimating neighborhood waJkability: an analysis of four U.S. 1uetropoli·
t.to are-as. Im ematio,iol Jow·11ol ofEmfromne111ol Ruem"th ;,, Pub/,c Health, S :4 l 60-
4 L79.
Gcscb, 0 .. Oimoen, M .• Greenlee, S., Ndson, C.. Steuck. M .• Tyler, 0. (2002). The national
elevation dalll-Sel. Phorogmmmetnc £,igi,uwillg n11d Remote Sensmg. 68:5- 11.
Goodchild. M..F... A.n$eJin. L.. Oeiduuann. U. ( J 993). A fr~uework for 1he areal i,m:rpola.tioo
oi socioeconomfo dam. E,mromnem and Plnm,mg. 25:3$3- 397.
Harvey, F.. Leung. Y. (Eds.) (2015).. Adwmces m Spatral Data Hondli11g a11d A1inl)'s.is. Bertin:
Sptlllger.
Homer. C•• Hu31lg. C.. Yang. L .. Wylie, B.. Co..,u. M. (2004). Development of a 2001 natiOn.'-I
landcovcrdatabase for the United States. Photogmmmetric £11gi11e0·i,rg a,rd Remote
Se11Swg. 70,:829-840.
Lytle. O.J .. Bliss. N.B .. Wahmttn. S.W. ( 1996). lnterpreting the Stme Soil Geog.niph.ic Oa1a-
basc (STATSGO). Goodchild, M..F., S teyacrt. L.T .. Parks. B.O., Johnston, C.. Maidment,
O.• Crane, M.. GleDdinniug, S. (Eds.). GIS a11d EJwfromnemnl !dodeli11g: Progn1s.s and
Resea,.ch /$$11es. Fort Collins: GIS World.
Marx. R. \V. (19S6). The TIGER system.: automating the gc-ograpbjc structure of the United
States Cens.us. Goi•ernmem Pubhcatio,1$ Rc\'ieh'. 13: 13 1-201 .
Maune. O.F. (2007). D1gual £/enmon Model Technolog,u<md Appl1cnno,,s: The DEl1! User'.f
Manual (2nd ed.). Bethesda: American Society of Pbo1ogr.uumietry and Remote Sensing.
Smith. B.. Sanct·well. 0 . (2003). Accuracy aad ~olution ofsbunlc mdar 1op0graphy mjssioo
data. Geophysical Research Leners, 30: 1-20.
Chapter 7: Digital Data 323
WiJen•.B.O.• Bates, K..M. ( 19.95). The US Fish and Wildlife Service's Nntional Wetlaods
luventory Project. Vegernfio. I lS:153-169.
324 GIS Fundamentals
Study Questfons
7. 1 - What are some advantages and dis.1dvantages of using digital spatial data?
7.2 - Wha1 are lhe 01os1illlportau1 ques1ious you 01us1 ask before using already devel-
oped spa1ial da1a?
7.3 - For each of tl1e following data sets, tell us who produces tl1em, wba1 are !he
source materials. what do lhe dal3 sets coutain. their grain sizes and accuracies. and
bow 1.hey are delivered: NAIP digital pho1ograpbs. NHDplus. digital elevation 1.nod-
els (DEMs), digital Ooodplain data, National Wetlands lnventol)' da1a (N\VJ). TIGER
census da1a. aod national land cover data {NLCD) sets.
7.S - (deu,ify and describe lhe cbarac1eristics of three different sources of digital ele-
vation data. \Vhat are the pros and cons of each source?
7.6 . Visit one of !he websites mentioned in this chap1er. or in !he appendices at 1he
end of this book. and download several data layers of an area of interes1. If you have
access to a GIS, try to import 1bese da1a and display 1hem.
325
'' Cumbcrtand
Fr.antlfn
23005 297.600
23007 JQ,767
3.152
4·,S17
H~0<:k 5$,09 6,089
u
'6 Kennebec
23009
23011 U1.818 2.463
7
-Knox 39.881 ?·95!.
....,
23013
• Oxford
Uncoln
"°" 3S.104 1,813
•
10 PenobSCOl
23017
23019
sa.•n
I.SUlO "J.]10
11
"
Piscm3quis
S.-.a.idllhot
23021
230'3
16,863
36,111 ...
U . l36
" Somc:flC!t
W.ilOO
23025
23027
50,74 7 10.606
3'.&31
"
1S W.:,$hlnaton 23029 31,495
2.209
8.430
16 York 2lOJI 211.185 ,.m
Fig11~ 8- I: 0:1.111 in 3 GIS indudc botb sp;i1ii1l (left) ;ind a1tn1>u1c (right) c:omponmts..
326 GIS Fundamentals
tions. A somewhat standard set of DBMS blank entries in our database. We may have
tools and methods have been developed and thousands oforders with one or ·1wo items.
are provided by many vendors. However. ifwe have one order with 50 dif.
feren1 items~we have to add enough col-
Snidents often struggle with relational uums to accommodate 50 i1em/quami1y
databases al first, and often ask. "Why pairs. even Cor orders \\~th one item. As with
bother? Can't we just use a spreadsheet?"' orders I. 2. 4, 6. 7. and 8 in the table below.
Many more people are familiar with spread- many of the cells will be empty. More data
sheet fomls. programs. and manipulation, adds to storage cosrs aod causes longer pro-
and don't undersr~nd why we bother to cessing times. This flat file stmc1ure is prone
adopt DBMS. A sliort example may help to be borh slow and space hungry.
explain their value.
There is another disadvantage. Note the
Consider ~,e file shown in Figure S-2. two orders from Paul Smitli. This requires
representing business orders. Each row copies of his name and contact infom1a1ion.
records the purchaser. an order number. and This wastes space and makes ed iring more
the items ordered. Spreadsheets typically cumbersome atKI error prone. If Paul Smith
present data Hke this in a single. ,;fla1 file:• changes his phone number. we must search
Because orders may contain multiple items. through every line in our database and
we need mul1iple columns with copiesof the change every instance of an order that con-
item/quantity pair. For example, ordernum. tains Paul Smit.h's phone number.
ber five by Atom Ant includes two items.
two B52s and two CR7s. while order num- F1mctions and programs may be wrinen
ber lhree by Paul Smith has four items. or a to address the inefficient use of space, s.low
total of S columns for items. Larger orders processing. and difficult editing in fiar file
would require additional colwnn~. forut11s. Tbis requires specific kuowledge of
1he file strucn1re. for example the arrange-
This Oat file strucnire is sub-optimal. ment and number of columns. \Vbile these
We either have 10 limit 1he munber of items workaro11nds are possible. they are often
per order (rarely a good thing for a busi- coroplica1ed and require subsramial program
ness). or else DOI kllow bow many columns maintenance. Database management S)":i·
our database might have. which would com- 1ems were developed to overcome 1hese
plicate programi11g and management More redundancies aud inefficiencies. among oth-
importantly.,~ CM easily ha~ mostly ers. While spreadsheets or other flat file for-
1
nome surname Oddress phone # ordu # item qt• irem qly ,tun qly rte,n qly
Leo O..roche, 112 Beol SI 5-1307 1 CR7 I
Rudy Vo!entiru 1 H,sponolo Dr •-2706 2 F15 t
Poul Smnll 99 Upstate ltl 0·0000 3 GTO 3 F15 1 652 I SR71 1
Adam Sm,rh I Woll S1 1-2334 • 626 l
AIOOI Ant 685 Honbor Rd 4·1222 5 652 2 CR7 2
N,n,am Smrill 202 o,nkytO\\n 9-9199 6 Fm 2
Alice Poul 5 Free Sr d-4178 7 SR71 l
Paul Smith 99 Upstare ltl 0-0000 8 FIS I
mats may be 11sed for simple c,ollections of Database Componen1ts and Char-
da1a. DBMSs are bener for mos1 applica- acteristics
tions tba1 process large amounts of data.
The basic components of a trnditiooal
DBMSs s>rovide other advantages. They database are doth items oralfribmes. the
may provide dhin lndepende11ce. a ,:aluable indivisible units of data like name. popula-
characteristic when working with large data tion. or area (Figure 8-3). These ite,ns can be
sets. Data independence allows us 10 make any characteristic used to describe things.
d1anges in the database stmcture in ways Attributes may be simple, for exatnple, oue
that are transparent to any use or program. word or number. or 1hey may be compound,
This means res1n1cruring the database does for example, an address data item that con-
not require a 11ser or programmer ro recode siscs of a house number. a su-eet name. a Ci()'.
ormodify their procedures. and a zip code.
DBMSs may also provide for mulliple
user views. \\tidi different data provided in a
Attnt,,.Jte
different Conn 10 each program or user. A or Item
DBMS may allow centralized conttol and
maintenance ofimportant da1a. One s.1aodard Nome <tPS . Ar¢,O PopOn
= ""
copy oftlte daia may be stored and up<la1ed
on a reguJar, known basis. time stamped or
versioned number to aid in management. A ·--
Vhc)r~
c - .,..,. ..
,,,. "
..... .."
?17(1
and near rapid transit stations fora pros.pee· different resources may be more easily inte·
1ive buyer. Tile applications server may gen· gra1ed. Ifa company decides 10 redesign
erate three different requests - one to 1heir data e111ry in1erface.1hey may do so
idenlify houses in a price range, a second to e.asily if 1he user interface is distinct fromthe
iden1ify good. school distric1s. and a third 10 tiers below. They do 1101 have 10 worry abo111
find rapid mu1si1s1ations. The applications how the a1>plications server or transaction
server may tlten perfomt the operations to manager access lhe databases I0\ .:er down.
1
determine where these important cri1eria are The u11egratio11 or a new database 1echnol·
lllCI. ogy is often easier wi1h multi-tiered ard1itec-
mres.
The uppennos, tier of mul1i-1ieroo archi·
tec:rures is typically a user f111afoce (figure Multiuser access adds Sllbstantial com-
8-5). This lier may be a display by a single- plexity to processing. For example, 1he
purpose or 1opic-specific program such as a server must ensure that whe.n se\'eral copies
G!S. or a Weli>-based interface wi1h 1he pri- of a database are accessed, clmnges to 1be
mary purpose of gathering requests from the database must be reconciled! on resubmis·
user. and presenting infonna1ion back 10 1be sion. If 1he updates from 1wo clien,s con0ic1.
user based on those requesis. such as when one diem dele1es a record
while a second client modifies a value for
Multi-tiered archi1ectures are adop,ed 1he same record. the program muSI resolve
primarily to insulate the user interface from 1he differences: perhaps one user has higher
the processing and data a, lower 1iers. :111d to priority. the most recent cha nges are
allow access to a more diverse range of data enforced. or a message is sent to an opera,or
through the lowest tiers. The parts are easier noting the ambiguity.
10 change when they are isolated. and new.
----
PostgreSOL . ArcServer/SDE i~~ IBM DB2
Database
Q Oracle Spatial
~ Mietosoft
't\MySOL·spa1lal ..:::- SOL Se,ver
Figure S...S: M:Ulti-ciettd arcbitechltc. 11nd common soITT\·11tc 11ltcm:uh•es. DAu1 Ilk st<>ttd :knd ac,cesscd &om
d.i.ttbMct on 1)1e bonom tier, i,nd d"'2 \'iewed m\lle$IS m~e ~I the upper tia. ApPliC':111:ions scn•ers net upon
the rcqucst and tr.mslo.te/procen i.ilfonnation paned between ,•Pf?Cr and bonom ttcr5, (n u·ell-~,·cd 5)'5•
terns tha1 adlicre to defined staudatds. the components lWI)' be tuL"Ccd, e.g.. )'OU mav :1<:c«,& da1a iu ~SOE
from QGIS. or d.11a stored i:u PostGIS from Af'CM11p (ad."lpted from Boui)dlcss). ·
330 GIS Fundamentals
Forests
Fore.st Nome Foresr·ID Location Size
Nontohoto
Choracteristics
I N. C<lrobno 184.447
Tro11Name Feature Olff.:ulty
Cherokee 2 N. Coroh110 92.271
Bryson', Knob v,s:c:1 E.M
FOt"tsts Trolls
'"""'"""'
--·
FJautt 8-7: Forest aud trails data in II tclatioi111J data structure. Rows bold records 11S.soci-
:i.1cd ,vi1b :m cnlit)', aind cohum11 l1old ilcnu. A kc)~here Forest-ID, i, u_~cd 10 joln l~blcs..
data in ways l bat help us more quickly relnrlonsltlp diagrams. also lmown as E-R
search, select, and update our data: and to diagrams. We "ill not describe E-R dia-
isolate our data from tailing programs or grams here. as llley are more appropriate for
changes in com1>t11er hardware. more advanced courses.
Figure S-7 shows a join of our forest and Given 1heir importance.. there are some
trails data in a relational data stmcture, restrictions on keys. For example. null val-
1hrough 1he k.ey Forest-ID in the Forests and ues are no, allowed 10 be pan of a key. There
Trails tables. This shows how keys allow us may be many potential keys. (cohunns that
to break our data up into severnl tables and uniquely identify each row). but typically
reap all the b<'nefi1s described above. while one is chosen for use. called a primal)· k<l)•.
providing a mechanism to link between Most tables in the database will ha\'e a pri-
tables. Note that this linkage of separate mmy key. and keys in a tabl-e are frequently
,ables is usua lly 1ra11SJJ<1ren110 11\e end user. ust<l 10 combine tables. Some keys are used
in that all or subse1s of1he forest and trails 10 inde., and add aexibility in selecting data.
data in Figure 8-7 may be displayed on
screen or printed as one continuous table.
Data from three or more distinc1 tables in a
DBMS are often joined, and columns sub-
sets displayed in what appears 10 be one
table 10 a user.
A database is often represen1ed in a
schemn. A schema is a compact graphical
representation of the conceptual model, lhe
entities. tables. keys. and the relationships
among them. It is often presented in a stan-
dard shorthand notations, usually via elllity-
332 GIS Fundamentals
•
5
d
x
blO<k boo
mauve 11,1y °'dest
older
'6 d
g
black bin
dun hoge young
oh:ler
c) union
ID 'YP< color size og•
I
6
0
g
blue
dun
big
hooe
ol d
your19
IO type cok>r .,. age
union I 0 bfut big old
d
IO rypc color ..,. og•
'
6 g
block I°""
dun
older
huge young
young
2 e green big young 2 c gtt<II bog
4 d black big O!dtt
2
I 0
b
• blut
dun my
bog old
old
join
2
I 0
b
blu<
dun
bog
liny
old
old
b
3 b 3 b ctun Tiny old
• 0 4 0 blue b<g old
Fiaurr 8-& R.elatiQl\nl al_geb~ .is originn.lly ck6oed supported eigl1l OJ>ernlor,:. ~ fU"S'I four, ~ , bown
her<: l"C$trie1. proj"t, uruoo. and join (modified froro C.J. Date, 2004),
334 GIS Fundamentals
Attribute data
Al'C•IO IO.Sf~Ol'G next-ore l"/!l<
7 I 6 2 A
2 I 4 c
3 3 2 5 F
4 • 2 5 0
5 6 7 A
6 5 I F
7 3 6 ;;
Coordinate Data
Arc-ID coordinates pointer to
next arc
I , ,.,, '"''"' ...,ll..,1 , r~,-'43-.~J,-,•.-..-3~....-'-,,'=::)
c, . 1u,io1J102.,3..,..3, r,.,,.,,i,., ~·l""'b
c I , 1,ru.••I ~ll·~ . .#3 Jf ,,.... 4"' ~,1 ~
Fia,u1•t 8-9: A small example ofa hywid database sys1cm for spatial data. Attribute data an: sto,r¢d in a
tt katioillll 1abk. while cootdiiuitc d111a arc, stotcd in II network or 01bc-t sirucnitt.
Chapter 8: Attribute Data and Tables 335
~,>OUtbem.cu~~l~..vucoµ~tll
-
Jldd• v~
.......
...
-
(
_......
•
<•
)(
)(
<
>•
)(
)(
>
I•
)( UU )( •
)( .... )( AHO )(
)( ..
Olt
)( HOTIN )
)( NOT )
t'igu1~ 8· 111>; A q\!¢l}' building oycof the &M often pro\·ided in OlS softwan:. here from Q(HS.
-
Selcct1ou expr«s1011$ may be buil1 in the bottom paud by clickin; oo field,. opcnuors. and ,·alues il'l
!he upper p.'11teli.
336 GIS Fundamentals
FiJ01't 8· 11: Simple selceiioo. ap~lyiug oz.c cri1i:rion to sel«1 records (left) . aod compound sd «.tiou.
applyin3 multiple rcqu.irane:111.i (n$)1t).
Chapter 8: Attribute Data and Tables 337
01iera1ions. l{ere. 1.be square brackets choose copied in a 1ex1 file or olher reposito,y. and
rows with a Lo.nduse vaJue equal to Urban, applied when needed.
and Mill Rote ,,.,lues equal 10 B. Row 5 is 1he
only record sa1isfying 1hese cri1eria. The While dalabase queries are typically
curly brackets select those rows that are not applied 10 iables. we must remember 1hat in
in a City. and wi1h a Densi1y grea1er 1han zoo. a GIS. tbe 1ables are usually connected in
This selects Row 3. and 1he flllill selected set some way 10 geographic feaiures. Seleclions
includes both rows. by the OR operation. of iable elements imply the selection of asso-
Selection opern1ions JllJIY ge1qui1e compli· ciated geographic elements. II is always a
ca1ed. and long. complex selec1ion "sen- good idea to verify that the selection works
tences·· may be saved. tJmt is. the syntax as e., pected when fir.;1 using a sof"vare. Ver-
ification is often easies, by "iewing the
Complex selection:
records with [(Londuse = Urbon) AND (Mill Rote = B)] OR
(NOT(Municip =City) AND (Density > 200))
ID Area Landuse Mur\lcio Densitv M,11 Rote
I 105 Urban City 1.112.2 A
selection results, either on the table, the does not yield the same set of records as the
geography. or both. Figure 8-14 illusirates expression
the results from three separate selection cri-
teria: a) that county population be greater (NOT (Londu,e • \lft>on)J ANO
than 50 persons per square mile. b) !hat the (NOT (Mu,,;e1po11,y. coun,y)) (8.4)
me<lian age be less than 40 years. and c) 1ha1
housing \'acancy rates be greater than 10%.
The rigbunos1 p0nel shows counties rerurned Parentheses or other delimiters should be
from a query specifying that criteria a and b used 10 ensure tUla.mbiguous queries.
and c. all be met irhe accuracy of the query Relational databases may support a
may be quickly verified by inspecting maps st111c111red query /n11gunge knO\Vll as SQL
of the componcm and final selcc1ions. (pronotmced both "sequel" and "'ess k-you
Figure 8-15 demonstrates 1ha1 queries el"). SQL was initially developed by the
are 001 generally dis1ribu1ive, For example. Imema1iooal Business Machines Corpora-
if OPI and OP2 are operations. such as AND tion. bm is supported by a number of soft·
ware vendors. SQL is a nonprocedural query
or NOT. then.
language in 1ha11be specifica1ioo of queries
does no< depend on the s1mcrure of1he da1a.
Ol'l (ConditionA OP2 CondmonB) (al) The language can be powerfuJ, general, and
transferable across systems. and so bas
is not always the :same as become widely adop1ed.
SQL allows 11s lO both defin.e and
(OPI ConditionA)OP2(0Pl ComfmonS) (82) manipula1e data. Oata types and tables con-
taining variables of a given 1~ may be
For example, specified. Standard operations are used 10
manipulate data, for example, 10 select,
delete, insert, and update records in a data ..
NOT ((Londus.e • Urban) ANO
(Munid pollty = County)) (8.3)
base. Long or complicated quer,es may be
saved in text files. or as scripts. that iuay be
debugged. modified. or used later. These
scripts may be quite long. and n,ay some-
,o,.
gr-eo·~r lt'Ol'l
FJaui-e 8~14: Compoocnt and composite sckctiou criteria. applied to c°'intics io the oordtcMtenl U1Utcd
St11tcs. A ,'UU2l check of aittcmntt <>fll1c COlUj)Oiitc qun y (d) t1pi1u t 1ubcompoi1aus (:t·c) it oftnt bdpful.
«peci..,Uy when lwniing,.
Chapter 8: Attribute Data and Tables 339
8· 15: Sc1cctioo operations may 001 be di,tributc;d, and tbc o«krof appli~tion is ,·cry impom.01.
J,"ia:ui-t
\Vhco the NOT o~tion is applied afttt the AND (left:). a diffcttnl set of records is sc:J«1C'd than whco
die NOT opcr:uiou is ;11pplicd before the ANO (righ1 s.ick). Or<kr of opcmtion i• inlp<nttu11, and :tmbigu.ity
s.bould be rctJ,o,~cl by using p;ircnthcsc$ or othcrdelimitc".
times be accurately described as programs. operations are not easily represented. How-
given 1heir complexity and capabilities. SQL ever SQL is adap1able. so various SQL
authoring utilities have be-created to help extensions appropriate for spatial data have
wril'e, rest. and automate these scripts. been developed and are wid.ely used.
Because SQL wasn'I ini1ially designed
for spatial data processing. inany spatial
340 GIS Fundamentals
Joining Ta bles
Relatfonal da'labases are so powerful in 1able to tl1e 1arge11able, and "copyiug" tl1e
part because we can structure out data to values from the ma1cbed rows 10 the ou1pu1
reduce duplication. ease maintenance. aud 1able. The "alues aren'1 truly copied. but
give flexibility: much of this flexibility is rather associated and displayed 1oge1her. to
because we can join .ables. Ajoin. also sa"e space and speed opera1iou.
known as a relate.. uses columns in one table
to match rows across tables. Joins were illus-
Primary Keys and Joins
trated in Figure 8· 7 and Figure 8-8, part d.
but additional examples are warranted.
We ·ve noted earlier tbal kevs are cmcial
Joins are based on join i1ems. In their 10 relational databases. and as ;uch they
simples• form. a single column in one 1able have ct.nain special cbarac1eris1ics. There
is matched to a colwnn in another table, and are se"eral kinds of keys. most illtportant 1he
a new 1able displa.yed by combining rows for priuuuy key, a chosen item or items that
matched "alues. Figure S· l 6 shows a sunple uniquely ide111ify each row in a table. We
join betwe.en two tables. Here. Code..A and often create and generate unique numbers as
Code..B. respec1ively. are used. If we call keys.. for example. a social security number
Tobie Athe ',arget table:· and Tobie B the is unique to a person; a parts ID number a.re
..source table,·· then our join consisls of often unique to an objeci: SKU munbers are
matching 1be values for a row in the source unique to items in a store. JO_A can serve as
a primary key for Table Ain Figure 8-16
I Fr 1001 x Ag x l
2 Cm 1001 x Cm y 6
3 Ag 9999 z Fr z 8
4 Fr 1001 x Fp q 10
5 Fr 9999 y
6 Ag 1001 x
I Fr 1001 x Ag I
2 Cm 1001 x Ag I
3 Ag 99<J9 z Fr 8
• Fr 1001 x
y
Ag l
5 Fr 9999 Cm 6
6 Ag 1001 x Ag I
Fia;urt 8-16: Thi~. figure illu~tr.ncs a simple join. Tobit 8 is joined to Toble A. matching the ,Codt_B ,,.1.
uc:s 10 com-spood..iu3 Code_A ,·alucs, 10 create Out Tobk .
Chapter 8: Attribute Data and Tables 341
because the values w1iquely identify the Step 2 in Figure 8-17 shows a similar
rows. Table B in Figure 8- 16 bas three col· match for the join variable with a value ofY.
umns that could serve as primary keys. Corresponding rows are matched from Toble
Type_B. Code..B. or Size..$. These are all B to Tobie A. and written to Out Tobie. There
candidate ke,•s. because each could serve as is only one matching row in this case. Step 3
a prins,ry key. One of the candidate keys is shows a match of the join variable Z value.
anointed the J!)rimary key. Again, there is a cross-table matching, creat-
ing new rows in the Ovt Tobie. Since there
Figure S-17 illustrates the join concept. are no q values in Tobte A. i11 this panicular
A join matches corresponding rows by key version of a join. there.are no rows written in
values. In Step I. Codo..B. X values in Tobie the output table.
8 are matched 10 the COdfLA. X values in
Table A The first row in Out Table is acom· Note tha1 this example is for illustmlion.
posite or row I from Table A (IO.J\•l. bm dataoose systems usually don' t follow a
Type_A,Fr, Botch.J\•1001) and row I from sequeniial process as shown. nor write a new
TObie B (Type_B·Ag. Sizo..B·I). A similar table. Sorts and indexing are often used to
1na1chu1g is perfonned for each x in Tobie B. speed matches. and niatched dam are shown
and row written in the Out Toble. in what appears as a single table. but simply
Step
I
2
3
Cm
...
fr
....
1001
!CO!
x
z
)(
..
Cm
x
y
I
6
2
4
""'
Fr
1001
1001
x
x ••
I
I
'5 • ~ • ••
Fr 9999 y F, z
6 ... 1001 )( ro q tO 1001 x •• I
I fr 1001 x I F,
c..
1001 x
x
.
., I
Step
z Cm
3 ...,, ....
1001 x
z •• x I
l 1001
.,
I
2
'
5 r,
!COi
•999
x
y s Cm
r,
Fp
y
z
•
•e • •r
• ..
5 ....
Fr
1001 x
y
x
Cm
•
I
I
6 All 1001 x 10
-
1001
"'
Fr x .,
x I 1001
., 1
. .• ....
I fr 1001
z x
2 Cm
...rr
1001 x
x l ""' 1001
z F,
I
Step 3 0999 z I
'
3 4
5 Fr
1001
9999
x
y
Cm
rr
y
z
•
• 5
• ••
Fr
F< ....
1001 x A;
y
x ., •
Cm
t
6
•• 1001 x "" q 10 1001 t
Flgurt· 8. 17: .A c0t1ccp1w.Jiiation of how rows :u-c corubincd in n. join process.. Join i1cms 1\fc ,c:quc1uially
matched using Cocle_B from the '"S0\1Nc" 1ablc (Table 8) lo COJTC$poading vafoq for CoidLA in the "tir·
get" t1blc (Tobit A),
342 GIS Fundamentals
displayed from their originaJ tables: no new or cross joins, in which all rows in one lab le
table is wrinen umless expressly requested. are combined with all rows of another table,
for example. a crossjoin of Tables A and Bin
Also note that non-matching elements Figure 8- 16 would result in a table ,~th 24
are discarded in this example. We may spec- rows (6 rows for A times 4 rows for B).
ify a joiu lhal saves some or all of the non-
matching elements, and we should be aware Mas1eri11g the differences between these
of these different join variants. types ofjoins is perhaps a bit advanced for
an introductory GIS course. bul software
The primary key. or an item that could may set any of these different types ofjoins
serve as a prim,11)' key. is usually used as a as a default method. and they mny nm be
join item in the ••source·· table of a join oper· explicitly identified by name. These and
ation. This is illustrated in the Join in Figure other differenl types ofjoins are covered in
8-16 ru1d Figure 8-l 7. Co<fe...B in tl1e source depth in most introductory database books.
Tobie Buniquely identifies each row in that but can be confusing 10 distinguish and
table. and is used 10 link to Table A ,~a apply without some practice. We introduce
CO<fe.J\. As described later. when items tha1 them here 10:
are not candidate keys for 1he source table • wam you of the differences between
are used in joins. you often ge1 ambiguous or
e1TOneous joins. different types ofjoins. and to empha·
size that different types ofjoins will
Fig,ire 8-16 and Figure 8-17 iUustrate 1151,alJy produce different results. even
the most common type ofjoin. k_nown as an whe-n applied to 1be same data. and
im,e1'j<>i11, where unmatched rows are dis· • 10 stress that there are standardized
carded. AU alternative outer join saves the
names for different types of joins.
infonnation for non-matching rows. placing
blrulk or nnll valu.es for missing items (Fig- although not all GIS softwares use
ure 8-18). There are both left- and right· them. You should verify how joins
outer joins. depending on whe1her 1he target work when first using new software.
or source nonmatches are in the target or
source tables. There is also a naturaljoiu. in Figure s.1 8 illustrates a difference
which equally uarued columns aren't copied. between inner and omer joias. The center
Join
by
1001 .
I f'r 1001
Type
Ag x .....2I
Cm
Fr
- -
1001 y
2 Cm 1001 Cm y or it may . 3 Ag 999 x
Ag Sp w yield Fr 1001 .
3 9999 4
4 Fr 1001 5 Fr 999, .
5 Fr 9999 6 Ag 1001 x
6 Ag 1001
~
o)
2 Cm 1001 y
3 Ag 999< x
6 Ag 1001 x
Flgm'f 8. 18: Examples of inner (;11) t11td omcr (b) join&. No1c 1b:u :.n iilncr join only ilwc, minc&i11* rows,
while an o.11cr join~"" ,111h.Ks forbo1b matcbing and urunatclllllg rows..
Chapter 8: Attribute Data and Tables 343
-
Hunter Tobie ?110 M.:lleogris 90 IOPQ'o'O <m103
101211'<1
'"""'
,,..
F"'r
Norn,
Mo'1<
H~••0,
10
9110 I.,.. / /
190Z CtMn CMOditnSIS
1010 Sylv!IOC)US rol);.tSTUS
9/26/02
.C/!/89
w.op,,, '1')2l0
Bugos 8"y obt2m
1010 Atl09 Nbl'Clts 9/29/l9 l>afly O\Jck 9re.01
Roo~II Tedoy 1902''~
fudd Elm<r 1010 "- 91JO ~ ( U S virglf'IIOnul 8121/U ll<>""" ....s1
Covote
t.oPtoir ...,..,,,., •ttOI~\
wnee
0000
1902 Phcscotorc1o.s c•n~ua 8115/0S Tdy Ster Cz.a1·
9110 Rong1fcr 10,onctu, IZ/2,V1.l n0,m3
""""'' .....
Prunory '~ey,
Source Tobie
'i
\
0001 Callus 9?Mtu,
4110 AnTiloc(lpt'O cmtnc:.cno
ona111 81\J Bird
10191$7 Pro.,,._
,110 Ccococc;yx ,01•or.-t011U1 'tlll~ Rd Rla'WIU Fnlly
polygon. We mighl also have yel another ins.lances ofY in ld3, but they unambigu-
1able with his1orical population da1a for each ously match witlt 1be one value ofYin Id4.
zip code. Zip codes in 1bese 1ables may serve
as foreign keys ilt joins to the polygon table. We ofleu nm into problems when we
anempt joins with items 1ha1 have a many-
Most joins should involve a one-lo-one lo-one or a many-to-many relalionship.
or a one·tO-ruany re1ationshjp between the These are often considered ill-ma1ching
source and 1he target join items. This is why keys. in 1bat results from a join can be indc-
we usually use a primary key in our source 1erminate - you can't predict the results in
table as the c.olumu for a joi.n, or we use can· advance. or they may change due to spurious
didate key as a join column. This avoids a factors. such as pseudo random effects of
many-10-one relavionship between source row ordering. Since you're oftt>JJ not sure of
and target colwm:1s. which often causes the results you'll get, many·tO-o:ne or many·
problems. The following parngraphs illus- to-many rela1ionships from tile source 10 tar-
trn1e these problems. get keys are rarely a good idea. ily requ~ing
the source item in a join to be a primary key
A one-10-one relationship means that or candidate key. Avoid these erroneous
there may be one and only one instance in a joins. We use a colwnn or set of columns
join item of a 1arget table 1hat matches one that uniquely identifies the rows. of the
and only one instance of a join item in a source table. so that there is a one-1<>-0ne or
source table. The left side of Figure 8-20 one-to-many relationship in our join.
illustrates a one-to-one match for the items
Id! and ld2. Each value of ld2 matches only Figure 8-21 shows an e.,ample ofan
one value of !di. NOie Ihat not all values of inappropriate join. The item Typ,t. in Source
Id! have a match in ld2. Tobie is 001a key. and thisresulu in a many-
to-many joilt There are ~vo rows in the
Tables may also be unambiguously Source Tobie with a Type value of Fr. Both
joined if 1here is a one-to-many relaiionsbip rows may fairly match the Fr key values
between die source join item and tlle target found in the Target Tobie. resulting in au
table join item. The join on the right side of ambiguous assignment for the values of
Figure 8-20 shows a oue-10,maoy relation- Code for those rows. Bolh Vand N are
ship between the source item ld4, and the equally supponed. hence our resulls are
targe1 item IdJ. Note there are three unc.enain. as shown in 1he Outp~tt Tobie.
1 0
--
One-to-one, inner Join
ldl k l }cf2
'l
$VU\
(I,
Restilt
Id
2
...
• &
m
--
One·to-mony, mner jOCO
lc3 lot
y
Idl s,
Result
Id k>c s7
wo
I
y
2 b y J
'd
y ""to 4
3
' 51 • ' • z z •• 7
d 2 6 5 c x pr x pr
'~ " 9
6
•r v or y pr 4
y M y nn •
x rl x 11 9
z
""' z 7
Fia,urt 8·20: Au c:umplc of a one-to-one and onc·to-1nany rd.a1ioosb.ips b<tw«n 11.bles.
""'
Chapter 8: Attribute Data and Tables 345
Jo.n
by
l,P<
F,
I r, 1001 VO<N
I 1001 Fr v
2 Gm 1001 v
2 Gm 1001 Gm v ] Ag 9999 x
3 Ag m9 F,
F, 1001
N
• Fr 1001 VO!'N
's Fr m9
}JJ x s F, 9999 vorN
6 Ag 1001 x
6 Ag 10-01
ritutt 8--l l: An cxrunplc of a inaoy-to-many table join. whieb should be a,-oidcd. bcc.a\as.e it may lead to
unp«dictablc rt:sults.
• ,
Vtm'tOl'II STATE COUNTY -o-J-cr:IPS PO?(k)
store FIPS - so CDn,ic,c.1icu1 Forti~ld aa:! s
w-
CoM«:tk:u· 9 8572
'""""""' 9
3
Ntv.• Momps'lff"t
r.<1re FtPS • 33
CoMectlc.ut
Massc!c.llusc:,,J
Mcs~CfK.lsc·t,
Borns!Cble
Bcrltshlrt
9
25
25
",
J
1091
U22
13'1 9
.... ....••
New Htlmp.s.m Su!1IYOI\ JJ 19 '-0.5
~l'IOlk h\of\d Bristol l 50.6
Rholk I$·<11\d 3 1671
..
Connecticul
stale FIPS • 9
Rtiode is'Gl'ld
Vinnonl
Vf,'fflOfll
Vtrmont
·-
WashlngrOl'I
il<Mw,11..
"'""""
so
so
so
50
"
9
'
J
123~
359
369
574 I
.
Flgurt- 8. 22: A c<n11C:llcr121cd kc)', here ll1c cotubi1lcd sl:o"ltc RPS (sFIPS) ru.d C"()W'lt)' FlPS cadu (cFIPS).
346 GIS Fundamentals
• ,
Vtm'tOl'II STATE COUNTY -o-J-cr:IPS PO?(k)
store FIPS - so CDn,ic,c.1icu1 Forti~ld aa:! s
w-
CoM«:tk:u· 9 8572
'""""""' 9
3
Ntv.• Momps'lff"t
r.<1re FtPS • 33
CoMectlc.ut
Massc!c.llusc:,,J
Mcs~CfK.lsc·t,
Borns!Cble
Bcrltshlrt
9
25
25
",
J
1091
U22
13'1 9
.... ....••
New Htlmp.s.m Su!1IYOI\ JJ 19 '-0.5
~l'IOlk h\of\d Bristol l 50.6
Rholk I$·<11\d 3 1671
..
Connecticul
stale FIPS • 9
Rtiode is'Gl'ld
Vinnonl
Vf,'fflOfll
Vtrmont
·-
WashlngrOl'I
il<Mw,11..
"'""""
so
so
so
50
"
9
'
J
123~
359
369
574 I
.
Flgurt- 8. 22: A c<n11C:llcr121cd kc)', here ll1c cotubi1lcd sl:o"ltc RPS (sFIPS) ru.d C"()W'lt)' FlPS cadu (cFIPS).
Chapter 8: Attribute Data and Tables 347
appear only once in the Olllput Tobie). keys - C,y in the Coun1y Toble Wliquely
although the "copief' art 1ift tn diSplaytd. idtntifies each county. Md Dis!JD in the Dis-
trict Tobie uniquely identities each district.
Note that the joins are one-to-many in As noted before. source items in joins are
both cases: one and only o·ne value in the often keys. W>iquely identifying the rows in
sourc.e columns may match many row values the source tab.le.
in the target colwnns. Also note that each of
tlte join items in the source tables are
Counry
TC!)'e.
""''"
w.....
c~
/ School Tobie
~hf
""""
~
"" IV?<
~
l)nlll)
,._
Oistr.C-1 Tobie
-
I lli'11D ~ r ~ m
'
12
""""" """" • ..... .,. 113 O M:Ctt
I .c 624 ~ F'IQv")S
"
JJ
e«:n1t'Ol u,1tes
s..,,.ec...,,,
McJI llJ
12 ~U\on, 2
""" •s
.....
·-
Ou1pu1 Tobie
Sci>• I,,... Cty l)s,(0
~tt:nl:Stnt
"""" •z•
II
"
ee,,1tn101
........
I ... ?-.a:er
... S<,r
R F19;m
l:t F~in,
11 CenTtol l!:ktt 113
' Mdl
"
scrro~s
_.._
..........
I
2
2
,.,.,
..,., .,
lll
9$
''"'
Pok I
M.-
I) Mc¢ee
,.......,,,.,
Flcutt S..23: An c:wnpk of a mulli·tablt joift. based oo di.ff'Cl'C'Ut k:eyi. Hett. County and Oistrid tables
MT joined ro 1bc School Table, to cn:.alc: 110 Outpu1 Table.
348 GIS Fundamentals
Keys and Functional Dependen- would require a s!arch of all records for sev-
cies eral columns in tbe table. This storage
redundancy is costly both b<?cause it takes up
The previoos sections should poim out disk space and ba:ause each extrn record
the need to care.fully stmcture otr tables in a adds to the searcl:. and access ti.mes. A sec-
relational database. and that keys in tables ond problem comes with changes in the data.
are especially imponant. Poorly designed For example. if Devlin. Yamane. and
tables can suffer from serious problems in Premovic sell the parcel tliey jointly own
perfonnance, consistency. redundancy. and (first data row). deleting the parcel record
maitueuam:e. Oa1a 1ha1are stored in large for 0,tvtin would purge rbe database of her
tables may be redundant or with wasted address and tax p,yment hiStory. lfthese
space. and long searches may be needed 10 data 011 Devlin were n.,quired later. they
select a small set of records. Updates on would have to be reentered from an e,1eroal
large tables 110,y be slow. and the deletion of source.
a record may result iu the unintended dele-
tion of valuable data from the database. We may place rela1ional databases in
Smaller. carefully constructed tables are usu- nonnal[onus 10 ,void many of these prob-
ally more useful. lems. Data are suucmred in sequentially
higher norm..11 fom1s to improve correctness.
Consider lhe data in Figure S-24. in consistency. simplicity. nonredundancy. and
which land records are stored in asingle Stability. There are several levels in the hier-
table. Attribmes il}(. IU(lc Pu, (.XI .. JD. AJV~··· archy of nom:i.."J forms. but the first 1hree
man. Tship-10. Tship_nome., Tho I-odd. Own- levels. known as the first tl,roogh third nor·
10, Own_nome. and Own_odd. Sc,me infor- mal fonns. are most common. Data are usu-
mation is Stored redundantly. forexample. ally structured sequentially. that is. first all
changing the Atdermon for Tship-lD 12 tables are convened to first nomial forms,
would require changing many rows: identi- then convened to second and then third nor-
fying all parcels with Yomone as an owner mal fonns as needed. Prior 10 describing
-·-
Wtt'·IO Aldumcn
2)0)
f-ll)
"
,,,..,.,.....
·~· n,cll-Olld
!SW
O,o,n,JO
122°"""""' -~ -
°""-""'
11.l
- .,, ""'u
,:tr,1!
- c..., ........\
618
9413
l)tS,,,,
"
lZ '"'
151'1 33/ v......... 72.Jolu, ;
..........···"
{ .........-··-·"'"'''''''''"'"'-·····-·····..·······--···-..·..-···-···................................
O,,,.JO
,_ --- 0v."l·I0 C.,u,dd
.............
......
O.,w,l<I
190 """"""'
Prest~ 12...cla.,tOl'I
'" Yc:m::rt- 72 JOfln
12-.C.IOyf'On ........,.
890
""""" "' '"""""
f'i.&m'T' 8-24: Land records 00.ta in ui.nonoalizcd fonn. The table is: sho,m in two par1, b,c1:;1u-1c it is: too
w1<k to fit across I.be page.
Chapter 8: Altribule Dala and Tables 349
normal fomis ,ve musl introduce some ter· We also see tbat Nome is functionally
minology and properties of rela1ional iables. dependent oo CNum. lfwe know a value for
CNum. say. 2. we know the value of Nome
As no1ed earlier. rela1ional 1ables use \\rjll equal Ext. We see thal the converse is
keys 10 connect da1a. Any column(s) 1ha1 also true here. CNum is also functionally
uniquely defo1e(s) lhe rows of a lable is dependent on Nome. No1e 1bai this is 001
called a candida1e key. The primary key for always tme. as shown for Ctype and Thread.
indexing a 1ali>le is chosen from me se1 of
c.andidate keys. using a single.column key if CType. - > Threod is tme,
possible. We 'typically manage lhe chosen
key s11(b 1ba1 i1 will always be vaLid as a pri- bu1
mary key for the table. The Porcet-10 isa
primary key for 1he 1able in Figure 8-24. Threod -> CType is not 'true
uniquely iden1ifying each row in the table.
Why? Because for the value of Thread
Fu11ctio11.al dependency is another equal to 14, CType may be eilher Eor Er. vio·
imponam concept A1tribu1es are func1ion- la1ing our definition of f1ul(tional depen-
ally dependenl if at a given poinl in time, dence.
each value of 1be dependem anrib1ue is
In our example in Figure 8-24. we may
determined by a value of ano1ber attribute.
know lhal O,,n...odd is func1ionally depen-
Figure 8-25 illusmlles 1be concept of dent on Own_nome. In otber words. eacb
ftmctional dependency. The table conlains a owner can only have one resident address.
pans lis1. wilh ID as the primary key, and a e.g.. we may not allow 1he enuy of a second
par1 Nome. CNum. Ctype. Thread. and Angle resident address. Therefore, for a given
at1.ribu1es. 111.i ID is unique for each row, and Own_name. for example. Pnestovie, !he
so by definition, all other items are function· Own_odd is determined. In a similar way.
ally dependeu1 on 10. lfwe know the value there is only one TO\mship name,
of IO is 1. the:n we know 1he pan Nome is Tship_nome. for each Town Hall address.
Tec. We denote this as show1L Tholl·odd. or
IO-> Name Ow,v,ome · > Own..odcll
Tship_nome ·> Tholl·add
Remember, these indicate that Own_add is The First and Second Normal
functionally dependent on OMu>ome. and Forms
Thall-odd is l\1nctionally dependent on
Tship__nome. We must always bear in mind We begin creating 1ables in 11ormal
that this function.,J dependency is something fom,s by firs! gathering all our clata, often in
we enforce. Unless we place safeguards a single 1able. Nonna! fonns 1ypically result
during data enlry and manipulation. we may in many compact. linked tables. so it is quite
change data so that we ··break.. the func- common to splil tables as the database is
tional dependency. for example. by adding a 11on11a/i:ed, or placed iu nonnaJ rorms. Af1er
second owner address for an owner name. nonnalization. the tables have an indexing
system 1hat speeds searches and isolates ,ial-
Fmwtional dependencies are 1ransitive. ues for updating.
so if A-> B, and 6 -> C, then A-> c. This
notation means that ifB is functionally Tables with repeal grouping.s. as in the
dependent on A. and C is functionally depen- table at the top of Figure 8-26. are 11111101·-
dent on B. then C is functionally dependent mali:ed. A repeating group exists in a rela-
ODA. tional table when an attribute is allowed to
have more 1ban one value represented within
While relational database designs are a row. Owner-IO repeats itself for dwellings
flexible. the use of keys and l\tnctional witb multiple owners.
dependencies places restrictions on rela-
tional tables: A table ;s iu firs, ,,ormnlforw 11'1ieu
•There cannot be repeated records. that there are no repeat colunms. The Land
is, there can l>e oo two or more rows Records table at the bottom of Figure 8-26
where au attributes are equal. bas been uon11ali:ed by placing each owner
in10 a sepa1ate row. This ls a table lt1 the first
·There must be a primary key in a table. nonnal form (I NF) because each colunu,
This key allows each record to be appears only once in the table definition. A
uniquely idetrtified. t NF is the most basic level of table normal-
·No member of a column that forms ization. However. the INF table strUcntre
pan of the primary key can have a null still suffers from excessive storage redun-
value. This would allow multiple dancy. inefficient searches. and potential
loss of data 011 updating. First nonnal forms
records which could not be uniquely have an advantage over unnonnalized rabies
identified by lbe primary key. because queries are easier to code and imple-
ment. Tables in I NF are us11ally convened to
higher-order normal forms. usually 10 at
least third nom1.1l form, 3NF, but il is useful
10 understand second 11ormal fom,s before
describing 3NF 1ables.
A table ;sin second 110,111nl fonu (]NF)
ifi1 is itt fkst 1,annnlfonn nud evm,, ,1011-key
our;bwe irjimcriona/61dependem ou~,, on
the p1ima,y k~~ or on transitivefimctfonol
tlepe111/~1cies 0/1/,e primary key. Remember
that functional dependency meaus that
knowing the value for one anrib'Ute of a
record at11oma1ica.lly specifies the value for
the func.tionally dependent attribute. The
non-key a1trib111es may be direccly depen-
Chapter 8: Attribute Data and Tables 351
o...n-10
>37
890
.
0.,,u,omc
Y<lma""
°""-"""
72.Jotus
Prurov;c: 12-.ClayTOtl
. .
o,,rHO
890
231
.
-·=
"""""""
Sl\<tmOn
.
o...n.....odd
tl..eleyfcn
...,.,..,.....
OeSllvo 35E <53.)-qb-d
618
618 O<S>lw "14 Cron• 35€ 890 '"'"""
PrurOYie 14G1ay1on
0cs0v, G....,, 35E 231 s-....
618
q,413 Johnson "12 S,ch lSW 337 Yomciii:t 72..loM
f i111rt' 8°?6:Rc1!1tion11l 1nbtcs in U1monniliied (top) mld fil'll nonnil fo11Us (bo1101n).
den1on !he primary key 1hrough some func- eel changes hands and 1he seller does not
1ional dependency, or !hey may be own anotl1er parcel represen1ed in !he table.
depeudem 1hltougb a transitive dependency. lbeu informa1ion on 1be seller is lost.
The Lond Records table in I NF at lbe bottom
of Figure 8-26 has only one possible primary Some ofihese disadvantages can be
key. 1be composi1e of Porcet-lD and Dwn-lD. removed by converting the firs1 nom,al fonu
No Olber combination uniquely identifies table to a group of second nonnal fonu
e.ach row. However. this table is not in sec· ,ables. To crea1e second nonnal form tables,
ond normal fonn because it has non-key we make every noo·key attribute fully
attribmes tba; are not fuoctionaUy dependeo1 dependent on a primary key ln the new
only on the primary key attributes. For cables. Noie 1ha1 u,e I NF table will often oo
example. Tsll ip_nome and Thollodd are split in10 two or more t.ables when conven·
f,UJ<lionally dependen1 on Tship-lD. ing to 2NF, and each ne.w 1a'ble will ha\'e its
own key. Any non-key anributes in 1he new
The Land Records 1able a1 the bonom of tables will be dependen1 on 1he primary
Figme 8-26 is repealed at the top of Figure keys. Thebonom of Figure 8-27 shows our
8-27. This iable exhibiis !he primary dis ad· Land Records convened 10 second normal
vamages of the first nom1al fonn. Parcel-ID. form. Each of !he tbree tables in second nor-
Aldermon, an,d Tship·IO are duplicated when mal fonn isolates an observed functional
there are multiple owners of a parcel, caus- dependency, so each cable and dependency
ing burdensome data redundancy. Each 1ime will be described in mnl
these records are updated. for example when
a new Afdennon is elec-1ed. data must be How do we sys1ema1ically apply this
changed for each duplica1e record. lf a par- 2NF criterion. that the non-key anributes be
352 GIS Fundamentals
func1ionally depe:ndent only on the primary already in INF by definilion. be-cause all
key. directly or through a transitive fuoc- non-key anribmes will depend on the pri-
tiooal dependency? We must I) specify the mary key. However. if our primary key is
primary key. 2) ide.ntify the main functional more than one column, we may .have further
dependencies. and 3) project 1he I NF table work 10 coover110 2NF. focusing on depen-
across the key and dependency cohuuns. dence on the componrois ofthe primary key.
First. we m11s1 identify the primary key. Our second step is 10 identify the func-
IJt our example here. the simplest primary tional dependencies. We know that parcels
key is the (conca1ena1ed) key that is 1be com- occur in o,~y one 1owuship, and that each
binaiion of Porcel-10 and Owner-JO. If our township bas a unique Tship.10. a unique
primary key is a single item. 1.hen the table is Tship_no.me. a unique TholLodd~and one
JohMM
""""'° "
r~-ttt
12
12
,,
12
,....,..,......
8'-<h
Brd•
6'-<h
lt\dll add
!SW
I~\'/
l5'//
Ow!l-lD
122
337
,,,
890
o.,,.,,..,,.
°"""
Yom<ln<
lffstO'Jk.
°""-"""
123..p,,o
·~
72.Jotu:t
ti._doyron
-
618 OcSM Omni l>E Suon,,
o.s..... Cr<1,it )SE 890 12_doy1on
618
" Prfl'.$1(1'.~
618
903
Jo-·
DcS•Jt.1)
"
12
Cron•
11,n;h
J5E
1,w
23!
33, y"""""'
°"-""'"'""
72J<,t"'
Alderman.. Th.is means that if we have identi- dependencies. These par1iaJ functio nal
fied a parcel by its Porcel-ID. 1he Alderman. dependencies are represented in the tables
Tship-lD. Tship..11ome. and TholLodd are Land Records I and l ond Records 2 in Fig-
known. We assign a unique. identifier 10 each ure 8-27.
parcel of land. and lhe Alderman.
Tsh1p_nome. :and TholLodd are all dependem Fina Uy. nole thal we need lo tie lhe own-
on tl1is identifier. If we know the parcel iden- ers to lhe parcels. These relationships are
lifier~we know these remaiofog values. This prcsemed in the !able Lond Records 3 in Fig-
is the defini1ion of l\m<tional dependency. ure 8-27. Note 1hat some parcels are jointly
We represent these ftmctional dependencies owned, and so 1here are muttiple owner IDs
by: for some parcels.
Poree.I-IO - > Aldennon The three tables Lond Records I through
3 satisfy the conditions of a second nonual
Parcel-ID -> Tship-lD form. Second normal fonu eliminates some
Porcel-ID • > Tship..nome of the redundancies associat:ed with the INF.
Note !hat the redundancy in storing the
Porce.1-10 - > Tho!Lodd informalion on Alderman, Tship-ID.
Tship..11ame. and Tha!Ladd t,a,~ been sig-
These fu11c1iona I dependencies are
nificamly reduced. and the minor redun-
incorporated in a new table named U:md
dancy in Ow,\Jlome has also been removed.
Records l in Figure 8-27. Editing the tables becomes easier: for exam-
Second, oote Lha1 once. Own-IOis speci- ple. changes in Alderman email modifying
fied. lhe Own..name and Own_add are deter- fewer records. Finally. delet"ioo of a parcel
mined. Each owner has a t1nique identifier does not have lhe side effect of deleting lhe
and only one name (aliases nol allowed). infomiation on 1lle owner. o,m-ID.
Also. each owner has only one permanelll Own_nome. and Own..odd
home address. Own...nome and Own...o<ld are
fm1c1ionally dependent on Own-IO. The The Third Normal Form
f1mc1ional dependencies are:
Own-IO -> Own_nome The 2NF still contains problems,
although they are small compared to a table
Own-ID-> Own...add in INF. They can still suffer from transitive
The Porcel-10 and Own-IO are called func1ionaJ dependencies. Ifa transitive func·
partial fimc1ional dependencies, bec~1use. tional dependency exists in 3 table. tlten
while bolh ar,e depeude111 on 1he primary there is a chain ofdependencies. A transitive
key. 1hey area ·1 dependenl on each other. If I dependency occurs in our example table
have a unique Por(el-10 . I know additional named Land Records 1 (Figure 8-27). Note
infom1a1ion about some of the coluuu1s for tba1 Parcel-IO specifies Tship-10. and Tship-
any row in th~ table. bm not all or !he col- ID specifies Tsl>ip_oame, ThalLadd and
mnns. If I know 1he Own-ID. I also know the Akfenmon. Io our notation of functional
values of a set of colwnns, but again, not all. dependencies:
When we have a concatenated key. we mus, Parcel-JD - > Tship-10
identify 1.hese in our data, and they guide us
in how 10 fur tl1er split our table. and
How do we get to 2NF? By projecting Tship-JD -> Tship..11ome, TholLadd,
the INF table across tlie primary key and Akfermon
f,mcrional depe.11dencies. Remember. proj-
ect is just a way of saying we subset the col- This caus~ a problem when we delete a
umns. here guided by the functional parcel from the database. To delete a J)Orcel
354 GIS Fundamentals
we remove the parcel from tables Lond However, the table Lond Records l in
Records 1 and Lend Reco.-ds 3. In so doing. Figure 8-27 is 001 in 3NF because the func-
we might also lose the relationship among tional dependencies for table Land Recorcls 1
Tship-IO. Tship.J')ome., ThOILOdd. and Akter- are:
mon. To avoid these problems we need to
convert the tables 10 the tl1ird nonnal fonn. Porcel-lD . > Tship-ID
Tship-10 ·> TshipJlome. Thollodd.
A table ;s iu the thil'rf 11onnnlfonn (]NF) Alderman
ffmul ou/y iffor t!n?1}'/uuctio11al depen-
dency A -> B. A ;s a super keJ~ or 8 is a mem- Tship-ID is not a super key for the table
ber ofa candidate key. This requirement (it does 1101 uniquely identify the row). nor
means we must identify transitive functional are Tship_nome aod ThotLodd members of a
dependencies and, remove them, typically by primaty candidate key for that 1able. Remov-
splining the table that contains them. The ing the 1ransi1i\>e fiinc1ional dependency by
tables Lend Records 2 and Land Records 3 splining the table will create two new tables.
in Figure 8-27 are already in the 3NF. each of which satisfies the criteria for the
because the keys for these tables are super 3NF. Figure 8-28 comains the rabies Lond
keys. Owner-IO uniquely identifies the re.st Records lo and Lond Records lb. both of
of the row in Lond Re.cords 2, and the con- which now satisfy the lNF crile;ria, and pre-
ca1eruued key of Porcel-ID and Tship-ID are serve the infonna1iou con1ained in lhe 1NF
the rows iu Lond Records 3. table in Figure 8-26. Note 1hat P,aroet-10 is
now a super key for Tobie Jo and Tship-lD is
33/ Yamane
t2J~nc
12.JOhJ:S
..__ 230J 122
2303 337
890 ~.stov,c 12....cloy;o,,
457 Suor't2 2303 890
"53~
231 swmon 6.C.J'IChmOnd 618 457
618 890
••• 231
9473 337
ti&urt 8-28: Owmn.biJ> d.ltil in third oonnal form. with the fi.anctiooal dcpendeocics (FO) ®tcd at
the top of dtc table.
Chapter 8: Attribute Data and Tables 355
a super key for T<1ble lb, so the 3NF criteria the most common method for organizing and
are sa1isfioo. manipulating anribtne data in GIS.
A general goal in defining a relational Selections. or queries. are among the
database structure is 10 have the fewest most common analyses couduc1ed on anri-
tables possible that contain the important bule data. Queries mar!c a subset of records
relationships and to have all 1ables in at least in a 1able. ofieu as a precursor 10 subsequent
3NF. Normal forms higher than three have analyses. Queries may use AND. OR. and
been described and provide funher advan- NOT operations. among oth.ers. alone or in
tages: however. these higher fom,s are often combination.
more limi1ed in their awLica1iou and depend
on 1he intended use of 1.he database. Keys are important in slructuring rela-
tional data iables. PriJUary and foreign keys
While rela1ional tables in nonnal forms are defined which are used in joining tables.
have cenain useful characteristics. and rela- Primary keys may sometimes be concate-
tional databases are 1he standard 1001 naroo. or fonnoo from several colunms.
adopted by mos1 large organizations. 1hey rather 1han jns1 from one column. Emry and
may suffer from relatively long access limes manipulation of key values ~soften con.
for specific queries. Databases may be orga- mained so tba11ables may timcrion properly.
nized around usage. or de11ormali:ed for the
most common processes. particularly when Relational tables are often placed in nor-
databases are quite Ja,ge and high speed mal fonus to improve correcmess and con-
access is paramount. These deoorrnaliza- sistency. to remove redundancy. and 10 ease
tions typically add extra columns or penna- updates. Normal fonns seek to break large
nent joins to the database structure. This cables into small tables 1ha1 ~omaiu simple
may add redundilocy or move a table to a functional dependencies. This significantly
lower nonnal fonn. bm these disadvantages improves tbe maintenance aJJd in1egri1y of
often allow significant gains iL1processing the database. Nom1al forms may cause some
speed. The need to denormalize tables bas cost in speed of access. although this is a
diminished with improvements in computing diminishing problem as computer hardware
power. Howe\·er, denormalh:a1ion may be improves.
required for ex1remely large da1abases, or
where access speed is of prim•ry impor- Object-relational database systems ha"e
tance. been developed that incotpc,ra1e the mong
typing and domains of object-oriented mod-
els wi1h 1be flexibility. logic, and ubiquity of
Summary rela1ioual data models. These e\'ohuiouary
improvements 10 1he relational approach will
A11ribu1e da1a are an imponan1 compo- continue as dalabase teclu1ologies are
nent of spatial da1a in a GIS. These d.11a may ex1ended across networks of computers and
be organized iu several ways, bul data struc- !he World Wide Web.
nires 1ha1 use relational iables bave become
356 GIS Fundamentals
Suggested Reading
Adler. D. W. (2000). /BM DB] Spar in/ Extender - Sparinl darn within n RDBJ.fS. Pro-
ceedings 27th International Conference on Very Large Databases. Rome. ll'aly.
Arcnar, D .. Zeller. M. (2004). Desig11i11g Geodntnbnse,: Case Studie, in GJS Data
Modeling. Redlands: ESRI Press.
Aronoff. S. ( L99 I). Geoirnphlc Jnfo rmm/011 Systems: A i\lmmgeme111 Persp.ecrfre.
Ontario: \VDL Publicauons.
Date. CJ. (2004). A11 /11troductio1110 Dntnbnse S)•stems (8th ed.). Boston: Pearson!
Addison-\'lesley.
Frank. A.U. {1 988). Requirements for a database management system for a GIS. Pho-
rogrammein·c E11gi11eeri11g and Remote Seusi11g. 54: 1557- 1564.
Lorie, R.A., Meier, A. (1984). Using a relational DBMS for geographical databases.
Geoprocessi11g. 2:243-257.
Milne. P.. Milt.on. S., Smith. JL. (1993). Geographical object-oriented datal>ases: a
case snidy. !111e,·nm,'011a/ Joumnl of Geogrnpl,ical b,jonna1lou Sysrems. 7:39-55.
Obe. R.O•. Hsu. L.S. (2011). PoSIGIS i11 Action. Greenwich: Mam1ing Publications.
Rigaux. P., Scboll. M., Voisard, A. (2002). Spatial Dnrnbases Will, Applicati:011s To
GIS. San Francisco: Morgan Kaufmann.
Teorey. TJ. (1999). Dntabnse ,\1odeli11g n11d Desigu (3rd ed.). San Francisco: Morgan
Kaufmann.
Ullman. J.D .• \Vidom. J. (2008). A First Course in Dntnbase Systems. New York:
Prentice Hlall.
Zeiler. M. (1999). Modeling Our World: A11 ESRJ Guide 10 Geodnrnbase Desigu. Red-
lands: ESRI Press.
Chapter 8: Attribute Data and Tables 357
Study Questions
8.3 - Describe the difference between single and muhiple user views.
8.5 - \Vhiieh single columns in the following table may serve as keys?
8.6 - \Vbich single columns in the fol.lowing table may se1:ve as keys?
8.8 - What are the eight basic operatio11s fonnally defined by E.F. Codd for the rela·
tional model?
8.9 - What is the primary reason that hybrid database models are used for spatial
data?
358 GIS Fundamentals
8.11 - Does an AND condition result in more, fewer, or the same mm1ber of records
as 1be compou,cu1 pans? For example. is the se1 from:
condi1fo11 A AND condition B
the same. bigger, or smaller 1han the set from condition A alone. or condition 8
alone?
Chapter 8: Attribute Data and Tables 359
8.12 - Identify the states meeting each of the following selection criteria., based on the
1able below. No1e 1ba1 these dam are for the 1990s. and may 001 reflec1 co11di1ions
1oday:
a) S111okcrs < 20"/o
b) Smokers > 20% and illiteracy < JO
c) No1 (non-federal 1a.,es > 9)
d) UUteracy < 7 or income > 22,000
e) Ge1 more federal aid than p.1id in 1axes. and non-federal taxes > 9
O[ Firearms dea1hs < IO and income > 21.000 Jand 001 (smokers > 20}
Freorm
srn~ers Income Inlterocy deaths / Non·FedttOI Fed. Aid I
FIPS N"""" (1') ($/person) (%J 100.000 Tox Rot•(%) F•d Taxes
,o Okldlomo 24 7 17.646 12 !3 l 9 .8 I 48
8.13 - Identify 1hc cow1trics from the following table (dnln from the 1990:.) thn1 meet
the following cri1eria.
a) Per capita euergy use > 4.000 and popula1iou -< 20.000,000
b) Infant morrali1y -< 7 and life expectancy > 79.0
c) Per capita energy use -< 4.000 or [(population > 40 million) and (car thefl -< I)I
d) (Per capita energy use < 4,000 or (population > 40 millioo)) and (car theft -< I)
e) not (population > 40,000.000)
t) Population < 20,000,000 nod not (car theft > I.S)
8.14 - \Vhat are normal fonns in relational databases? \Vhy are they used. and what
are the advamages of putting da1a in higher uonnal fonns?
Chapter 8: Attribute Data and Tables 361
z
-
8. 15 - Sketch the output table resulting from an inner join shown below:
I di POS
WO
eo
Id2
x
y
tm
5
1
A rt A 6
y pr Q 4
y nn N 3
R rt L 2
Q mn
SJ 6 - Sketch an outer join for the 1able shown in the previous problem.
8.17 - Define 1he basic differences be1weeu !irs1. second. and 1hird 11on11al fonns.
8.1? - Lisi !he single·tolWlUl funtlional dependencies in !he following table, using
the arrow noiation described in this chapter.
ID Size Shope Color Age Source
l lorge rourtd blue 10 A
2 medium rourtd green 5 8
3 smoll round red 10 c
4 medlum knobbed green 5 D
5 medium knobbed green 5 E
6 lorge round blue 10 F
7 lorge round blue 10 A
8.20 - Lis• 1he single-column func1ioual dependencies in the following table. using
the arrow notation described in this chapter.
ID Size Shope Color Age Source
5 medium round blue 5 A
2 large round green 10 8
3 smoll round green 10 c
4 large knobbed green 10 0
5 medium knabbed blue 5 E
5 medium round blue 10 F
7 large round green 10 G
363
Introduction
Spatial dntn mwlJ• is is tl1e applica- ses. Many operations incorporate botl1 lhe
tion of opera.tioos to coordinate and attribute and coordinate data. and the
related attribute dala, often to solve a attributes must be fi.t.rtherselected and
problem. We may wish to identify high modified in tlte course of a spatial analy·
crime areas. to idcmify road segments that sis.
need repaving. or to find the best area for The discussion in the present chapter
wind rurbines. There are hundreds of spa· will expand 011 rather than repeat 1lte
tial operations or spatialfimctions used in selection operations neared in Chapter S.
spatial analysis. and all involvecalcula· This clL1pter describes spatial data analy-
tions with coordinates or attributes. ses that involve son. selection. classifica-
Spatial operations are often applied 1ion. and spatial operations that are
sequenlially to solve a problem. the out· applied to both coordinate and associated
put of each spatiai operation serving as attribute data.
the input of the next (Figure 9-1). Pan of
the challenge in geographic analysis is
selecting appropriate spatial operations Input, Operations, and Output
and applying them in the appropriate Spatial data analysis typically
order. involves using data from one or more lay-
The table manipulations we described ers to create output. The anailysis may
in Chapter 8 are included in our definition consist of a single operation applied to a
of a spatial operation. Indeed. the selec- data layer. or many operatio.ns that inte-
tion and modification of anribute data in grate input data from many layers to cre-
spatial data !:ayers are inch1ded at some ate the desired output.
time in nearly all complex spatial analy·
Fiaurt 9-1: A sequence of spa1i:il opcr.ttions is often applied 10 obtain a desired 6nal <bl.fl layer.
364 GIS Fundamentals
Spa!ial operations may also require G/obol operations use data values from
several inpurs or generate many ourpurs. !he entire input layer 10 determine each out·
Terrain analysis functions may take a raster put value. lbe value at each location
grid of elevations as an inpul data layer and depends in part on the values at all input
produce bo!h slope (local steepness) and locations (Figure 9-2. bouom).
aspect (the slope directioo). In !his case. The set of available spatial operations
two oulputs are generated for each mput
depends on !he data model and_type of
elevation data set. Averaging may use mul· input spatial data. Some operauons may be
tiple inpm layers to produce a single output e.asily applied to raster or ,·ector data.
layer, for example. annual rainfall found by While the details of the specific implemen-
summing 12 monthly rainfall raster layers. tation may change. !he concep1 of the oper-
The output from a spatial operation ation does not. Other operations may be
may be spatial. creating new spatial data possible in only one data model.
layers. or nonspatial. producing scalar val- Data model characteristics \\>"ill de1er-
ues or a table. A layer average function mine how any given operation is applied.
may simply calculate the mean cell value The specific implementation of many oper·
found in a raster dlata layer. The mpm ,s a a1ions. for example. multilayer addition.
spatial data layer. but the output is a single depends on the specific data model. A ras-
number. ter operation may produce a different out·
come than a Vo!C!Or operatiom.. even if the
Scope 1hemes are meant to represent the same fea-
rures. In a like manner. !he specific set and
Spatial data operations may be charac- sequence of operations in a s.patial analysis
terized by their Sf"'rial scope. the extent or will depend on the data model used and the
area of 1he input dlata that are used in deter- specific operations available in the GIS
mining the values at ootput locations (Fig· software.
ure 9-2). Spatial o_peraiions may be
characterized as local. neigbborbood. or Spatial scope provides a good example
global, to reflect the extent of the input area of this influence of data models. Cells 1n a
used 10 deiermine !he value a1 a given Out· raster data set have uniform size and shape.
pm location. A local operation applied to a raster data
layer bas a weU-defmed. repeatable area. In
Local operorio11s use only !he data at contrast. polygons usually vary in size and
one input location to detennine the value at shape. A local operation for a vector poly-
that same output location (Figure 9-2, top). gon data set is likely 10 have variable size
Attributes or values at adjacent locations and shape from one location 10 another. In
are not used in the operation. Figure 9-2, the local operation follows a
Neighborhood operations use data state boundary. Therefore. the operauon
from bo!h an input location plus neart,y applies to a different size and shape for
locations to determine the outpur value each state.
(Figure 9-2. center). T_he extent and relative Neighborbood analyses are affected by
imponance ofvahues m !he nearby region 1he shape of adjacent state.s i_n a similar
may vary. but the value at an output loca- manner. Summary values such as popula-
lion is influenced by more than just the tions of adjacent stares may be greatly
value of data found at the corresponding influenced by changes in neighborhood
input location. size, so great care must be taken when
interpreting the results of a SJ?3tial o~ra-
tioo. Knowledge of the algoathm behmd
the operation is the best aid co interpre1ing
!he resulls.
Chapter 9: Basic Spatial Analyses 365
5
Loco! opero1ion:
1990 5
population
density 20 32
32 12
2
6'-<----1
Neighborhood
operotion·
number of
odjocent 6 '
stoles
5 J
10
6 '
J 1
While mOSI operations miglu be coo- re,·erse is mtt for network operations.
ceprually compa1ible with most spa1ial dara which are generally easier roapply in vec-
models. some operatiousareeasier10 apply tor models. ln many in.slances. it is more
in some rncdels. Most neighborhood opera- efficient 10 coo,·en the data lletween data
rions are quire easy 10 program when using models and apply the desired operarions
rasrer dara models, and can be quire diffi- and, if necessary. conven the results back
cult when using vector data models. The to the original dara model.
366 GIS Fundamentals
'
Arkansas
States entire ly
north of
Arkansas
States both
entirely north of
Arkansas
and
larger than 84.000 sq. km.
•
..
all counties with a population > I000 per- argument specifies a condition on a vari·
sons per mi2. wh;le the right shows those able named oreo, and the second argument
with
., a density less than(<) 250 persons per a condition on a variable named formJn·
DU"'. come. Features are selected if they satisfy
The set algebra operations greater than both argumems. that is. if their oreo is
(>) or less lhan (<) may 1101 ~ applied to
larger than 100,000 ANOfonTI.Jncome is less
nominal data. because there is no implied than 10 billion-.
order in nominal data. Green is not greater Expression 2 in Figure 9-5 illustrates a
than yellow. and red is 001 less than blue. Boolean NOT expression. lllis condition
Only the set algebra operations equal 10 < ·> specifies that all fean,res with a variable
and not equal to (< >) apply to the.se nomi- state which is not equal to Te.xos will return
nal variables. AIL set algebra operations a true value. and hence be selected. NOT is
may be applied 10 ordinal dara. and are also often known as the negation operator.
often applied to intervaUratio data. This is because we might interpret the
A CAB A AND 8
•
flaul't' 9--6: E.umpks of
c:tprcsiiom, iu Boolean
a1lgcbl'$, nnd nbcir out·
<Otu¢S. SubaC"eu, of three
rc3toc11 Ate sclcc1cd by '--'C
cotnbining. ANO, OR,
~od NOT coudi1iom: in
Boolean cxpt"cuious. NOT(A Cl< B)
A!lr i Ub-lltC':I Of f'Ollf NOT(A Al<) B) (AAN08)0RC
of sub-.,rc.:it c13.)' be
5C'l«l<d by the COITCCI
Boolean combination.
370 GIS Fundamentals
(Coun1y • Rice)
AND
(Wshed • Connon) Scon
Flp rt 9. 7: Au clCalltplc
of :t Boolc:\n sdcc-1ion
~pplicd to a Kt ofco,an·
tics and \~ten.beds in tl1<e
M.idwcS-1c:1n United Olmsted
$11110:s. The mid sb.i<kd
area i.s wit.bin the Cruwoo
Rin:=r wa1mhed. a rtn'bu-
tary of the. Mi:ssi.ttippi
Rl\·« , while the dadtci t
shaded ru-ea is also ..-.. ;1bin
Rice County. Minnesota.
with Missouri on one side and a differenl qualifying as adjacent, thell Colorado is
state ou the other side may be flagged. and added 10 the selected adjacent set (right
the list of states adjacent to Missouri Figure 9-Sb). l11is is another illustration of
extracted. an observation made e.arlier: there are often
several variations of any single spatial
Adjacency is defined in Fig\trc 9-Soas
sharing a boundary for some distance operation. Care mus1 be taken to test the
greater than zero. Figure 9-Sb shows how a operation under controlled conditions until
different definition of adjacency may affect tl1e specific implementation of a spatial
selection. Tb.e left of Figure 9-81> shows the operation is well understood.
state of Arizona and a set of adjacent Features may be selected based on
(shaded) westcm states. By the definition proximity. Proxintity selection typically
of adjacency used in Figure 9-So. Arizona requires a set of selecting features and a set
and Colorado are not adjacent. because of target features. All the target features
they do not share a boundary along a line within a specified distance of the selecting
segment. Arizona and Colorado sh.are a feantres will be chosen. e.g .. all weather
border at a point. called Four Comers, stations within 60 km of a watershed may
where they join with Uiah and New Mex- be selected to es1in1ate rainfall (Figure 9-
ico. When a different definition of adja- 9). Selecring only stations within the ,vater-
cency is used. with a shared node shed may provide poor eslitnates ofrainfall
a)
b)
AdJocency Aeljocency
• shared line required - shoted node o t II ne reQutred
Cotoroao COli)rodO
•• Arlzono Arizona
Figure t .8: Exflmplei of s.dc-ctions based on lldjac-c:ncy. 11) Mis~sti. USA is s.hown on 1hc left 11nd aOs.1111c:s.
:idJ~Cctlt 10 Miuouri sbowu oo tbc: righ.1. b) Different dcfini1ions of 11dj:1cc:uc)' mnill in diffcrcn1 sc:l«1ioos.
Colorado is oot 11dj11c-a1t to Arizona wbc:u line t1dj11c-CU('.y is required (kft). Wt i.5 wbc:u t»de adjac-c:ncy is
ac-cqn«l (rig.bi).
372 GIS Fundamentals
near the edge of the polygon, but using all a given distance of a set of polygons. but
gages is inefficie,u because infonnation not those that are within the set ofpolygous
from gages very faraway often isn't. belp- themselves. Users sho\lld clearly identify
ful. An adequate exteriorproxmuty ts ~ho- tJ1e selection tool function and outputs.
sen. usually by visual inspection. previous Proximil)' scll:j;tion. and most selec-
experience. or pr,elim.inaiy tests. •.nd. the tion processes more generally. typically
proximity selection apphed. Proxmuty 1s only select features that meet a given set of
usually calculated implicitly within tl,e criteria, The process often does not crea1e a
operation. in thar first the source reanires separate. new data layer of the selected fea-
and selectable features are provided, a tures. Selected fean1tes are marked On·
proximity distance and metl1od specified, screen and in the corresponding data table.
and on mm1ing tile operation. the selected bm still are pan of 1he source data set. .
set is identified. Less frequently, software Tiiere is often an additional export step 1f
requires a muhi-s1ep process in which the these selected features are destined for a
source feamres and selection layers are new data set. This explicit crea•ion
identified, and a selection layer is cre.ated. approach is usually adopted becaus~ selec:
This created layer usually contains poly- tion is often a inulti·step process. with van·
gons defining tl1e area within tl1e specified O\IS different selection tools applied
distance of the source fean1res. Tins poly- successively to arrive at a target set of fe.a-
gon layer is then used 10 select features tures.
from the targe1 set.
We often add indic.ator or classification
There are several variants of proximity variables to our data tables when we have a
selection. Oue variant selects all features complex set of selection criteria. particu-
that are at least p;anially within a give!-1 dis- larly when combining both spatial ~od tab-
tance of a set of reantres. AnoUier vanam ular selectioils. These l!ldicaiot va,,ables
selects ouly features that are entirely ~side record the membership of feanires iii
a given distance of tile o\ller boundaries of groups that match or don't match a se1 of
a set of polygons. A third variant selects conditions. Figure 9-10 illustrates a selec-
only those feamres tltat are entirely within tion for a set of counties that both contain
. •. .
.~, ·.: .:u~- n•~:~~:.:,;i.v,
:" '"-" •
.a,
~ ,
,
-~ · .t.'".t
·"'
.'JM(
~~~!,'
~
'• t J".:Y •
v
...,
.tt• \.. . .n• .•,....a• ..a,·~
....( ·-
.,. .
· ·"··
•
. u.,
·"' ·"'> . I
)
.
•
.
'
• :iu
.- • .:o• ...., ..,.,. ..,,.-i.
( ..».. . 11• ..-, •
·. ( \•
,\... ,
••
•
••• . ••••
,:1<1• • .npee ' ~~ , • ,-.,.1 · "'" ·• • •
.x...,....rr, ,:u
i.t , JIU .a.:'*J
L..~:;.
, YI '
,1:f1'~·"' 'u :n,,,s
•,VA ' : ),11
' 1
• ' ) · '
' ' ,,.., ,m .ll,! 111 , •
.N1 .-.."77~· , IIU ,n.J
• fl .
,110
..... . u..
'"'.U-.>\O
°" , •. -·~· '- - - •"';- ~ 0
•
0
• • •
fiiUn.' 9.9;An c:u:ruplc ofa sp111ial selcc1ion ~s~ on, ~xU?iir, ulecting rt'lllfal~ ga.gcs oca; 11 wa~l"$bcd
(points. Id _P!,UCI. a\·c:ragc sbown). All gages are 1dcnhbcd w11bin a 60 bu proxurut)' of tbc ,,a1crsl1
boundary (113h1 p111"1d, li3bt sqtl,Vd).
Cha pier 9: Basic Spatial Analyses 373
part of a target watershed and hazardous manner. It is good praclice to flowchart all
materials (Haznun) storage sites. A munici- mulli-s1ep spatial analyses. and the value of
pality might 1mdertake an ups1ream inven- the flowchart grows with rile comple.xiry of
tory if they measure a spike i.n toxic the analysis.
chemicals in their water supply. Ha2ma1 Caution is helpful when applying sub-
sites are dis1ribu1ed 1hroughou1tbe stare of sequent sp:llial operations to a data set with
Georgia. The Allamaha River drains much selected features (like the r.ainfall gages in
of the cen1r31 ponion of the s1are. A spatial the rigln panel ofFig11re 9°9) since many
selection identifies counties l1la1 contain a subsequent operations are possible. Some
portion o f 1.he watershed. and a table sele~ operations by defauh only ac1 on selec1ed
1ioo identifies l11ose com1ties containing feamres. while 01her operations apply 10
Hazma1 si1es. A column may be added 10 the entire data set 1l1e choices are often
the-county table and values assigned to software-dependent, and vary by operation.
record this set of potential source counties, and so you should consult the documeuia-
to be used in subsequem analysis. tion or rest e.acb new spatiail operation
Figure 9· 11 shows one flowchart ofrhe when firsl using it to avoid unintended
geoprocessing analysis in Figure ?-10. results.
While the analysis is relatively simple. and Comai11111e111 is another spatial selec-
the steps primarily operations on the tabu- tion openuion. Comainment selection iden-
lar data. the llowchan sl1ows 1he da1a. spa- tifies all features that contain or surround a
tial operations. ru.1d orde.r in a succinct set of target features. For example. the Cal-
figur~ 9. 10: An c:.'C.'lmpli: of i:ndic~tot ,·Wblcs 10 ttcord sc1 mcmbdship. We wU.h co idc;iuify cowuies with
IJ.i1U1rdou, 1natcri~b site$ (H:.iaiut, upper left) 1lu11 cour.:iin p~t1 of the Alu1tll3WI R.h-cr w11tcn.licd (upper ccn,
ttt), We may fIM appl)' a ~tial sel~tioo oo counties with the watc~d boundary (upper right). tbcu a
cable sclcct1«ll oo lbc coinc1dcut counties to identify those wi.1h Hazm111 sil« (lower left). A column. Jt.crc
n1Utlcd foSc-1. n12y be lidded to ickruify 1.hc t dccicd cou.utics il'l fwthct proccuiug (.bonoo, center and ri$ht).
374 GIS Fundamentals
Counties
County with
HozMot ~ HozMot
Sites \ Sites
~ ' Creat e InSet item:
~_J InSet = I if HozMot > 0
Inset • O otherwise
Altomoho
River
Waler·
J County
HozMot
\
Fi;un !t· lZ: Au example of~ sc:lc:c:tio11 baKd oo c:ontaiwncnt. AU 1it11tes: c:oni:afoiug a
portion of the Missiuippi Ri\'tt or i.1s cn"butaric:s 11rc selected.
its tributaries. A query is placed. identify. data for each geographic object. These
ing the features ctuu are contained. here the modified anriblltes may in rurn be used in
Mississippi River oetworl<. and the ta.rget further analyses. such as fo·r more complex
features that may potentially be selected. combinations in additional classification.
The target set in ctlis example consists of Classification may be used for many
the lower 48 states o(the United States. All other purposes. One common end is to
states lhat contain a portion of the Missis- group objects for display or map produc-
sippi River or its tributaries are shaded as tion. TI1ese objects have a common prop-
pan of the selected set. erty. and the goal is to display tbern with a
uniform color or symbol so the similar
Classification objects are identified as a group. The dis-
play color and/or panern is typically
Clns.s{f,cation is a spatial data opera- assigned based upon the values of an anri·
tion that is often used in conjunction with bute or anributes. A range of display
selection. A classification. also known as a shades may be chosen. and correspoodiog
reclnssificm;.c,, or recodi11g, will categorize values fora specific anribu~e assigned. The
geographic objects based on a set of condi- map is then displayed based on tilis classi-
tions. For example, all the polygons larger fication.
than one square mile may be assigned a size
value equal to Lorge, all polygons from 0.1 A classific,a1ion may be viewed as an
to I square mile may be assigned a size assignment of feamres from an existing set
equal 10 Mid. and all polygons smaller than of classes to a new set of classes. Weiden.
0.1 square miles may be assigned a size tify fearures that have a giv,eu set of values.
equal to Small (Figure 9-13}. Classifica- for example. parcels 1llat are above acer-
tions may ad<l to or modify the anribllte tain siz:e. and assign them all a classifica-
tioa value, lil tl1is case tlie class "large."
Parcels in another range of sizes may be
assigned differem class values, forexam-
ple. "mid" aod "small." The attribute that
stores the parcel area is used as a guide to
assigning the new class value for size.
TI1e assignme111 from iupu, anribute
Size values (area) to new class values (here.
Smoll size) may be defined manually. or the
Q ,.w assigmneut may be defu>ed automatically.
. L""!l" For manual classifications. the class transi-
tions are specified entirely by the human
analyst.
Classifications are often specified by a
table or array. The table identifies the input
class or values. as well as tl>e output class
for each of this set of input values. Figure
9- 14 illustrates the use of a. classification
1able to specify class assignment. Lnpu1 val-
ues of A or 8 lead 10 an omput class value
of 1, an input value of Eleads to an ou1pul
value of z. and an inplll value of I leads 10
an output value of 3. The table provides a
complete specifica1ion for each classifica.
tion assigumem.
376 GIS Fundamentals
Source layer
A
E
A _eJ Classification
r toble
A I
B In Ou1
B Output layer
A I
I B I
E 2
I 3
t
Figure 9· 14: The cl.assi6c111ion of a d1cm:ttic fayer. Valuc-.t 11tc $i\"C'l'.l 10 s:pccifie anribut« ln t1. <:tusifi·
c:nion tnble, which is u,cd, in nirn. to MS:ign d:h,c~ i1) Jin output la)'c:r.
Cha pier 9: Basic Spatial Analyses 377
• -
--t··-
;-a-
... - - ,-
r •. :J .I'--'
·~''il/'i.
·--{-.;...o:..i \',f,'-Q~~~
- -d,....\j'.1.,-.
_;,{;)> ...
'><~-·
.... =,.-- \.-- ,/IJ/~::.... - -· ..
\-.i ..../ - L._-
·? .__ J .._: ~ :-1 ,.,,
~frj'P-;Zr!
.1,:-f!!~
~
~·~ .
States west of the main
~ '."~ • ....__ ' f/ I branch of the Mlsslssippl
"· I , ' - '"\ River assigned I. east of
! ~ \ \J the River ossign-ed O
Clossiflcot,on t able
sfote name ,s_fts-t
Atobcomo 0
Arizono I
Arkonsos I '0
Coloro<lo 1
Connecticut 0
.
Wvomino 1
fl~u~ 9. 15: An example of ll binruy c l.Msific:uio,n. Fci1fttf'cs ,ire_pbccd i111<> 1wo claucs in;, biMty cliw:ifi··
cal1ou. wes:t (J) .ind C-3$1 (0 ) of tbc M.issinippl Rh·c:-r, The i:la5sific..ition toblc codifies dtc anigomcut,
378 GIS Fundamentals
A poteruiaJ drawbac.k froman auto. population, and we would simply use this
mated class assignmem stems liom our level. However. in many ins1ances the
inability to precisely specify class boundar- classes are no1 defined. and we must
ies. A mathematical fonnula or algoritlun choose them.
defu,es the class boundaries. and so spe- Figure 9-16 includes a bar graph
cific classes of interest may be split. Thus. depicting the population frequency distri-
the analyst sacrifices precise control ove.r bution; this type of bar graph is commo,tly
class specification when an autc,mated clas- called a bistograni. The frequency bisto·
sifica1ion is used. gram shows the number of neighborhoods
Figure 9-16 describes a data layer we that are found ia each bar (or ''bin'") of a set
will use 10 illustrate automatic class assign- of very narrow plpulation categories. For
ment. TI1e figure shows a set of"neig.bbor- example. we may count the number of
hoods·· with populations that range from O neighborhoods tha1 have .a population
to 5133. We wish to display the neighbor· between 3.000 tc 3. 100. Approximately
hoods and populations in three distinct S.1% of the neighborhoods have a popula-
classes, high, medium. and low population. 1ion in this range. so a venic.al bar corre-
High will be shown in black. m-?dium in sponding to 8.1 units high is plotted. We
gray. and low in white. We must decide coum aod plot the histogram values for
how to assign the categories - what popu- each ofour narrow categories (e.g .. the
lation levels defme high. medium. and number from Oto 100. from 100 to 200.
low? In many applications. die classifica- from 200 to 300). until the highest popula-
tion levels are previously defmed. 111ere tion value is plol!ed.
may be an agreed-upon standard for high
Neighbomoods
- l.074 polygoos
- PoP\llotion for ne,ghborh<>ods
ranges from O to 5.133 (3 oulliers > 3.300)
...-
•
"" ---
population
Fi&""' 9-16: Neighborhood polyaQns l);od popuL,tiQn levels uwd in ttlbscqurnt c.xnmple, ofclau:iticatic>n
a.wignmc-ot. Tii,; populatio~ for ti,,.,,., l,074 ii...igbb<n:bocxh 1"3Dgo; from O to S, IJ3. The b~ogram M th,;
lower ria:ht sbows tfl.c frequency distribution. NO(c tba1 ihcrc it a b~.ak i.u th< ('ban bc-tw«ll 3,500 and
S,000 ,o show tltc tbttc "oullie:r"' m:i~tborhoods with popubtiont itbo\'c 3,300.
Chapter 9: Basic Spatial Analyses 379
O PU
1711- 3<41~
3422 · 51ll
FIJUJ't' 9- 17: All cqrn.tUotcn'JI cl.Aui6~111ion. Tbc muge Oto 5,1 J3 i.sspli1 into l}u« ~utd~11r1s.
Colors ar¢ assigned as sb0\\1l iu 1hc map ofthe laye:r(left). aod iu the frequency plot (ngb1 , Note
tb1:: td.1ti,«ly (<iw polyaom ,1u.ipc.d 10 tbc. hii,h d ,11uc11 W bJ.,,d:.. A ("w Miplborboocb wi popufa·
tioiit oc.nr 5,000 s.bifi 1l1t dau b<luncurict upwanL
380 GIS Fundamentals
populations below 3,000. The outliers shift large polygon. This results in a small range
the class boundaries 10 higher values. 1711 for the large polygon classes. Cla.sses also
and_ 3.422. res11ltlng in most neighborhoods tend ro have a narrow range of values near
falhng m the small population category. tJ1e peaks in the histogram. Mat1y polygons
Anlllher common mi!ihod for class are represented at the histogram pea.ks. and
assignment results in an equnl-area classi- so tbese may correspond 10 large areas.
fication (Figure 9 -18). Class boundaries are Both of these effects are illustrated in Fig-
defined 10 place an equal proportion of the ure 9-18. The middle class of the equal.
study area into each of a specified number area classification occurs at population val-
of classes. This t<sually leads 10 a visually ues between 903 and 1223. This range of
balanced map bec.ause au classes have populations is near the pea.k in the fre-
approximately equal exten1s.. Equal-area quency histogram. aud these population
classes are often desirable. for example. lev_els ~e associated with larger polygons.
when resources need 10 be distribmed over Tius nuddle class spans a range of approxi-
equal areas. or ,v.hen equally sized overlap- mately 300 population u1tl1s. while the
ping sales territories may be specified. small and large classes span near 900 and
4,000 population units. respecti vely.
. Class width may change considerably
with an equal-area classification compared Equal-area assignments may be highly
10 equal-interval. An equal-area classifica- skewed when there are a few polygons wilh
tion sets class bo1.mdaries so that each class large areas. and these polygons have simi·
covers approximately the same area. A tar values. Although not occurring in our
clas.s may consist of a few or even one example. there may be a relationship
Equal-area classification
O· 901
90) · 12:lJ
- 122, 51.)J
(;'
c
.,.
:,
.
.. .
!
'
population
fia:urt 9-18: Equal4rca elas1-ifaation. Oau bouudari« arc set Rieb th.at ~cb class ba, appro".i.
m1udy lhe S;*'-!)C. total :itta. Tbit oftcu Je11ds to a ssnlllkr ran~ wbnt poups of &cqucut classes arc
fatu~. In llu.1 11!111t.1ncc lhc ulc:di!11n c1tU8 spru1& a ,niall ral'lgc, fro,u 963 10 1,223. while d ie ]\igb pop·
ula11onclus ~m.ii 1-allg:C tho.11, alruo,t JO cim" broader, from 1.224 to S. 133.
Chapter 9: Basic Spatial Analyses 381
belween the population and area fora few naturally occurring clusters of data; not
neighborhoods. Suppose in a da1a se, simi- clus1ers based on 1be spa1ial rela1iousltips.
lar 10 ours 1here is one very large neighbor- bm ra1her cl11s1ers based on an ordering
hood dominaled by large parks. 11,is variable.
neighborhood has bolh !he Jowes1 popula- There are various me1hods 11sed 10
1ioos ru1d larges1 area. An equal-area classi- idenlify narural breaks. large gaps in an
fkation may place lhis neighborhood in its ordered list of values are oue common
own class. If a large parcel also occurs wilh me1hod. 11,e values are listed from Jowes1
high popula1ion levels. we may ge11hree to highest. and the largest gaps in values
classes: one parcel in the small class, one selec1ed. Barring using gaps. low poims in
parcel in 1he high popula1io11 class, and all ct,e frequency his1ogrrun may be used 10
1be remaining parcels in 1he medium popu- iden1ify breaks. There is usually an effon 10
lalion cl.ass. \Vll.ile most equal-area classifi- balance the need for rela1ively wide and
ca1ions are 001 this extreme, unique parcels evenly dis1ribuied classes and 1be search
may sirongli• affec1 class ranges in an for natural gaps. Many narrow classes and
equal-area cl:assification. one large class may not be acceptable in
We will cover a final me1hod for au10- many instances. and !here may be cases
mated classifica1ion. a method based on where 1he specified numbe.rof gaps does
natural breaks, or gaps, in the dala (Figure. not occur in 1he data histogram. More
9-19). Narural breaks classifica1ion looks classes may be reques1ed 1han obvious
for ''obvio11s"' breaks. 11 a1temp1s 10 idemify
0 - 1130
1130 • 2156
- 2156 • 5133
FJeurt 9~19': Nttn1t11l breala claui6ciuiou. Boundruic-.s bc:twcrn daue1 arc placed wbcr,c uat\ltal a;aps
or ]ow poinl.s occur iu the frcquci:icy distribu1i011..
382 GIS Fundamentals
gaps, so some natural bre.ak methods or other census 111easures. These subsequent
mclude an alternative methOd. tor example. vatues wtll depend ou the stze. shape. and
equally spaced intervals. for po~ions of the location of tbe awegated polygons. as seen
histogram where no natural gaps occur. for mean population age in Figure 9-20.
Figure 9-19 illustrates a na1t1ral break This general phenomenon whereby attri-
classific.ation. Two breaks are evident in bute values vary by spatial grouping. known
the histogram, one near 1300 and one near as the modifiable areal tuiit problem or
2200 persons per neigbbod1ood. Small. MAUP. has been exploited by politiciaus to
medium. and large populations are. redraw political boundaries to one party or
assigned at these junctures. another's - but generally not the country' s
Figure 9-17 through Figure 9-19 illus- - advantage. The process of aggregating
trate an important point: you must be care- neighborhoods to create majority blocks for
ful when iut4:rpr1:Ling clas.s maps. be(..11Use poliri<'Jtl :utv:rnUlgP. ,wii- nnmM gerrym.:rnder-
the appareot relative importauc! of catego- ing after Massachusetts governor Elbridge
ries may be manipulated by altering the Geny. when he crafted a political district
staning and ending values in each class. shaped like a salamander.
Figure 9-17 suggests most neighborhoods There are two primary characteristics of
are low population. Figure 9-1 S suggests the MAUP that may be manipulated to affect
that high population neigbborhoods cover aggregate polygcn values. The first is the
the largest areas and that they ;w,, well :011i11g effect. tha: aggregate statistics may
mixed with areas of lower population.. change by the shape of the units, and the sec-
while Figure 9-19 indicates the area is ond is the si:e effect. that aggregate statistics
dominated by low and medium population may chauge with the area of the tlllits. For
neighbort1oods. Precisely because there are example. the mean income of a unit will
no objectively defined populatiln botmdar- change when the boundaries ora unit
ies. we have great flexibility in manipulat- change. either ba:ause of a chauge in zone
ing the impre.ssion we ere.ate. The legend in or a change in size.
class maps should be scrntinized, and the MAUP effects may substantially iuflu-
range between class boundaries noted ence the ,ialues for each unit. and hence sub-
sequent analysis. Openshaw and Taylor
The Modifiable Areal Unit Prob- published results in 1979 that illustrate
lem MAUP dependencies panicularly well. They
analyzed the percentage of elderly voters
A fundamental challenge in spatial aual- and the number c,f Republic au v0ters for the
ysis is that polygons may be redassified and 99 COlmties in Iowa. They showed that the
grouped in many ways when there are no correlation between the elderly vo«ers and
objectively recognizable catego1ies. Con- Republican v0tet munbers ranged from 0.98
sider census data. These data arecollec,ted at to -0.S I by varyi1,g the scale and aggregation
the level of individuals and houieholds but units that grouped counties. Additional work
are reponed at the aggregate le,~I of census bas shown that mullivariate statistical mod-
bloc.ks, block groups. tracts. or counties. An els based on aggregate data are similarly
analyst using fine-scaled block data may dependent on tl1e aggregation m1its. leading
elect to group blocks together in order to to contradictory results predictions.
more readily map or analyze the data. The
most common reason for grouping basic Numerous studies of the MAUP have
units like blocks is wanting 10 match 1benuo shown how to identify andfor reduce the pri-
existing analytical frames such as school dis- mary negative impacts of1he zoning and size
tricts. neijUlborhoods. or wate.rsheds. Spa- effects. The primuy recommendation is to
tially grouping bloc.ks entails aggregating ,\"01k ' "ith lhe must basic \u:iits of measu1e.
their values for population. age. and income. In our census ex1.mple. this would be 10 col-
Chapter 9: S.Sic Spatial Analyses 383
lect and maintain information on the indi\'id· near busy or low lraffic roads. Op1imum
uaJ person or household. This is often 001 zoning approacbts are bes1 applied when
possible: for example. aw,ga1ioo is specifi· interest in one variable predominates.
cally required 10 01ain1ain 1he anonymiiy of Anolller approach 10 solving llle MAUP
llle census respoodeo1s. Howe-~. many in,'Olves cooducting.,, s..:t of sMSitivity ;103J.
effons allow recording and maimaining da1a yses. UnilS are aggrega1ed and rezoned
on llle primary unils "ill,in a GIS frame- across a range ofsizes and shapes and llle
work. and lhis should be implemen1ed when anal)-s<s perfonned for each set. Changes in
possible. Willl reosus daia or Oilier confi- llle resulls are ob',erved. and llle sensiri,,jty
dential infonnation. the ne.."tt best thing is to 10 zone bowidarits and sizes oOled. These
use llle smallest units a\'ailable when possi· 1es1s may identify the relative seosilivities or
bit. different variables to size and zoning effects.
A second way 10 address llle MAUP is Robust resul1s imy be iden1ified. for exam-
based on optimal zoning. Zooes are designed ple. a\'erage age may not change over a
to ma,imize ,ia.riation bttween zones while range of sizes and unit combinations. }~t
minimizing \:ariation \\ithin zones. Optimal may change substmtially o,·er a narrow
zooes art difficull 10 define for more lllao range or sizes in some areas. This approach
one ,,ariable. because variables often do 001 requires many compu1atioos. because it uses
change in concert.. for example, an optimal replicaled runs for each set of variables.
se1 or zones for de1ennuiog traffic densities zone le\'els. and mapes. This often over-
may 001 be an op1imal for average age. Old whelms 1be available coo,puting resources
people are no more nor less likely to live for many problems and agencies.
•
39 8 31 395 C23
435 388
• -
296
,._,..."""" l t 2S
j
384 GIS Fundamentals
Dissolve
A dissoJ,,e function is primarily used 10 assessed. The values for 1be dissolve auri-
combine similar feature.s within a data bute are compared across the bouodaiy
layer. Adjacen1 polygons may have idemi- line. If tl1e values are tl1e same. lbe bound-
cal values for an anribu1e. For example. a ary line is removed. or dissolved away. If
wellands dala layer may specify polygons the value.s for the dissolve attribute differ
with several subclasses. such as wooded across the boundary. the boundary Line is
we1lands. herbaceous we1lauds. or open left intac1 .
water. lf an analysis requires we identify Figure 9-21 illustrates tlte dissolve
only the wetland areas vs. the upland areas. operation tliat produces a binary classifica-
then we may wish to dissolve au boundar· tion. This classification places each state of
ies between adjace111wetlands. We are only the contiguous United States into one of
interested in prese1'1ing the wetJru.ul/upland 1wo categories. tl1ose emirely west of the
boundaries. Mississippi River {l) and those east of the
Dissolve operations are usually applied Mississippi River (O). The attribute named
based on a specific 0 dissolve" anribute lsJVes1 contains values indicming location.
associated with each feanlre. A value or set A dissolve operation applied 011 the vari-
of values is identified that belongs in the able 1s_west removes all st.ate boundaries
same grouping. Each Line 1hat serves as a between similar sta1e.s. This reduces the set
boundary be1ween 1wo polygon feamres is from 48 polygons to two polyg-0ns.
Dissolve operation
Dissolve table
dissolve I
state: name is_we.st value
AIObOm<l o__ _ E
Arizona 1 w
I w w
Arka.f'\SOS E
Colorado
I-
I w
~nntttkuf 0 I:
.... -- ....
Wyoming I w
Fi&UJ'f 9-ll: Au Ulustratiou ofa dissoh·c operation. Boundaries nrc reruow:d when 1hcysq,a:rntc s.1a1cs
with the same \'aluc for the dissol\'e :mn"butc is_west.
Chapter 9: Basic Spatial Analyses 385
Dissolve operations are often needed Dissolves are also helpful in removing
prior 10 applying an area-based sele<:tion in unneeded infonna1ion. After the classific,,.
spatial analysis.. For e.><:unple. we may wish 1ion into small. medium. and large size
to select area.s from tl1e natural breaks clas- classes. many boundarie.s may become.
sification shown OD the left of Figure 9-22. redundant. Uuneeded boundaries may
We seek polygons 1ha1 are greater than 3 inflate storage and slow processing. A dis-
mi2 in area and have a medium population. solve has the advantage of .removing
The polygons may be composed of multi· uw,eeded geographic and tabular data.
pie neigltborhoods. We typically must dis· thereby simplifying data. improving pro-
solve the boundaries between adjace111, cessing speed. and reducing data volumes.
medium-sized neighborhoods prior to Figure 9-22 illusttates the space saving
applying the size test. Otherwise. two adja- and complex.ity reduction common when
cent. mediun,1 population neighborhoods applying a dissolve functio:n. The number
may be discarded because both cover of polygons is r1.'<luced approximately nine-
api;roximately 2 mil. Their total area is 4 fold by the dissolve. from 1.074 oo the left
mi· , above tbe specified threshold. yet they to 119 polygons OD the rigllt of Figure 9-22.
wilJ not be selected unless a dissolve is
applied fust.
f le:urot S,. 22: An c,c;smplc ofit diuoh'c opcr.'11ior1. No1c 1be: tcmo,':U ofli.ne:s se:p11r:t11ing poly·
goo., of 1bc umc siz;e: clllff., Thi$ grudy ~ cc, chc number ofPQlygou,.
386 GIS Fundamentals
e
ID l'Vl'IC
0 I
Cotl
l.000 "'"0"' ""
1979
... "
49 0
b
c ~3·
3 6.000 -
3.000
5
2
2011
1957 24'~- 0
I
-
d I 2.000 9 1989 83 0
•
f
2
2
7.000
9.000
0
I
1959 19
1991 206_
I
I
'
Polygons ourput _ _/
from Dissolve / ---
"'7
~ O') "12!..
10
'
',
'°"'
,...,.
""
""' ..."
...
Sro,,d Exmp
S!O!US
:
e
'
'
j
....
6.000
•
' ' '""" '
' ....
'"'' ""
A tunolc T'21!cs
•• 1'9~9
/1,
1.gg,eqa1e tt,e / ./°' °'\ or-.. R~t J~·
:<.m "' c;.os,
lt> type: Coit
X I lOOl'I
ID
: )(
Y
Ip
I
l
wio
~
O I
.., ., QOOO A.'.'t'*
Stand OQC:
z l I
·
'
..
l !0..000
z 10 type: A.'{( A9(
'
., l ~\
Fi&Ul'f 9~14: D iffcrcn1 aggregation OJ)fflltioos lllllY be applied, lltld appropriate. for diffcr,rn1input ,·Wblc::s.
Here att c:uuupltt of sum. a,"C":l,igc, :u.d 1naxi,mun ope1:uion.s applied duri113, n di.uolvc opcrauou.
388 GIS Fundamentals
• woter point
distance 1o
water (m)
0
6.000
Fla:n~ 9,2S: An cnmplc of:. di&tan¢c fuuctioa. This distflltcc: fwtetion is applied to a poin1 dtua layer alld
Ctt:'l!CS ll l'Ulcr cbl:it
bycr. Titc rastC'I' fa)'C'l' coi11:ains lhc dUt:al'I« to d,c rtcnr«I \\'Mer fc2n.trc.
Chapter 9: Basic Spatial Analyses 389
...
~4- -1~-+-..... I greaterthan that fixed distance.
..•.'
Buffers
..
.. ..
Buffering is one of the most com·
monly used proximity functions. A buffer
. . is a region that is less 1han or equal to a
28-. 20 10 :10 20 specified distance from one or more fea-
rures (Figure 9-27). Buffers may be deter-
20 14 10 10 20 mined for point. line. or area featt1res. and
for raster or vector data. Buffering is the
10 10 10 20 process of creating buffers. Buffers typi·
cal.ly identify areas that are "outside" some
10 14 22 given threshold distance compare.d to those
"inside'' some threshold disrance.
10 10 14 22 28
Buffers are used often because many
spatial analyses are concerned witlt dis.
-
10 20 22 28 36
cance constraints. For example. emergency
diston<:e from nearest planners may wish to know which schools
target cell 10 are within 1.5 kilometers of an earthquake
units
fault a park planner may wish 10 idencify
Fi2ort 9-26; A dislioui fuoC;tion i pplied to n all lands more cl!an 10 kilometers from 1he
raster data sc-1. nearest highway, or a business owner may
wish 10 idencify all po1eo1ial customers
within a given radius of her store. All these
questions may be answered with the appro-
priate use of buffering.
buffe:r
distance buffer
distone
ou1put/
output buffer
buffer
:fiau1'f' 9-27; Exnmplc, of vector and raster buffc" derind from polygonal fc.,turc,. A. b,1,1ffcr i, dcfio«l
by 11,ok ~ that are within some buffer distance how the input fca!UTC$,
390 GIS Fundamentals
Vector buffers
points lines polygons
input
source
featur es
•
•••
•
0
• •
cs ~
output 0 0
buffer /"')
features W
0 0
Fi.au•" 9-29: Vcctor~Jffcl"f produced from poU11. line.. or polygon iupul fc.,t1.1rcs. [11 oll c11i.cs.. the Q1,1tpu1 is
a set o( polrgoll Ce\1tur¢,.
Buffering on vector poilll data is based laye.r. The x and y coordinate locations of
on the creation of circles around each point each point feature are used for x1 and Yl,
in a data set. The equation fora circle with placing tl1e point feature at the center of a
an origin a1 x,o. y, O is: circle.
Buffered circles may overlap. and in
simple b\lffering. the circle bo\lndaries that
r -= Jxz+y2 (91)
occur in overlap areas are removed. For
example. areas within IO kllll of hazardous
waste sites may be identified by creating a
buffer layer. We may have .a data layer in
where r is 1he buffer distance. The more which hazardous waste sites are repre.
general equation for a circle with a center sented as points (Figure 9-300). A circle
at x 1, Y!· is: with a 10 km radius is drawn around each
point \Vhen two or more drcles overlap,
imernal boundaries are dissolved. resulting
in ooocirc\llar polygons (Figure 9-30b).
r = J<x - x 1) ' • (y - y 1) ' (92)
More complex buffering methods may
be applied. These methods may identify
buffer are.as by the number of fealures
Equation (9.2) reduces to equation (9.1) at within the given buffer distance. or apply
the origin, wl1ere xl • O. and y1 • 0. The variable buffer distances depending on the
general equation creates a circle centered characteristics of the inpul feanires. \Ve
on the coordinates xi. yl. with a buffer dis- may bt imeres1ed in areas !bat are near
tance equal to the radius. r. Point buffers multiple hazardous waste sites. These areas
are created by applying this circle equation near multiple hazardous sites may entail
successively to each point feature in a da1a added risk and therefore require spe<:ial
monitoring or creabnent. \Ve may be man·
da1ed to identify all areas within a buffer
392 GIS Fundamentals
00
cs
•
•
c) compound buffer.
overlap identified
08
~()
~Q0
Figul"f' SI.JO: \iu:iouii: lypci of poi.in bufftli. Si.mpfc: buffen diuokc al'C:U nc-.11r 1:nubiplc fc:~llltCS., mon:
c01nplcic butfcr5 do oot. Mul1iring b~1ffm pro,;dc disiauc:c-dcfincd 1:oocs aro1,1od each fc:"'1.irc.
distance of a hazardous waste site, and 1he uation. intem1ediate distance require reme-
number ofsites. In most applications, most diation, and areas farther away require
of the dangerous areas will oo close to one monitoring. These zones may be defined
hazardous waste site. but some wil I be by nested buffe~.
close to two, three. or more sites. The sim· Buffering on vector line a11d polygon
pie buffer. described above, will not pro- data is also quite coaU11on. l1le· fonnation
vide the required information. of line buffers may be envisioned as a
A buffering variant. referred to here as sequence of steps. Fi~l. circles are created
compound buffering. provides the needed 1hat are centered at each node or venex
infonnation. Compow1d buffers maintain (Figure 9-3 t ). Tangen! lines are then gener-
all overlapping boundaries (Figure 9-30c). ated. These lines are parallel to the input
All circles defined by the fc<ed radius buf. feature lines and 1angem to the circles that
fer distance are generated. 111ese circles are are centered at each node or verte.'<. TI1e
then inte~ected to fom1 a planar graph. For tangent lines and circles are joined and
each area. an attribute is created 1.hat imerior circle scgmcms dissolved.
records the number of features within the Variable distauce buffers (Figure 9-
specified buffer distance. 32) are another common variam ofvector
Nesred (or m,ultiring) buffering is buffering. As indicated by the 11ame. the
another common bufforing varianl (Figure bufferdistance is variable. and may change
9-30d). We may require buffers at multiple among feamres. The buffer distance may
distances. in our hazardous waste site increase in ste1>s: for example. we may
example. suppose threshold levels have have one buffer distance for a given set of
been established with various actions feamres. and a different buffer distance for
required for each direshold. Areas very the remaining fean,res. ln contrast. the buf.
close to hazardous waste sites require evac-
Chapter 9: Basic Spatial Analyses 393
r
1<:J-- ~.)
I / ~
or
Figure 9-Jl illustrates the creation of
buffers around a river network. where dis-
tance varies by river size. The increase in
distance may be motivated by an iocreased
likelihood of flooding dowinstream or an
vtrttx increased sensitivity to pollution. \Ve may
specify a buffer distance of 50 km for small
rivers.. 75 km for i111em)ediate size rivers1
and I00 km for large rivers. There are
x many otlter instances when variable dis-
tance bulfers are required. for e.xample. for
Fia:ort 9.JJ: The c«a1ioo ofa lioc buffeut a road noise, smoke stacks, or landfilJs.
(L'Ccd distlUIC~ r.
The variable bulfer dis·iance is often
fer distance oiay vary smoothly: for exam- specified by an attribute in the input data
ple, tlte buffer distance around a city may layer (Figure 9-32). A portcon oflhe anri·
be a function of the population density in bme table for the river data, layer is shown.
the clty. The attribute table contains the river name
There are many instances for which we in nvtrjde11tifier and the buffer dislance is
may require a variable distance buffer. \Ve stored in buffchst. The anribt.ne t:iuftdist is
may wish to :specify a larger buffer zone accessed during buffer creation, and the
around iarge fuel storage facilities when silt ofth~ buffer adjustoo auto,ttaticaUy for
compared 10 smaller fuel storage facilities. each line segment. Note bow the buffer size
depends on the value in buffd!st.
nver.Jcfentifier buffdlsr
mlsslss,:ppt 100
m,ssoun 50
orkonsos 50
Ohio 75
tennessee 75
\, st. croix
Illinois
75
75
wisconsm 75
.-1,u1't 9.JZ; An iUmtrntioo o(a \'ariablc distance bufftt. A line buffet" is sb0\\1) with .i \-ariabk buffer dis,.
taucc based Oil a nverJde:t'4'11\.e:r. A ,'3.nabk buffer distaooc. b\lff'dtsr. i.upccificd in a cable aud applied for
each ri,·cr sc,i11<u1.
394 GIS Fundamentals
Overlay
Ove1·/ay opernrfo,,s are powerfol spa- An overlay operation req1ures tha1 data
tiaJ analysis tools. and were ao i_mportaot layers use a common coordinate system.
driving force behind !he development of Overlay uses tl1e coordinates that define
GIS technologies. Overlays involve com- each spa1ial feature to combine· the data
bining spatial and an:ribute data from two from the input dala layers. The coordinates
or more spatial data layers, and they are for any point on 1he Eanh depend on lhe
among the mos, conunon and powerful coordinate system used {Cbapcer 3). If the
spalial data operations (Figure 9-33). Many coordinate systems used in the various lay-
problems require the overlay of themat:i· ers are not exactly the same. the features in
cally different data. For example, we may the da,a layers will no, align correctly.
wish to know wb.ere there are inexpensive Overlay may be viewed as the venkal
houses in good school districts. where stacking and merger of spaiial data (Figure
whale feeding grou!lds overlap " 1tl1 pro- 9-33). Features in each data l•}'et are set
posed oil drilling areas. or 1he locaiion of one "on top.. another, and the points. lines.
fam1fields lhat are on highly erodible soils. or area feamre boundaries are merged it110
In 1he latter example. a soils da1a layer may a single data layer. The attribute data are
be used 10 identify highly erodible soils. also combined so that the new data layer
and a current land use layer may be used to it1cludes the infonnaiion contained in each
idemify lhe 10<:atious of farm fields. The input data layer.
boundaries of erodible soils will not coin-
cide with I.he boundaries of the fann fields
in mos1 instances. so tl1ese soils and land
use data mus, somehow be combined.
Overlay is the primary means of providing
this oombil1a1 ion.
cttribu'ru t« ~ B
I I
•
ovvloyo11nbures eombll'led
ottrib.ltc.s for bye.rs A d.t B
Fla111'f' 9..J.3: Sp:ninl cL11a O\'crlay. Ch•crl2y c:01nbioc:1 bo1h the: coordi1u 1c infonmuion nnd 1hc: atb1"butc:
in£onmtll011 acf'Qfs diffc:reut dala layers..
Chapter 9: Basic Spatial Analyses 395
\__L/J
'
l,
2
I
I 1 I
r
'-
-3
4
overlay. Intersecting lines are split and a idelll'ify the intersections of 1wo networks
node placed a1 ct1.e intersec1ion poin1. Thus such as road and railroads. but ct1ese also
1opology must be recreated iJit is needed in are rare occurrences. Overlays involving
further processin,g. polygons are the most common by far.
Any type ofvec1or feanire may be Overlay 01Hp111 typically takes the !ow-
overlaid with any 01her type of vector fea- e.st dimension of the inputs. Th:is means
lllle, al1hougb some overlay opera1io11s point-in,polygon overlay results in point
provide useful in:fonnation and are per- 0111p111. and line-in-polygon overlay results
fonued infreque11tly. In theoty. points may in line output. This avoids problems when
be overlaid 011 point. line, or polygon fea- multiple lower dimension featUJ'eS intersect
1\lre layers. lines on all three types. and wict, bigher dimension fearures.
polygons on all three rypes. Point-oo-point Figure 9-35 illustrates an instance
or point-on-line overlay rarely results in where multiple points in one layer fall
intersecting fearures. and so they are rarely within a single polygon in an ove.rlay layer.
applied. Line-on-line overlay is sometimes Output attribute data for a feature are a
required. for example. when we wish 10 combination of the input da1a anributes. If
polygons are output (Figure 9-35. right.
f \
ID type nome A
f-
HIC
- x
•
B Ur2 y
1 county Grewe
C YO
z• •
top), there is ambiguity regarding which Overlays that include ru polygon layer
poim attribute data to record. Each point are most conunon. \Ve are often interested
feamre has a value for an atrribute named in die combination of polygon fean1Ces
class. It is not clear which value should be with other polygons. or in finding the coin-
recorded in the output polygon. the class cidence of point or line fe:uures with poly-
value from p,oint A. point S. or point C. gons. What co1m1ies include hazru·dous
\Vhen a poill't layer is output (Figure 9-35. waste site.s? Which neighborhoods does
right. bottom). there is no ambiguity. Each one pass tllfough on Main Street? Where
outpm point "feamre contains the original are d1ere shallow aquifers below corn-
point attribute information, plus the input fields? All these examples :involve the
polygon feature attributes. overlay of area foa1Ures. eit11er with other
Ooe me1hod for creating polygon out- area fean,res. or with poim or line features.
put from point-in-polygon overlay involves
recording the attributes for one point Clip, Intersect, and Union: Spe-
selected arbitrarily from the points 1.hat fall cial Cases of Overlay
within a polygon. This is usually not satis-
factory because importam info,mation may There are d,ree conunoo ways overlay
be lost. An alternative involves adding col- operations are applied: as ru clip. an imer-
mnns to the output polygon to preserve section. or a 1mio11.
multiple points per polygon. However. this The basic layer-on-layer combination
would stiU result in some ambiguity. such is the same for all three. Tiiey differ in the
as, what should be the order of duplicate geographic extent for wbicl1 vector data are
attributes? It may also add a substantial recorded, and in how data from the attri·
number of sparsely used items. 1bus bute layers are combined. [ntersectiou ru,d
increasing file siie inefficiently. Forcing union are derived from general set cbeory
the lower order output during overlay operations. 111e intersection operation may
avoids these problems. as shown in the be considered in some ways to be a spatial
lower right of Figure 9-35. AND. while the union openrtion is related 10
Note that the number of artributes in a spatial OR. The clip opem1ion may be
the output layer increases after each over- considered a combination of an intersection
lay. This is illustrated in Figure 9-35, with and an eliminaiion. All three are couunon
the combinarioo of a point and polygon and supponed in some manner as stand-
layer in an overlay. The ou1put point attri- alone functions by mosl G[S software
bute table shown in the Jower right portion packages.
of the figure contains four items. This Out· A clip may be considered a "cookie.
pm attribute ·rable is a composite oftbe curter" overlay (Figure 9-3"6). A bounding
input attribute tables. polygon layer is used to define the areas for
Large attribute tables may resuh if which features will be output. ·n1is bound-
overlay operations are used to combine ing polygon layer defines t!,e clipping
many data layers. When the output from au region. Point. line, or polygon data in a sec-
overlay process is in tum used as an input ond layer are ''dipped'' with the bounding
for a subsequent overlay, the number of layer. In most versions of the clip func1ion.
attributes in lhe next output layer will usu. the anrib111es for die clippitig layer are no,
ally increase. AS the number of anribmes included in 1he output data layer. A clip is
grows. tables may become unwieldy. and most often used when sub-setting data geo-
we often delete redundant anribu1es. graphically. to reduce data ·volumes when
working on a small area included in larger
da1a layers. A city manager may only wish
die set of streets within d1eir city boundar-
ies. clipped from a statewide roads layer.
398 GIS Fundamentals
Clip
Catchments by
Counties
;~~ ~.,,)-....p,,..,-.,r::::r...,,_.-....J."'9,t'-'
.,.,...,,,..._.
Figu1'f' 9-36: A clip is a common ,·ariatiou of OY«la)' operatioos. Titc dip prcs«vcs iufonnatioa only from
the dipped (or ta'lc:t) data fayer and oo)y (or the: atta of the <:lippi11~ (or bom1ditt3 laycr). The attn"butc table
of tltc: outJ>Ut L,ycr 1ypic:illy c:onll\izl8 n.11 tltc :'l.llribufcs of the dipped byer(b~. Catduncntt), 11.11d none:
from the: clipping ~ycr(Coi;ntic,).
ln our example shown in Figure 9-36. Users should be certain 1hat transferred
the bounding or clipping data layer consists variables s1ill have valid values after a clip.
of seven counry polygons. and tl1e target or If an area field is included in the input
clipped data laye.r contains many small layer, rite value may be wroog if il is not re-
catcluneol bounctaries. The presence of calculated. Other area-based values may
polygon auribmes in tl1e bounding layer is also be in error. e.g.. a polygon densi1y or
indicated by the different shades for the counis of included features. Si!lce software
different county polygons. The output from defaults vary, tl1e behavior of the specific
100 clip consists of !hose portions of catch· software 1001should be idemified.
ments within tbe clip layer boundary. Note
that 1he clip layer boundaries, here coun·
ties. are 001 included in the ourput data
layer. Also note that only the anributes for
the clipped catchment layer are output.
Chapter 9: Basic Spatial Analyses 399
given county value will usualJy not give an Note that there are often many null or
accurate coumy area. Each output variable empty attribute values in unio11 output lay·
should be scrutinized and the value origin ers. Data in non.overlapping areas are
and contents clearly understood. absent and so cannot be assigned. e.g ... out.
A 1111io11 is a~0ther kind of overlay. It side the county layer botwds but within o,e
retains all data from both the bounding and waiersbed layer bounds in Figure 9-38. The
data layers (Figure 9-38). No geographic presence or null values may alter subse.
data are discarded in the union operation. quent operations.
and corresponding attribute data are saved Many software packages support addi-
for all regions. New polygons are formed tional variants of overlay operations. Some
by the combination ofcoordinate data from suppon an Erase or similarly-11ame<I func-
each data layer. tion. whkh is the complement lo the cl.ip
function. In au Erase function. .areas cov-
ered by the input layer are "cut out.. or
Union
Catchments
ond Counties
fi2ul'(' 9-J3: A unic:m opernlion. Oa12 from two !Ayc:n a.R" combined. iuchidiog •my arens tbal 1UC conl~incd
by one b\11 not botl1 da1a layers. No areas nor attribute data arc discatdcd. or"cl~ ·· as in tl.c in1cl'k'C,lio11
lllld dip opcr1uio1U. Bla:11k 01 null \'lllucs 11tc 1ypically assijl)td in the ntc:u conuuned in 011c buc 1101 dtc o,bct
L,ycr, c:.g., in the c:x:unplc where tlte count>' 1$ Ii.tied as ..Unkr1own.••
Chapter 9: Basic Spatial Analyses 401
erased from the bounding layer (Figure 9- The Erase function is particularly us~
39). Ern.sures may cut existing polygons ful when updating a portion of a data layer.
apart or. where there are coincident lines in in that old. 0111 of date. poorer quality. or
the two data layers. may preserve the edge otherwise inferior data may be clipped out
of existing polygons. In some versions. of a section and newer data substituted.
there is a tolerance distance that aUows for Erase is also often usel'ul in spatial analyses
lines that are.n 'I exactly coincident in dif- that include criteria specifying areas that
ferent layers. but are meant 10 be. 10 be rep- are greater than some distance from a set of
resented only once in the 0111p111. This fcamres. A buffer firnc1ion identifies areas
tolerance distance effectively serves as a that are less than ,he target distance, and
snapping distance. and moves the vertices ct,ese amy then be removed from consider-
in one layer on a nearly coincidc!ll edge 10 ation using au erase openuion.
ma1ch vertices in the other layer. As with
snapping during digitization or other over-
lays. 11\is may help reduce i11corrc;:1or
unwanted geometries.
Counties Catchment
. --
FJantt 9~39: An c:(,implc of n.a Emsc opcmtion. Fcnhu·a in i n inpu1 l~>'tt lll'C ·•emscd" Mscd on the
O\ltCT boooda:ry of nn fflliiog pc:,lygon. Thi, Oj>ffi11io11 is oftm u;ed in cdi1illg or c-.i.rtogn.pbl~ models
that ~ify a.rca mno,·al ba;¢d oo a buffc~ layer.
402 GIS Fundamentals
common
boundary
..-------:: slivers
/
I
l ;::i
'
' 1
.. .. --,
\
......,,
"· -...
I ··--. .
. .....
I --"-
1-i
I I
Fla:ui·t 9•.ao: Slh·c t pol)·goru may occur wbc11 n,·o rcpr«cntatfon<t of ll (canttt arc combin«l. A couw.tou
b<>wul:uv bcn\'ccn 1\\'0 fc::inarcs htts been da\\'cd from diffcrclll M)\U'Ccll. The: rcprc:Scttll'Uion1 clifftt
5ligbdy. This results io s111~ll g~ps and ••4,lfrcr'" polygons along the margin between th«c two layers.
Chapter 9: Basic Spatial Analyses 403
high. The differences are shown in the A third method for sliv,er reduction
larger-scale uiset in Figure 9-40. When the involves defining a snap distance durillg
couory ao<I s1ate data layers ate overlaid. overlay. Much as with a snap distance used
many small polygons are formed along 1he during dala developmenl (described in
boundary. These polygons ate quite small. Chapter 4). this forces nodes or lines to be
but they are often quite numerous. coincident if they are widilo a specified
These .. s.liver.. polygons cause prob- proximity during overlay. As with data
lems because there is an entry in the anri- entry. this soap distance should be small
bute table for each polygon. One-half or relative to the spatial accur.acy oftlte inpm
more of the polygons in the output data layers and the required accuracy of the out-
layer may be these slivers. Slivers are a put data layers. lf the two representations
burden because they take up space in the of a line are within the snap distance then
attribute table bul are not of any interest or there will be uo sliver polygons. ln prac-
use. Analyses of large data sets are hill· tice. not alJ sliver polygons are removed.
dered because all selections. sorts. or other bm their numbers are subsrantially
operations must treat all polygons. includ- reduced thereby reducing the time spent
ing the slivers. Processing times often on mmmal editing.
increase exponentially with the number of Automatic sliver detection and
polygons. removal should be applied carefully, as
There are several methods 10 reduce t11ey may delete valuable data. Only small
the occurrence of these slivers. One identi- slivers should be removed. with small
defined as smaller than an area, length, or
fies all couuuon boundaries across differ-
ent layers. Toe boundary with the highest width tltat is wonh tracking for a given
coordinate accuracy is substituted into all problem or analysis. This distance may be
other dara layers. replaci,1g the less aau. set by the accuracy of tl1e dara collection,
rate represe1nations. This involves consid- or by the requirements of t11e analysis. lf
erable editing. and most common wbe-n polygon edge locations are only digitized
developing n.ew data layers. to within one merer of Iheir true position, ii
makes little sense to maintain polygons that
Another method involves manually are less than a meter in any dimension.
identifying and removing slivers. Small However. if slivers are removed that are
polygons may be selected. or polygons substantially wider and longer than a meter,
with two boumding arcs. couuuon for sliver some valuable information may be lost.
polygons. Bounding lines may theo be
adjusted or removed. However. manual
removal is not practical for many data sets
due to the high number of sliver polygons.
404 GIS Fundamentals
~rA ~B
Ge~o:ua
A 8 8 l l l
A2 8l a,
-
A 8 8 l l
A A a l AZ , ,.l Bl
- ·
Attrb.t< dc.t~ At'Ta..•c dcta
• ·--
'""
A
~
c......,
:J)
)
.......,
--
l.......,
Ffl'"'
]
Chapter 9: Basic Spatial Analyses 405
be used. This causes a doubling in the out· a nearest neighbor, bilinear interpolation,
put file size. 1\vo by1es wilJ hold more than cubic convolution. or some olher resam·
65.500 unique combinations: if more c.ate- piing fonnula 10 crea1e a new layer based
gories are required, then four bytes per cell on Loyer 2 but compatible with Loyer" 1.
are often use<!.
Raster overlay requires that the input
ras,er s~tems be compatible. This rypi·
c.ally means Ibey should have 1be same cell Loye,..r-'1- ~ - - - - - - ~
dimension and coordinate system. includ-
ing the same origin forx and y coordinates.
Jf1bc cell sizes differ, there will Likely be \
cells in one layer that match pans of sev- J.- •
eral cells in (he second inpu1 layer (Figure
9-42). This may resull in ambigui1y when
defining 1.he inplll attribule value. Overlay
may work if'the cells are integer multiples
with the sam,e origin. for example. the
boundaries of a I by I meter raster layer
may be set to coincide with a 3 by 3 raster
ceuf]
layer: however Utis rarely hap~,is. Oa1a
are norn11lly conver1ed to compatible raster
layers before overlay. This is most often
done using a resampling. as described in Fiturt 9-42: Ow:rlaid raster 1aycn should b¢
Chap1er 4. Io our example. we migl11 «>1np111ible 10 cim1tc uurunbipous ovc1•fay. In
choose to resample Loy,:r 2 to match Loyer tbc O\'~rl11v depic1cd 11crc it is tlOt dear w!1ich
c:<tl, troro•Loyer 2 ,liould be c:orubiocd ";lb ,d A
I in cell size and orien1a1io11. Values for iu Layu- I.
cells in Loyer 2 would be combined through
406 GIS Fundamentals
An Example Spatial Analysis set does not exist, but there are point obser·
vations from a ue1wotk of weat!1et statfons.
Fig,1re 9-43 and the following figures I might use interpolation or other methods
briefly illustrate an application of basic (described in Chapter 12) 10 estimate wind
spatial analysis. \Ve seek to identify suit- s~d across 1he study region. These con-
able areas for wi11d farms. based on two siderations highlight ru1 important early
criteria: areas with high average wind step in spalial analysis: we must assess the
speeds and low population density. High available data. and determine if it is appro-
average winds are preferred because the priate for our intended analysis. If no1. we
energy produced at a site increases with must create the required data or modify our
wind speed. Low population densities are analysis.
preferred because land is less expeusive
and tl1ere are fe1ver neigl1bors to bother. For this example. wind data were
This simple example does not include obvi- obtained from the U.S. Department of
ous additional factors. such as the distance Energy. and population data from 1he U.S.
10 power Lines. avoiding pro1ec1ed lands. or Census Bureau. \Vind data were reclassed
the difficulties of building offsl1ore vs. to !hose ,-.lues (4 or greater) toot provided
onshore. but it does illustrate how data may suitable potemial energy (Figure 9-43). We
be combined iu a set of simple spatial ti.mc- miglll represent this graphically as shown
tions to answer a question. Trus analysis in Figure 944. where the input and output
requires wind data of appropriate accuracy. layers are shown as boxes. and the spatial
spatial extent. and appropriately stmuna- operation noted inside au ellipse. Arrows
rized. For examp le. I might wish to base show the direction or flow of tbe analysis.
my anaJysis on average daily wind speed. The categories used in the recla.ssilication
or maximum hourly wind speed for a day. should be based on prior knowledge: here.
or maximum daily wind speed. If these we might know that wind levels below lhe
data do nol exisl.. l would need lo eilher category 4 are unsuitable for the wind tur-
change the problem formulation. my analy- bines to be used. In general. thresholds or
sis methods. or develop the data from exist-
ing sources. For example. if a gridded dala
. :,.1• \
. Recloss
Wind
Class
) density suitably calculated. for example. in
persons per square mile. If.not. this calcula-
tion will have to be perfonned prior to the
rec lassiticatiou.
Census doto
Recloss
Flflut'f 9--47: These i.utamcdi.atc la)'C19 :II'<' combined in au o,·crlay opcratiou., aud then :ucas
sdcc1cd b:olsc:d on critcrin for c1ch t'dUlt:u:it grogmphic unil
Chapter 9: Basic Spatial Analyses 409
Wind Both
Closs Closs
Recloss
windohigh and
popdens=low -+ suit=high
\
Popula-
t ion Suit-
Closs able
Sites
Fi&urt 9--f8: A graphic rcprekntation of the final li1Cp$ in d~ c::-camplc spari11.I ao11J)'li._ Thi, ,-,phic
cxparukon F:igurc 947 to explicitly show the ovc:rla)' a,Ki rccWsification oprn1ti.OM. and the m1nmcdi111c
lnyet ~uircd couccpnssUy. and when usinj most currcrtd)' a\':tilablc software.
410 GIS Fundamentals
Network Analysis
Nerworks are common in our lives. accumulation or resources along links and
Roads. powe.rlines. telephone and television between the connec.ted centers. Resources
cables, and water distribution systems are all flow 10 and from the center.I through tl1e net-
examples or networks we milizc many times works. In addition. resources may be gener-
each day (Figure 9-49). Networks must be ated or absorbed by the links 1hemselves.
effectively managed because they are crucial The links 1ho1 form 1he networks may
10 civiliza1ion and represcm subs1an1ial have attributes that affect the flow. For
investments. Spatial analysis 1ools have been example, J~re may be links tha1 slow or
developed 10 help us design. use, and main- speed up !he flow or resources. or a link may
tain necworks. allow resources to flow in only one direc-
A 11etu'Ork may be defined as a set of tion. Link attributes are used to anode! flow
connec1ed feamres. often 1ermed 11odes or charac1eris1ics or !he real network: for exam-
centers. These features may be cemers of ple. 1ravel on some roads is slower ~tan oth·
demand, centers of supply, or both (Figure 9- ers.. or cars may legally move in only one
50). Centers are c·onnected 10 at least one direction on a one-way street
and possibly many 11en,-.,·k li11ks. Links The concept of a rrnush cost is key to
interconnect and provide paths between cen. many network analysis problems. A transit
ters. Traveling from one center 10 another cos, reflec1s the price one pays to move a
often requires traversing many separate resourc.e lluough a segment of tbe network
links. Tra11si1cos1s are 1ypically measured in lime.
Network analyses. also known as net- distance. or mone1ary units: for example. i1
work models, are used 10 represent and ana· costs 10 seconds 10 travel through a link.
lyze the cost, time, delivery, and Costs may be constant such that it always
Fizurf' 9-49; Nctv•orks in 8 OIS arc used to rcprC5Ca1 ronds. pipe-lines. power lnt.iismissiou grids. rivers,
aod 01.bcr conncc-tced systems tJtr0t1Jb which important rcsou.rc:es flow.
Chap1er 9: Basic Spatial Analyses 411
takes 10 seconds to tnwerse the link regard- within a network, including starting and end·
less of direc1ion or 1ime of day. Al1ema- ing centers. These cen1ers must alJ be visited
1ively. coses may vary by lime of day or by traversing 1he ne~vork. There are usually
direction, so it may take 15 seconds to tra- a very large number of altemalive routes, or
verse an arc during morning and eve,1ing pa1bways. lhac may be used 10 visil all cen-
rush hours. bm IOseconds odierwise. or ii 1ers. The bes1 rou1e is selecced based ou
may t-ake twice as long to travel north to som-e criteria. usually the sltortest. quickest.
south !ban 10 travel sou1b 10 north. or lease cosily rou1e. Funberrestric1ious may
We will discuss lhree types of problems
be placed on 1be ro111e: for example. 1be
that are commonJy analyzed using ne1works: order in which centers are visited may be
rome selec1ion. resource and 1erri1ory alloca- specified.
tion. and traffic modeling. There are many Rome selection may be used 10 improve
types of net,vorking problems: however, the movement of public tra11sponation
1hese tl1ree are among 1he 111os1 common and 1brough a necwork. School buses are often
provide an indication of the melhods and routed using network analyses. Each bus
breadth of ne•twork analyses. muSI s1ar1 and finish al a school (a cen1er)
Route selection involves identifying a and pick up children ac a nu.mber of s1ops
"besf' route based on a specified sel of crite- (also centers). The shor1es1 pub or 1iroe
ria. Rou1e selec1ion is often applied 10 find route may be specified. Alternate routes are
the least costly route that visits a number of analyzed and 1he "bes1" rou1e selec1ed.
centers. Two or more cente.rs are identified
network links
• centers
Transit cost
from A to B
along thi-s link
is 12.4 units. \
All links have
an associated ·- -;-----..
transit cost. A 8
Fil""' 9..50: Bnic: nctwori: d cmctlU. Co tten ~re c:onn«tcd by II sc:t oflitil..'S. Costs ni.\y be .11uoci.,1c:d
"iili tra\·miug the links. Network 1m.alysii typi(',111)' in\·oh-c, ruo\"Ulg tuOI.IK" or dcmnncb among
c:c:ntm.
41 2 GIS Fundamentals
Selection of the best roule iuvol\les an are examined. and the link added that gives
algorithm 11la1 reeurs.ively follows a leas1- the lowes1101al path length. ln F.igure 9-52b.
cos1 se1 of arcs. begiruting ai the current two links are added. Note 1ba1 tbe liul<s
node. A set of interconnected network links added are not connected 10 the initially
is identifie<I. as ,vell as siart and desiination selec1ed link. Tliis would have given a total
cemers (Figure 9-51). The roote from s1ar110 cost or 14 (6 pius S) or 21 (6 phis IS). while
destinalion locations is typic.ally built itera- the selected Jinks give a lower path cost of
1ively. One route finding algorilhm adds the 12. Now. the candidate links are those con-
leas1-cos1 link at each siep. Mulriple paths necied to any of the selected Jinks or 10 the
are tested ullfil a path connects the start and start point. Since all links from the start
des1ination centers. poin1 have been selecte<I. only those con-
This simple 1,uethod begins at the stan nected to candidate Jinks are examine<!. Of
ceuier. Paths are exiende<I by adding the link these. lhe loweSl cost path is adde<I. The link
that gjves 1be lowes1101al cost for all paths with a cost of 8 that is auached 10 1he ini-
currently pursued. The initial set of candi- tially selected link is c-bosen (Figure 9-52c).
date links consists of all those connecting to The candidate sel expands accordingly. and
1be staning poim. The lowest cost Link is is evaluated again. Verify that 1be links
added. as shown in Figure 9-520. The link shown in Figure 9-52d and Figure 9-52e
with a va]ue of six is chosen. Now the set of should be the next. cumulative l<>w cost
candidate links consists of any Link con- paths selected. This method is used unliLlhe
necre<l 10 this selecte<l link (the rwo links des1ina1ion is reached. and 1he leas1-cos1
with cosls of 15 a.nd S. respectively), plus path identifie<I (Figure 9-53).
any cowiected 10 •he s1aning point. All paths
destination
• 6 7
•
3 13
12 5
11
2
• 5
15 3
7 7
II 9
11
13 12
'
15 8 I 7
9 travel cost 8
6
~
10 12 12
start
Fizune 9+~1: Ao aam1>lc network. S1.t11 11nd di:iti.n111ion cculc" 1uid costs for link 1nt\"C~I
lll'C , bon"tL
Chapter 9: Basic Spatial Analyses 413
d) e) - --'4'--~
8 8
8 8 6
6
12 12 12 12
destination
'
••
'
.--··- ---- 6 7
4
'
'' 3 12 13
' 5
'' 11
'•
-.. -... ---·-- . .
4
5
2
•' 15 3
'
' 7
11 '' 7 9
'
'
' 11
13 ''' 4 12
- ---
' - - ..... -
' 15 8 I 7
''
9 •' 8
•' 6
••
10 ••' 12 12
start
Fip1'f' 9-SJ: l<:a!;l-CO$l path (or the cxrunplc route finding aJioritJuu described in die text.
414 GIS Fundamentals
Many different pathfindiug algorithms resource allocation. in that all routes are pur·
have been develope<I, most ofwbicb are sued. DOI just UIC least-cost roote. The romcs
much more sophisticated thao the one are measured outward from eacb allocauon
described above. :Note thal the described center (Figure 9-54).
pathfinding algor;tlun bas a rapidly expand- Variations on resource allocation
ing munber of lin:ks 10 evaluate at each step. include setting a center capacity: The center
Computational burdens increase accord- capacity se1s an upper limit on _resourc_es tliat
ingly. A subset of all possible candidate may be encompassed by a terruory. Lmks
paths may be. examined because it_becomes are assigned to the nearest center. but once
too computauonally llme-consummg to . the capaci1y is reache<l, no more are added.
examuie au possilble paths. Most pathfinding Max.itnum distance also serves t-0 limit the
algoritluns periodically review the total range of the territory from the ~enter. Both
accumulated cost for each candidate path. of these restrictions may result m some lut3S·
and stop following some highest cost or least signed areas. that is. ponions of the ne1work
promising paths. that are not allocated 10 acei.11er.
There are many variations on this route Resource alloca1ion analyses are use<I in
fotding problem. 11,ere may be multiple cen- many disciplines. School dimicls may use
ters that must be visited iu a specific order. resource allocation to assign neighborhoods
and c.arriers defined to transport specific 10 schools. Tl>e type and munber of dwell-
amoums to or from centers. Centers may add ings in a district may be included as nodes
to or subtract from a carrier: for example. on a network. The munber of ch ildren along
some centers might represenl houses with each link is added umil the school capaci1y is
cbildre1t other centers may represent reached. Resource allocation 11~1y also be
schools. and carriers represent buses that used to define sales terril'ories. or to de-ter.
trausport children. House'l must be visited to i!lifie if a iltW business s~ould i;.e loeared
pick up children. lbut a bus bas a fLxed capac- between existing businesses. IfenougJ,.cus-
ity. These children must be. 1ransporte~ to the tomers fall betwee11the temtones of ex1stmg
school, and there. may be time constramts: business c.enters. a new business between
for example, children cannot be picked up existing business centers may be justified.
before 7 a.m. and mus1 be at school by 7:55
a.m. Nerwork·based route selection basbeen Tmf!ic modeling is anou,er oft-applied
successfully used to solve 1bese and related fonn of network analysis. Streers are repre·
problems. sented by a network of interconuected mes
and nodes.Attributes associated with arcs
Resourt:e a/locario11 problems involve define travel spee<I and directio11. Anributes
the apportiorunen1 of a network to cemers. associated wi1b nodes iden1ify n1ms and 1be
One or more allocation centers are defined in time or cost required foreach tum. Illegal .or
a network. Territories are defined for each of
these centers. Territories encompass links or impossible turns may be modeled by Sf<Ct·
fying an infinite cost Traffic is placed m 1he
non-allocation centers in 1be network. These network. and movement modele-d. Boule.
links or non.allocation centers are assigned
necks. transit times. and underused routes
to only one allocation center. The features may be identifie<l. and this infomiation use<I
are usually assigned 10 1he nearest center. to improve traffic management or bmld
where distance is measured in time. length, additional roads.
or monetary units..
Traffic modeling througb networks is a
Resource allocation algorithms may be subdiscipliue in il:s own right. o·ue to the
similar 10 rou1e finding algorithms in that tlie cos1 and importance of1ra1isponation and
distance om from each center is calct1lated traffic management. a great deal of emphasis
along each path. Each center or arc is bas been placed on efficient traffic manag·
assigned to 1he nearest or least.cost center. men1. Transp:>nation eugit1eers, compmer
The route finding method is exhatistive in
Chapter 9: Basic Spatial Analyses 415
o allocation center
~--r--c-·- T--T·- .j[--T"---r-T--?"---"'r=-=-=
I\
11
1,
I'
ri t-- -1 :ri:--,
, 'J..''
••
~--rL.Jt.-:..'"!-~ .._
'
~
~ -I
- --
-- I I
t
I
I
-
II J \ ...,,I f I ! ~
I
I
<.::;
..._.f
-l
'f'-+- ... ---,
l_J, '\ II
' I
!) ',~---' i.-
-
, ---~ --•
I
1 •
II
I
I
rx--- _,...._,i J J
I'
·-
I
I
I""'
\
t r--+-. I
._1-
I
I
f""'•
",, - -..
-
~
....
- -
I 1 I I I
t I I I / ,.., I I
---+--+-----1-.( - ., ~ - · -
I I -"' .,,.
• ' I ~ ~
..... ......
•
- =
1
= =;
' <
I' r LJ ( ·~i D
- ~ ~•
fla,urt 9-5.,: Alloc:uiou of uctwork linla 10 distinc1 ccntm. NctWOl'k liub orrcsour«s an: ani8J1<d to 1.bc
"nct1n:S1.. "nla, whm dirial'I"' ,n.,y ~ d,1,nni11-cd b)' pb)'Sinl di.11autc1 or by 1:0111,, ln\\'cl 1it11t, or ao,n,
0 1hcr l.,ctor.
scientists. and mathematicians have been found in literature listed at the end of this
modeling traffic via networks for many chapter.
yeors. An in-depth discussion of network
analyses for traffic management may be
416 GIS Fundamentals
(321-301)/(359-JOI) • 0.34
b)
.. • -!
.,,
978
995
••
t007
\026
Iii
IOJJ
• •
!064
\07\
'i
fl&lll't 9..S6; Jdio~yncralie dcvclop11i.<n1 Utaf fC$Uh iu oon-li.ncar addrC$$ k9U¢n«s. on lhc growtd.
so mon may r«ull wbc-n i c«odina: is apphed. Gcocodinjt is t)'picalt)' applied witb 31111.m uuptioo
or l\ linc;1r duan'bution o{~ddrcUcs :ttroH II r:ansc. When thil i.s 1101 tmc, llt illw1r11ted wi1h "ddrcu
2SO io part o. abon. or 1026 in pm b. g~odcd loc.ationt: will be in error.
may apply set or Boolean algebra. and may ne<:ted set of linear and poim fearures. Traf-
combine these with analyses of adjacency. fic flow. oil a.nd gas deUvery, or electrical
connectivity, or containment. A selected set ne1works are examples of feacur,es analyzed
may be classified in that variables may be and managed with ne1work mod!els. Route
changed or new variables added that reOec1 fiuding. allocation. and Oow are commonly
membership in rbe selected set. modeled in networks.
Classificatioru may be assigned auto- Geocoding. or linear referencing. is
matically. but the user should be careful in used to calculate approximate l,oc.ations
choosing the assignment. EquaJ.area. along a linear segmem when tlte endpoint
equal-interval, ai1d natural breaks classifi- addresses are known. Often 1iseo in census
cat.ions are often used. Tbe nature of result- and delivery applications, geocc.xling works
ing classifkation.s may depend best when addresses are uniformly spaced
subs1amially on lhe frequency histogram of across the segment. Because it is an
tbe inpm data la}'er, panicularly when out- approximation. geocoded locations are
liers are present. expected co sometimes be in error. and
1hese errors are often more freq\lem in mral
A dissolve operation is often used in
or sparsely addressed segments. Linear ref-
spatial analysis. Dissolves are routinely erencing may also be used to locate
applied after a classification. as they changes in linear feamre characteristics. for
remove rcdundallt boundaries 1hat may example. road surface or accidem loca-
slow processing. tions.
Proximity fu11clions and buffers are
also c.ommonly applied spatial data opera-
tions. TI1ese functions answer questions
regarding distance and separation among
Chapter 9: Basic Spatial Analyses 419
Suggested Reading
Ahuja, R.K... Magnanti. T.L., Orlin, J.B. (1993). Neh,·01* Floll's: nieo,y, Algorit/111,s,
a11d Applicatio11s. Englewood Cliffs: Premice Hall.
Steinitz. C.. Jordan. L. (1976). Hand-drawn overlays: Their history and perspective
uses. landscape Arcldtectul'e. 56; l46-157.
Stillwdl. J.A., Clarite, G. (2004). Applied GIS a11d Spatial A11a(vsis. New York:
Wdey.
Worboys. M.F•• Duckbam. M. (2004). GIS: A Comp11ti11g Perspecti,·e (2nd ed.). Boca
Raton: CRC Press.
Chapter 9: Basic Spatial Analyses 421
Study Questions
9..1 - Deline and give examples of local, neighborhood, and global spatial operations.
9.4 - Write the simplest Boolean expressions that result in the grey area, selections:
o) b)
A 8
c) d)
A 8
c
422 GIS Fundamentals
9.5 - Write the simplest Boolean expressions that result in the grey are.a selection:
o) b)
A B A B
c c
C) A d) A
8
8
c c
Chapter 9: Basic Spatial Analyses 423
ID cotg.
l A
recloss 2 A
3 8
5
~ A
9
3 5 8
6 6 8
7 A
B A
9 c
10 c
Recloss by ID
,,,..,-- to assign Type:----
JD Tvne
1 c
z f
3 c
4 d
5 5 d
6 c
7 e
8 ~
424 GIS Fundamentals
9.8 - Reclassify tJ1e following polygons, according ro the colunm Area. into ·small
( <18.000). medium ( 18.000 10 45.000). and large (> 45.000).
ID Arto TY!lt
0 l00,074 low
l J0.2•0 high
2 14,228 hlgh
3 17.770 hlgh
4 43.923 low
5 8.688 hlgh
6 46,169 mid
7 87.915 1ow
8 7.762 high
9 23.318 mid
10 62.201 low
L
II 10d,7d9 mid
12 19.036 mid
ll 15,239 hlgh
14 111.718 low
15 14,85<1 hlgh
16 16.948 hlgh
17 77.985 low
18 8.281 lllg~
19 27.437 mid
9.10 - What is the modifiable area unil problem (MAUP)? Why is it important? Whal
is 1he zone etfec1. and what is the area effect?
9.11 - Whal i.s a dissolve operaiion? What are they iypicaUy used for?
Chapter 9: Basic Spatial Analyses 425
9.12 - Perform a dissolve operation on the variable Type for the layer depicted below:
0 100.074 low
I J0.240
"'"
2
l
14.228
:1.no """
..,h
,• 43923
S.688
low
• 46169
"""
""'
'
8
9
10
87.915
7.76Z
23 318
bZ.iOJ
-"""
low
IOW
104.749
" """
...,..
19036
"
..
13 l~.239
14 m.11e IOW
1.i:.s,, l'li;II'
..
16 lb.9.tll
17 n.985 low
8.281
19 27.43.!. "'"
""'-
9.13 - Perform a dissolve operation on the variable Type for the layer depicted below:
10
0
....
J00.07.C b ...
I JO.Z40 ....
2 14 228 high
11 no high
....
l
• 43.923 low
• high
•
7
"~.169
87915
low
low
• 7,762 high
9
10
23.318
62.201 ""'
IOW
II
12
104 1'9
19.034 ""'
"""
13 15.239 high
.."
16
111.718
14 854
l~.9"48
low
..,h
hi;h
17 n.9U tioh
18 • za1 high
19 27 4J7
""'
426 GIS Fundamentals
9. 14 - Draw th.e resultant polygon boundaries and complete the table in a dissolve
operation that ,calculates the sum of Count. based on Closs. Label each polygon start-
ing lowercase a, b. c.... and enter the label in New!O in rhe outpm table for 11,e coJTe-
sponding row.
JD Closs Counl Type Cos-r
A l _ _ 11 Form 1.000
a z 9 Form 900
c 1 3 Ranch 1.100
0 Suburb
E
1
z '
21 SlJburb
200
700
F z 1, Form 800
I
G I 6 Ranch l ,200
J
NewlO Closs Coum
1
·-- '
Chapter 9: Basic Spatial Analyses 427
9.15 - Draw the resultant polygon boundaries and complete the table in the dissolve
operation that calculates the average of Cost, based ou Type. Label eaclt output poly-
gon staniug lowercase a. b. c .... aod emer the label in NewID in the output table for the
corresponding row.
tD Closs Count Type Cost
-
A I 11 Form 1.000
B 2 9 Form 900
c I 3 Ranch I 100
0
E
F
I
2
2
4
21
14
S\Jburb
-
S\Jb<lrb - 200
700
For~800
c I 6
--Rench -
1200
'
I
New!O Type Cost
9.16 . Select the most appropriate characteristics for the buffer below. Is it simple.
multi-distance, or variable distance? Does it retain or dissolve intersections? Is it
interior or e:<terior?
-t,000
meters
428 GIS Fundamentals
9. 17 - Sketch out the output from a variable distance buffer applied to the set of
points shown below. Draw output buffers that dissolve tl1e boundaries between areas
that fall within multiple buffers.
4
•
ID
I
2
)
•5
<l<,1
100
200
150 -
300
100·
zoom
- 1
6 800 • 5
7 200 •
8 100
9 150
•2
7
•
•s
.6 9
•
9.18 - Sketch out the output from a variable distance buffer applied 10 the set of
points shown below. Dissolve boundaries for imersectiug buffers.
5.
ID distOJ1ce
1 I 250
• •4
2 2 0
7
..3 • 6 3 1500
•
4 500
8 5 0
1.000 •
meters 6 500
7 250
8 250
Chapter 9: Basic Spatial Analyses 429
9.19 - How are raster proximity functions different from vector proximity fimctions?
9.20 . Wby arc o\llp\11 fcan,rcs in vector overlay typically sci 10 chc minimum dimco-
sional order (poinc, line, or polygon) of Che input fcacurcs?
?.21 - Complece the cable for the veccor poinc overlay shown below:
• •
J
A J
•·
• I
•
m
•·
B
•• ,P
• 0
•
0
' • 0 l • ' l ) •
IO Lndcov cou-nty IO rn,me type
A tents, \'10lk1,,,a m Boul(ltt' CZy
B
""""' ""'°'" n EkJHc YoBo
o"" ""'
0 Roe1M!1c
p LelCr
9.22 - Complete the table for the vector poim overlay shown below:
• ,I
A
•· l
,2
)
l
' ·' '
8
c I
,3
.s
• • 0
0 l
' ' ID l
' • 0 I
' •
"
r
IO ,'°"" stts
A c" r,, I ~f1 ~
8
c
WII
"""
.,,,.
leosc '
•'
•
STM<
x<
I f
430 GIS Fundamentals
9.23 - Sketch both tlte output polygons and the resultant attribute table from the over·
lay shown below:
Spo.ficl l)oto
..
l
+
.
Tabks
It> .,.. .,.., . IO ~,...
COi.i'ii')' .storus
.. ; I
I f«
.....
1902
+ ,o Polk ....... •• '
I I
2
I'
9.24 . Sketch both tl1e output polygons and the resultant attribute table from the over·
lay shown below:
Spollol Doto
e
• • •
l
I •
10
"" - ,. V/cM
""' ""'
-
1 1215 E.'Vf
2
--
1863
f---
u ....
• • " """"'
a 2
•••
187
I
i
' 1919
l ' '" ---- ---- --l---·----
e • c.... .?,)12
Chapter 9: Basic Spatial Analyses 431
•
b •
,g
B • • ·h
'
1D 5f01US tonirl
A prlVCI e. ~ ~ense.
8 pub IIC _po,tc
ID
•
•
d
•
• • •
2
' . f •
.
ID t.ndc.ov. county
• """"'
rorn, h ,'Cllo'No
Ln«lln
10
•
b
ID ._
1 E""'
2
""'"' -
1.111r
<
••
[ •• f
t •L
J
[
432 GIS Fundamentals
9.27 - \Vhal is the sliverproblem in vector layeroverlay? How might this problem be
resolved?
9.28 - Sketch t"be output of the raster overlay sbowu below, providing both cell values
and lhe output table with ID and count variables.
Layer I
A A • B x x x x
A A B B w w x x
A c c B w w w x
A c B B w \'/ w w
9.29 - Sketch the outpu1 of the rasieroverlay shown below. providing both cell values
and the output table with ID and coun1 variables.
Loyer I Loyer 2
A B B B x y y y
A B a B x w x x
c c c B w y y y
c c c c w w y y
t A 2 t x 7 I
__ ________
3
2 6
c
7
7
2 w 9
·-- >--
_;_
I
I
I
:
9.30 - Describe/define ne1wark models. What distinguishes them from other spatial
or temporal models?
9.31 - What a.re the common uses for ne1work models? Why are the.se models so
important?
Chapter 9: Basic Spatial Analyses 433
9.32 - Calculate and record the geocoded address for the boxes labeled A, C, E, G, I,
K. M, 0. and Q in the figure below:
Bendigo St
ln<Jloo R
&
~
~~ ~
li\l"
r<
"'"' lo1-
~
§
~ ~ R lS § Yorro ~d
liiJ l!!I m ..l2ll Im Q .iliJ
»
Ill: 1£'
~ OegllbO Rd
~
~
Bendigo 51
9.33 . Use tile figure above to calculate and record the geocoded address for the
boxes lal>eled 8. 0. F. H. J, L. N, P. and R.
434 GIS Fundamentals
9.34 - If you s1an with the layer on the left, and wish to create the layer on tl1e right.
what would be the table you would use for the re<:lass operation at o). and \\1l.at single
spatial operatioo would you use in b) 10 obtain the desired result?
o)
1 3 ReCIO$$
Table
4 b) ope.roflon
5
6
0 b
9.35 - If you s1a11 with the layer on the left, and wish to create the layer on tl1e right,
what would be the table you would use for the reclass operation at o). and ,mat single
spatial operation would you use in b) to obtain the desired result?
o)
1
Recla.ss
Tobie
2
5
b} optrotton
6 8
3 7
9
b
0
d e
435
Introduction
Raster analyses range from the simple to cepts and knows a broad range of raster
the complex. largely due 10 the early inven- analysis methods.
tion. simplicity. and flexibility of1he raster Raster analysis falls into different cate·
data model. Raster cells can store nominal. gories. Map algebra is the fo11nda1iou for
ordinal. or interval/ratio data. representing a many raster workflows. Tb.iis frnmfog in mm
wide range of variables. Complex coostnicts draws on rasteranalyses sp.anning local.
may be btult from raster data, including net- ueigbboruood. zonal. or global operations.
works of c0111nected cells. or groups of cells To these can be added more involved proce-
to fom1 areas.
dures relating 10 bulfering. overlays. and
The Oe:tibility of raster analyses bas cost-surfaces and many otb,ers besides.
been amply demonstrated by the wide range
of problems 1hey help solve. Raster analyses
may predict the fate of pollutants in the
atmosphere. disease spread. animal migra-
1ion. and crop yields. Time varying and wide
area phenomena are often analyzed using
raster data. Raster analyses are applied to a
range of scales. from fine grained problems.
for example. in U.S. EPA analysis of poJ.
hned Supemmd sites to NASA global-scale
estimates of forest growth.. Local, st.ate, and
regional organi2atio11s use raster analyses at
many scales in between (Figure JO· I).
The long history of raster analyses has
yielded tools valuable 10 many GIS users.
Tools often share a concepnml basis and
may be adap,ted to several types of prob·
lems. In addition. specialized raster analysis
methods have been developed for less fre-
quently encollJltered problems. The GJS user Fii:m'I!' 10 1: flood riskmaps.JWO<:!uced ,-ia
4
2x
5 7 2 3 3 5 7 2
-t
9
8
7
8
10
8
7
7
'
J
'
4
•
•
'
3
7
J
3
2
\ 3 ' 2•6
lO
8
'
3
)
2 3 l •
10
18
"
20
•
8
6
12
.6)
,.
2
0
l
2
l
'
3 l
16 16 6 8 6 7 10
-
·3
-
14
16
14
,.
8
8
(o)
8
6
•
4
i 11
8
II
10
(b)
8
6
fi&uff" 10-?: AD aninplc of mter ~!:ions. On 1hc left s:idc{o). cnch iuput ccU is: mul•!l)lii:d
by the value 2 . and the res.uh stored ui the co1l'C6poudi.ug output locatiou. TI.c right side (b} oftl.c
h,utt ilhastttncs lnycr additiou.
Chapter 10: Raster Analyses 437
Local
function
e.g.,
10 12 42
3,() 9 4
· 12 8 !5
plus 4
t• 16 46
34 13 8
· 8 !2 !9
Flaurt· 10.J: Local fu11ctio1u auign oupm ,-alud tbn.1 depend on <btn. 0111)' &om the. co1Topoudi1ij i11ptt1
l~tion.
Neighborhood
function
e.g.•
I( 12 42
30 9
•
•• 8 15
neio"l)o(ho<>d
1lfo.iumum
I
33 •2 •2
30 42 <2
)0 30 !7
f i.a;urt- 10--1: Neighborhood fouctioru use iup,1t ,.1lucs from a foc111I nnd .1dja.ccnt cell, to uk11l.1tc values
for ,be COITCspondiug O\llpul ~IL
438 GIS Fundamentals
it specifies a partlcular cell size and dimen- every cell fora layer, for example. the global
sion. while a vect,or local operation entails mean. maximum. or minimum.
non-uniform areas within most layers. Note that in our examples all have the
NeighborbOO<I operations se1s are also same extent. e.g... LoyerA and LoyerB in figure
more unifonnly defined in ras1er than vector I0-2 cover the same area. TMs may no1
data sets. A raster ne.ighborhood specifies a always be lme. When layer extents differ.
fi,ed number of cells and arraugemen1: for most GIS software will either restrict the
example. 1he neighborhood might be a cell operation to the area where inptu layers
plus the eight surrounding cells. This neigh- overlap, or place a null or a ..missing data''
borhood ltas a unifonn area and dimension. indicator into cells where input data are lack·
Vector neighborhoods depend not only on ing. This number ac1s as a "no d'.aia" flag.
the shape and size of the target fea1t1re.. but indicating there are no results. II is a unique
also on the shape and sizes of adjacent vec- placeholder that indicates no ,-.lid data are
tor feamres. presen1.
Global operations in map algebra may Incompatible rastercell sizes cause
produce unifonn omput, or they may pro- ambigui1ies wbeu combining ras1er layers.
duce different values for each raster cell TI1is problem was described briefly in Chap-
(Figure 10-S). Global operations may return 1er 9 and is illustrated here. The left side of
a single number. lJ)lated in every cell of the Figure 10-6 shows a rnsier mismatch. Sev-
output layer. The global maximum function eral cells in t.oyer2 correspond 10 cell A in Loy·
for a layer might be specified as: erl. If these two layers are added, there are
SC\'eral potential input values fro- Loyu2 cor.
OU!..num , ~IObolmcx(ln..Joy«) (I0,3) responding to one input value fo.r Loyer!. The
problem is compounded forcen e bec.ause a
portion of llie cell is not delined for Loyer2.
This would assign a single value lo Out-
_num. The value would be the largest nun•· There is input ambiguity in .most raster
ber found when s~ching all 1be cells of operations. One might argue the best choice
tn.Jay<r. This ''collapsing'"of data from a uses Loyerz with comple1e overlap. or mean
two-dimensional :raster may reduce the map or median number for all overlapping cells.
algebra to scalar a lgebra. Many other func· orsome weighted average. This ambiguity
lions return a single global value placed in will arise whenever raster darn s.ets are 1101
aligned or have different cell siz:es. While
Globol
function
e.<J .•
lO 12 42
30 9 4
·I 8 15
globo~
moX1mum I
42 42 12
•2 42 42
42 •i 42
Flaurt 10..5: GlobA.J functions integrate iJiput from 11n ciui.rc IA)'C'r 10 uku.J:uc OUlpUt \'11lue1.
Chapter 10: Raster Analyses 439
Loyer!
cell A
ceII A
-- .....
~
~
L
__.. -/v
FIJUJ't 10-6 : b:ompatibl~ ccU sizw:cs or pid cdg« s.bould be hannouiz:cd ,ia rcsamplinJ: prior 10 analysi.J.
the GIS software may have a default method Chapter 4. The analyst may select a template
for choosing the "best'" input when cells mis· or standard layer in a specified coordinate
match. these decisions may not be univer- systent \\;th a specified staning coordinate.
sally best. such as lhe lower-left cell values (Figure 10-
Analysis is usually best served by a 6). With a fixed dimension and starting
prior re sampling of the data into a compati- point. all cells unambiguously match across
layers.
ble coordinate sys1e111, using the transforma·
tion and resampfu1g methods described in
440 GIS Fundamentals
Local Functions
There is a broad number of local timc-
tions (or operations) that can be conve- Table 10-1: Common local mathematical
niently placed in one of four classes: funciions.
mathematical tiui<tions. Boolean or logical
operations. reclassification, and multilayer Function Descrip lion
raster overlay.
Add, subtract, Cell-by-cell combination
multiply, and with lhe aiitl:lmelic oper.
Mathematica l Functions divide atlon
ABS Absolute value of each
We may generate a new data layer by cell
applying matllem;:uical functions 011 a cell-
by-cell basis to input layers (Figure 10-2. EXP. EXP10. LN. Apprles base· e and base
Table 10-1). Any number of inputs and out- LN 10 10 exponentiation and
logarithms
puts may be supp,omd. depending on the
function. SIN, COS, TAN, Apply trlgonomeU1c rune-
ASIN,ACOS, tioos on a cell-by-cell
A broad array of mathematical timctions ATAN basis
may be used. witb a few constraints. Raster
INT, TRUNC Truncal~ cell values, out.
data value and type are perhaps tl1e most pvt Integer portion
common constrai11ts. Most raster models
store one data value iu a cell Each raster MODULUS Assigns the decimal por-
tioo or each ceu
data set has a data 1ype and maximum size
1ha1 applies 10 each cell: for exan,ple. a 1wo- ROUND Rounds a cell vatue up or
byte signed integer may be stored. Mathe- dOWT'I IO nearest integc,
matic.al operation:s that create noninteger value
values. or \'lllues laiger than 32. 768 (Ille SQRT. ROOT Calculates the square
capacity of a two-b)1e integer). may not be rool or specifies other
stored accurately in a two-byte inleger out- root of each ceu value
put layer. Most sys1ems will do some form POWER Raises eaeh cell to a
of a111oma1ic type conversion. but there are defined power
often limits on the largest values that can be
stored. e\'en with automatic conversion. from tlllits ill meters to units llle3S\lred in
Although the set of fimciions and func- feet
tion names differ among software packages, Note that although many sys1ems will
nearly all Jl'ICkages suppon the basic arith· lei you perform these operations on any type
metic operations of addition through divi- of raster data. lhey often only make sense for
sion. and most provide the trigonometric inte1'1aVratio da1a, and may renirn erroneous
functions and 1heir in"erses (e.g.. sin. asin). results when applied to nominal or ordinal
Tnmcation. power. and modulus fi.mc1ions data. Nwnbers may be assigned 10 indicate
are also conunonly supported, and Yendors population density by high. medium, and
often include add,tional functions they per- low. and while the sin function may be
ceive 10 be of special interest These ns11be- applied lo these dala, lhe resulls will usually
matical func1ions are often applied in raster ba"e tinle meaning.
analysis. for exoruple. when ruulliplying
each cell by 3.28 to convert height values
Chapter 10: Raster Analyses 441
Input Output
o)
1 3 l t 0 1 0 9 0 1 0 l
0 ti 2 ·I 0 5 2 5 0 N l I
AND = - ~
I 2 5 0 0 2 N 2 0 1 N 0
0 t N N O l -3 4 8 a 1 N N
b)
1 3 1 I 0 1 0 9 1 l 1 l
0 N 2 -1 0 5 2 s 0 N 1 1
OR =
I 2 5 0 0 2 N 2 l 1 N 1
0 t N 0 .J 0 1
N
' 8 N N
cl
NOT
I
1
3
N
2
I
2
5
1
·1
0
- =
- ,_ -
1
0
0 0 0
N
,_ a
0 a
-- 0
a
1
0 l N N t 0 N N
442 GIS Fundamentals
the reader should consull the manual for the distinguisJ1 among origins for a tme assign-
specific software 1001 lhey use. mem.
Figure I0-7c shows an example of the Logical operations may be pro,~ded
NOT operation. This operation switches true that perform ordinal or equality compari·
for false. ana false for 1111e. Nole 1ha1null iOns. or 11la11es1 if cell values arc null (Fig-
input assigns null output. ure 10-S). Ordinal comparisons include less
Finally. note 1ha1 many systems provide than. greater than. less than or equ.,110,
an XOR operation. known as an eXclusive greater than or equal 10. equal. at1d not equal.
OR (not illustrated in our examples). This is Examples of these logical comparisons are
similar 10 an OR operntioo. except that true shown in Figure 10-Soand b, respectively.
values are assigned 10 the outpm when only These operations are applied ceU-by-cell.
one or the other of the inputs istrue. but not and lhe corresponding 1:me or false output
when both inpms are tme. This is a more assigned. As shown in Figure 10 -So, the
restrictive case than rhe general OR. and upper left cell of the first input Layer is not
may be used in ins.tances when we wish to less than the upper left cell of the second
input layer. so a O(false) is assigned to the
Input Output
a)
l 3 l l 0 l 0 9 0 0 0 1
0 N 2 ·l
less 0 5 2 5 0 N 0 I
:
l 2 5 0 than 0 2 N 2 0 0 N l
0 I N N 0 .3 4 8 0 0 N N
b)
l 3 l 1 0 l 0 9 0 0 0 0
0 N 2 ·1 0 5 2 5 I N 1 0
equal :
l 2 5 0 0 2 N 2 0 I N 0
0 l N N 0 -3 4 8 I 0 N N
c) l 3 l 1 0 0 0 0
0 N 2 -l 0 I 0 0
ISNULL :
I z 5 0 0 0 0 0
0 l N N 0 0 l l
f iJUl't' 10-3: E.~aDIJ)lc, oflogic~I comparison OJ)<nlton for r&$lc:rdaua uts. Oulptn values rcQOrd the
ordioal compari.so:iu for inputs. with 11 I sjpi.if)'llli tnac: aud a Ofalse. The IS1'tULLopcrn.riou assisus
1ruc ,•alucs 10 the ourput wttct'le,•cr 1hc:re arc: Jil.iHil'lt input dtl12.
Chapter 10: Raster Analyses 443
upper left cell in the output layer. The upper areas that have changed. If the cell ,,llues
right cell in tile first layer is less than the cor- are not equal across the years. the logical
responding cell in the second input layer. equol comparison will ren1m a value of o.
re.sul1ing in the assignment of l {tme-) in the while areas that remain the same ,,~IJ main-
output layer. l3ut the value of I. Further fogical compari·
We often need to test for missing or sons. 11su1g class ,~tues. could identify how
unassigned values in a raster data layer. The much each of the component crop types had
operation bai no standard name. and may be changed.
variously c.alled via ISMtSStNC, ISNVLL. Ol Note that logical operators may be
some otherdescriptively name fimction. The applied to interval/ratio. ordlinal, and cate-
operation tests each cell for a null value. gorical data. although ordinal comparisons
shown as N io Figure I0-Sc. AO is assigned should be carefully applied ·to categorical
to the corresp,onding output cell if a non-null data. For example, ui our crop types example
value is found. o,herwise a I is assigned. in Figure 10-9. soybeans areassigne<l a
These t<sts for missing values are helpful value of 2. and are '·larger" ·than com. but
when identifying and replacing missing data. this distinction does not imply that soybeans
or when detennining the adequacy of a data are somehow two limes larger. more ,•alu-
se.t and identifying areas in need of addi- able or anything other than just different
tional sampling. from corn.
Figure 10-9 shows an example of a logi-
c.al comparison among two cL11a layers. The
left and cent"11 panels show land cover for
an agriculniral area, with three categories:
corn ( I), soybeans (2). and all O<hers (0). We
may be ime~ted in identifying aues that
were rotared between these two crops. or
from 1hese two crops to other crops over the
2009-2010 time period. The logical equal
comparison between these layers reveals
equol
--
I corn I corn 1 unchanged
fl beens (l beons 1iJ cl>onged
II other Ill other
fla.ut-t 10-9: An ex:unple of a lo3ical operation ,ipplicd co catc~orica.l data. Jicrc laudusc cbissc:sofcom (\'llluc:
= l). SO)k~1is (2), and other erops (0) O\'cr1wo time periods. The-cqu.:1J opet:1tion rctunli ,0 where bnd '*
h.-, dwagecl. and 1 where ii 1'°'' remained consiant O\'Cl' these two period$.
444 GIS Fundamentals
Input Output
ou1
o)
l 3 l l
'"0 0 x t x x
I x
0 N 2 ·l Rec loss 0 N b N
2 b ;
by
l 2 5 0 toble 3 f x b s 0
4 e
0 l N N 0 x N N
5 s
b)
1 3 1 I In ron,ge out 0 b 0 0
0 N 2 ·l Reclass 0 10 1.5 0
0 d b N
;
by 1.51035 b
l 2 5 0 ranges 0 b c 0
3 5 to 10
0 1 N N N
'd 0 b d d
Fi1u1•t- 10-10: Rastef'r«lwiftca.tion by 1ablc U111td1i.na (o) aod by table range (b). Co both casic-s. input
cell ,'llluei :trc com~rcd 10 tbc ..in" cohut\n ofd1c 111blc. A 1tutc11 it found and lite corrcspoiM!ing
..ou(' \1llues .usipd.
Chapter 10: Raster Analyses 445
source layerassigned for a tn1e outcome is ofir-then slatement. Note thal the value that
specified. as is 1be Vlllue or source layer is ou1pu1may be a scalar va[ue. for example.
assigned for a false omcome. An example of the number 2. or the value ourput may come
one conditional function may be: from the corre.sponding location in a speci-
fied raster layer. The condition is applied on
a cell-by-cell basis. and 1he 0111p111 V11lue
assigned based on the results of tl1e condi-
Output a CON (rest, out if true, out if false) (10.4) 1ional 1es,.
0 5 2 ·I
4 2 5 0
0 I 7 5
Loyer8 LoyerC
I
x
x
x
x
y
x
y
y
(is LoyerA < 0 b
0
b
b
b
b
b
x x y y ~~ 0 b b 0
y y y y 0 0 0 0
L
Output
x
x
b
b
y
x
y
y
_j
0 x b y
y y 0 0
Fl2m~ 10~11: Rcclanificatiou by condition anii:llS a..n output based on a conditi01Ull test_ ht lhis cxai.uptc.
c:td1cdl in loy,erA il comp.,rcd to die muttlxr 3. FotloyerA cdl, with \'llh.tcs leis thrut 3. d1e coudition
c,-aluates to be tnie, nnd tbc ootput ccll ,·nhu: is 11.$$-ig:ncd from t..cyet6. If die LOverAcell ,·ah1c is equal to
or~tcr than 3. then th< output cc:U ,·aJuc: i.sassigncd fiotn loytrC.
446 GIS Fundamentals
5 l I I
0 N 2 N
-
2 5 0
'
N N 7 5
LayerB
I
b
• • b
(! Loyi;rA null?) x x y y
0
• b b
..:::.,/~
y x y y
a a 0 b x x x x
0 0 0 0 x x x x
L _J
Outpul
,_x x y y
y
• y b
x x x x
0 0 x x
f ia;urt 10-U: LOQl lofieal opcrn.tious may be wmbined.. flus example altows tbe lSNULL fuuc-tion
nnbcckkd iu the condittoiull fiu.ctiou, CON. As described in Fi3ute J0-10. the first A!i\lnleUt in thi:s CON
fiull!tion defines the Boolc:.n tc,t. Herc, if lhc cell \'aluc in Loye.rA is null (N), then the condi1ic:,11nl 1clit
rct\lms a troc ,·ahi.c. This c.'(ccut" tbe followlug cutry, nn~nignn1e11t to the outpu1 from ~ . Jf the cell is
non-null. ISNUll ren1ms a ,-aJuc of {alse. and the Output cell ,..,.Jue assigned from the corresponding loca-
tioii il'l la"fE'.rC. Note the Output ,•11Jues ate o orb. from LoyerS. (or the cdlt in LoyerA 1J1.111 h.\\'< miJJ (N) \'lll-
ucs.
Chapter 10: Raster Analyses 447
Figure 10-12 shows anotlier example of tion. For example, if there are null values in
nested runciion. Ou1pu1 values are assigned both lhe source dam layer a11d 1be area out-
from 1wo different inpm layers. Cell values side 1he clip. one cannot be cenain if 1he
are assigned rrom LoyerS when t.ayuA values nulls come from the source ,or indicate a
are null and from LoyerC wht?n t.oyerA values region outside 1he clip area. Special coding
are non-null. This might be desirable ifwe or 01ber provisions can be used 10 avoid
have an inc.omplete bul othemrjse high-qual- rhese a mbiguities.
ity da1a set a:ud we wish 10 fill missing val- A clip using ras1er daia n~,y also be
ues from 1tie nex1 best available da1a. Map implemented as a reclassificalion and then a
algebraic exp-ressions with nested ftmctions mulliplication. Nole 1ha1 in Chapter 9, we
can become quite complex, bo1 also may be described 1he vec1or clip function as a spe-
quite effecti\l,e and efficient in solving com- cial case of O\lerlay in whic.h the attributes
plex problems. and interior geometry were ·saved based on
1he bom1daries in a clipping layer. Tl\is clip·
Raster Clip and Overlay ping layer serves as an ou'lli ne or area tem·
plate for which data are retafaed. ln a raster
Clip (or subset extraction) is another clip. 1he clipping layer may be represemed
common 1ype oflocal raster function (Figure as a se1 of cells with a vah1e of l embedded
10-13). Source and 1empla1e data layers are in nonclipping cells with values of 0.
specified and an output data layer creal'ed.
This ou1pu1 layer contains only 1be values of Figure 10-14 illus1ra1es a raster clip
1he source 11lai are indicated by the 1empla1e operation that is a c.ombinalion of cell redas·
layer. The nature of the template and outpu1 sification and mulliplica1ioru. The first s1ep is
data layer values depend on lhe specific 10 identify the se1 of values llllll defines 1he
implememalion of1he raster extraction. dip area. This is the portion of the inpul data
Template values are typically assigJJed a laytr 10 lie w.insftrttd to tlie outpUI data
value of I for 1bose cells 1ba1 are 10 pass layer. Individual cell values or cell values
1htough 10 tl>e output and a Oor null value over an interval or range may be defined.
for 1bose th.11 are 10 be ignored. 0111pu1 val- These may come from a selection based on
ues for lhe clipped area are copied from the raster values. from a lis1 of "'1llues. or from a
source. while- ou1pu1 values for 1he area out- previous spatial operation such as a buffer.
side 1he clipped region are typically assigned A clip template is created 1ha1 defines a
a nuUvalue, or the value 0. binary mask. a set of cells cbm "mask" our a
Care must be taken to ensure there are ponion of an input layer. Cells 10 be passed
no ambiguous cells c.reated by this conven. 1brougb 10 lhe ou1pu1 layer are se110 lhe
I 3 4 7 0 0 0 I N N N 7
6 J 2 ·I 0 0 I I N N 2 ·l
Clip •
I 2 5 0 0 I I I N 2 s 0
0 1 3 2 0 1 l 0 N I 3 N
Fii::ure- 10- 13: Au ilhalmlion of a r:ister clip (or c.-<tnu:tion) Opcnltion. \•;dues from :1. source l:1.ycr arc
e:<tracled based oo valu« iu au c:<trnetion template.. Output ciclls arc oftoru assigned n null \'3.luc. N, iu
the "outside'' area.
448 GIS Fundamentals
Input roster
2 2 2 8 8 2 2 2
2 2 2 8 8 8 2 2
2 3 3 3 8 8 8 7 C!Jp roster
2 3 3 3 8 8 8 7
0 0 0 0 I I I I
3 3 3 6 6 6 7 7 0 0 0 0 l l l l
3 3 3 3 6 6 6 7 x 0 0 l I l l I 0
3 6 3 6 6 6 6 6
0 0 I I I 0I 0
3 6 6 6 6 6 6 6
0 0 I I I 0 0 0
0 0 I I 0 0 0 0
0 I 0 0 0 0 0 0 Ou1put roster
0 0 0 0 0 0 0 0 0 0 0 8
0 2 2 2
0 0 0 0
8 8 2 2
0 0 3 8
3 8 8 0
Fi~utt 10-14: A rastcrdip open.lion 1~y be pcrfonned \i a multipli-
catton of a binary uusk. Au i.itput ttlStCT(lcft) bas be-en cuultif.Hcd by a 0 0 3 3
3 8 0 0
bin111y rnster. Cdl·byo.ecll mul!iplic:uion pAH« tlU"OUgb uc 1 input
cell corre,ponding to a I in llte clip raster (center), 11nd pa"" through 0 0 3 6
6 0 0 0
a O for alJ other cells to tlk: output f'Micr (nplt) . Note I.bat zero values 0 0 3 3
0 0 0 0
ate 1.01 allowed in thC' input r.titct bcuusc they Wl\'e the same. cff«t
:.s :terot in die clip,n1s.1rr. 0 6 0 0
0 0 0 0
0 0 0 0 0 0 0 0
value l (Figure 10.14). Cells 10 be "clipped mented so that it admitS continuous num-
away" are set co the value O. l11e clip tem- bers. this iypically resul1s in too many
plate or layer is then multiplied by the input unique cell combinations to be of much
easier, yielding rut ou1pu1 easier. Cell-by-cell value. If continuous da1a are used, 1hey are
multiplicaiion by l passes values 1hrough 10 often convene<l 10 ca1egories firs1: for exam-
lhe ou1pu1layer. Mulriplicalion by Odiscards ple. rainfall may be assigned to :low.
values for 1he cell, resuhing in a clipped ras- medimn, or high classes.
ter to 1he area of U1terest Raster overlay involves the cell-by-cell
Raster overlay combines features from comparison of values in two or more layers
1wo or more darn layers. and is among 1he (Figure 10-l 5). The values in each inpul da1a
most useful of spa1ial func1ions. The feanires layer are associated with a specific combina-
in raster data corre.spond to cells, or perhaps tion of addi1ional variables. and these addi-
groups of cells wi1h 1be same \'3lues. but as 1ional variables may be recorded in an
with vector overlay. great utility is often attribute table. Each unique co111bina1ion of
gained from comlbining data from diffe.rent cells from the two layers is iden1ified. and
layers. assigned a new ideo1ifier (0u1-10) in 1he 0u1-
There are some differences between eas- Note the two input attribute tables
PVT l<JYtr.
ier and vec1or overlay due 10 the differences are combined in a corresponding fasl1io11. ln
in the da1a model. Ras1er overlay is of1e11 Figure 10-15 you C9JJ see tile upper left cor-
restricted to nominal data. The cell values do ner of 1he 0\/lp<,1 ~ r has 1he corresponding
type and nome attribute values from Input
1101 typic.ally represen1 continuous variables
layer J. and the JO and cost atlribute values
such as temperanue. bu1nu.her categorical
from Input IOyer 2.
variables such as type or township name.
Allhough rasier overlay may be imple-
Chapter 10: Raster Analyses 449
--
Roster cells Attribute. tcbles
a 0 a b
Input 0 0 c c • SI\Jbblr\$
b buppk,51
loye.r 1
a a b c • ......
0 0 c 0
overlay
Input x
x x
x
x
x
x
x
10
x
v
....
C~I
low
layer 2
y y y y
-y y
f-
y x
• • b v
bul)r)ki$$
.,, ....
4 4 6 l • nodo
Fi.&llt't' 10-1$; .R..i.stCT o,·crl.1y in,-olvc.s lbc ccll-by..ccll c01n1.>im1tioo of da1.i in 1mihiplc J11y·
Cl'$. Kew O\ltput ,iatucs arc g,cn<-ratcd for each unique combinatioo of input \'alucs.
Recall th.at in many implementations of a ceJJ of type o in the lower Ce.ft comer of
1he raster data model 1here is a many-to-one Input~ 1 th.al is not contiguous with 1be
relationship l>enveen 1he raster cells and the rest ofthe o cells in Lo~r t. This combina-
attribute rows. This occurs bec.ause multiple tion carries through to 1he output, where
cells correspond 10 each row. Also note tlta1 1here are disjw,ct groups of cells with out-ID
the cells may form disjunct regions of the values of 6.
same type; f<M°example. figure 10- 15 shows
450 GIS Fundamentals
Fuzzy Membership and Raster Fuzzy se.ts may be assigned via a mem-
Overlay bership function (Figure J0-16). In our Ctlt·
rent example. we may repn."Sent the surety a
Raster data n,ay be reclassified into pixel is dense vegetation from the infrared
/11::y sets. typicaUy to represent uncerc,inty index we use 10 assign the vegeration type.
or ambiguity in set membership. For exam- Ground visits. or truth. may show tba1cells
ple. laud covtr mapping requires assigning with a strong infrared index are always vege.
distinct classes like water or upland. There uuion. while those with a weak index are
may be some uncertainty if an area near a not and intenucdia1e indexes ar,c split. We
shoreline is water or upland. Similarly, land may develop a graph 1hat shows. our c.er-
cover along a suoorban/n~al edge ns,y be tainty as a function of the infrared index that
ambiguous. because mis-interpreting a lawn connects specific levels oftlte index with the
for a hayfield cuay change a cell assigmnent vegetation set
from one class to another. Similar ambigt1-
ities due to data source or conversion or Fuzzy raster cells are then assigned
assignment methods may resull in fuzziness based on 1he input values. rrnnsferred
in raster \lalues. \Ve sometimes wish to through the membership func1ion. to give a
reflect this ambiguity. corresponding fuzzy value. Higb values
reflect high likelihood of membership in the
We lll3Y reprcse01 uncertainty in our vegetation se.1, while low values represent
measurement withji,::y member.ship val- low likelihood of membership.
ues. Suppose we have a fores r/noo-forest
classification map based on satellite data. Fuzzy rasters may be combined in vari-
Recall that die level of infrared reflectance ous overlay operations. often in ~stimating a
may be used singmy or in combination with membership or suitability 1hat depends on
other remotely-sensed image bands to detect several factors (Figure 10-17). For example,
vegetation. Our classification process often it may be impMAnt 10 idemify ripllfiaJi
provides an estimate of the per-pixel likeli- zones near water courses in a semi-arid envi-
hood of our assigDDlent being correct. e.g.. rooment Vegetation is a good indicator of
some cells are categorized as forest wi1h a riparian zones, but there are noru·riparian
likely 99% ac.curacy, whi.le other cells have areas with dense vegetation, and riparian
only a 55% chance of being forest. We may areas withom vegetation. We might also use
wish to represent 1his uncertainty in mem- ele\'ation data and derivatives to help with
bership. again witJ1 fuzzy sets. the riparian/upland zone assignment. For
example. riparian areas tend to be natter
than adjacent landscape due to s.edime.nt
3 8 ti -- ;s
r•
'
I.:
F lt.1n-l' 10-16: Ft~• Sci iaui_gmncitL Rlllclcr t'llluet on the lcfl c01TC1po11d 10 .t probsbili1y of n•nnbcn.hip 10
a clllSS (center), "'-'"<din a fuuy raster (right).
Chapter 10: Raster Analyses 451
deposition, and they are close to tJ1e same the component cells. In our example, the
elevation as the rivers running through them. riparian membership value fora raster cell is
Local topogmphy within riparian areas make the largest value for th.it cell from the vege-
the slope and height relationships somewhat tation. slope, and rise above waler cells.
fuzzy, and so I may develop membership Fuzzy AND assigns the miruimmn of the
(rn1ctions (or each oftbese (actors and prob- component cells. and hence is more conser-
ability of riparian membership. vative in assignment. Other versions include
Fuzzy membership rasters may then be fi1zzy product, sum. weighted sum, and
combined in several ways 10 assign member- gamma combina1ions.
ship. Fuzzy OR overlay assigns the probabil·
ity of cell membership to the largest value of
02<) 036 O!,O 0.0 076 Ot> oo, en 093 08. ooi c 10 oiz 0,1 0<>3
0 8 0.lol o., · 0.M 100 0.4; O?I ".OJ OM ,oo OCN OOb 0.06 C.8l 0(...t
.•
,•
.-·)
··1
\.
•
'' , >'
,
Fuzzy CR Combination Funy .a.ND Cotnb1not1on
01]
. "' 018 01Vi on 00 021 O.C::' 0.&oil 041
0.36 oa, 088 100 :.oo oin oos 03} 0"" 08'
Fijtlltt 10-17: Ft1z:zy sets: o,·fflay way take se,·ttaJ Coons, iuc:ludina OR corubioations, as.signing the higbes1
comc:idnt1 fuzzy numb«. or AND combiiiati:ou. aui31Ung the !owe.st coiucido u fuzzy nwubcr.
452 GIS Fundamentals
• •
•
fla:un 10-18:TI,e concept of a mo"i.ug window inrnstcrncigbborbood opmiliom. Herc.
II J by J ,.,indow is
swq,t from left 10 righ.1and from lop lo bottom ,icross a t11S-1¢r lay«.
The window a1 each Joc111ioo defines t.bc input ccllt used in II r:u.tet opcratioo.
Chapter 10: Raster Analyses 453
numbered in both the x and y directions to include caJculating tJ1e largest value, the
provide a narural center cell. ru>d 1bey are mode (peak of a bis1ogram). median (middle
typically square. A 3 by 3 cell window is 1he value). 1he range (larges, minus smaUesl). or
most common size, allhough windows may diversily (nwntrer of differenl values). These
also be rec1a11gular. \Vmdows may also have neighborhood operations are useful for many
irregular shapes: for example. L-shaped. cir- kinds of processing.
cular. or wedge-shaped moving windows are Consider the majority operation, also
someiimes specified. known as a majorityfilter. You might won-
There are many neighborhood functions der why one would want to calculate a
1ha1 use a moving ,,~ndow. These include majority filler for a da13 layer. Da1a smooth-
simple functions such as mean. ma.timum. ing is a common application. We described
minimum, or range (Figure 10- 19). Neigh- in Chapter 6 bow multiband satellite data are
borhood functions may be complica1ed, for often converted from raw image daia to land
example. the s1a1is1ical s1andard devia1ion. cover classificaliou maps.Tl1ese classifiers
and they may be oonarilbmet-ic. as the func- often assign values on a pixel basjs, and
tions that retum a cowll of the numberof ofte-n result in many single pi'<els of one land
unique values. or tlie mode. or a Booleru> cover 1ype embedded wi1bi11 ano1her lru>d
occurrence. Any function that combines cover type. These single pixels are often
infonnation from a consistently shaped smaller than the minimum mapping unil the
group of ras1er cells may be implemeuied smallest unifom1area that we care 10 map. A
wi1h a moving window. majority filter is often used ·10 remove this
Moving window functions may be ariili- classification '1J.oise.''
meiic. adding. subtracting. averaging. or olh- A majori1y filler is illus1ra1ed in Figure
erwise matbe.matically combining the values I 0-20. 11 illustrales NASS crop data for an
aroUttd a ce,mal cell, or they ,nay be co;n. 111~ in cen1tal 1t1diaful. base.:t on classified
parative or otherwise extrac1 values from a sa1elli1e images. There are over 40 common
set of cells. Common statistic.al operations land cover types in the area_ but these have
00 00l ~
Fi&m't 10~19: A g:iwn rutcrncigb-
borbood may ddiric the i1lpUI for
$C'\'crnl railer ncigliborbood opcrn•
tioos. Here a 3 by J ucighboruood
is q>«ifiC'd. These uiuc cells may
be tUcd H Utpul for m e:in, inedilln,
r.uioimuw, or ~ nwnbcr ofother
Mean \ M • d l ~Minimum functions.
00 00 00
454 GIS Fundamentals
• rood
.. ,am
• 1>eons
OOlht.r
.J'..A,sma.,onty soof
beol'\S,
A nrighborin,i ,ens:
tht ceMer cell is
- . - r<clossed rrcm corn 10 beons
i.;
No moJority in the .; neighboong cells. ~
so 1he center cell is not re,lossed.
_ oflel retcins us votue of be<lns _
Flgutt 10,20: An C'."tampk ofa m.ajority filter npplicd 1011. raster data I.ayer from a claui.ficd satcUite i.n'-'lge.
M.:111)' isoL,tcd cells arc: con,"cr1cd to the cntcgor)' of the domi.i,11111 w rroundi1ig class.
been reclassified iiuco the dominanc 1ypes of 20). lf a majoricy is no1 reached. as when
developed (road). com. beans. and otber o,uy one or two cells add up 10 the most fre-
crops. Each pixel is 30 m across. and the quent type in the four bordering cells, then
image on cbe left is NASS da1a as delivered. 1be center cell ,•alue is unchanged in the ou1-
Nole cbac com and beans dominate. bu1 cbere pu1 (bonom. Figure 10-20). The removal of
are many "stray.. pixels of a dissonant vege- most of the single pixel "noise'" by the
cacion 1ype embedded or on 1he edge of a majority filler can be observed in 1be classi-
dominant cype in an area: for example. bean fied unage on the rigtu side of Figure 10-20.
pixels in a com field. orcom pixels in a be.an There may be many variants for a given
field. These scray pixels usually do 001 rep- operation. The majoricy fi11er just discussed
resent realicy. in 1ha1 ahhougll cbere may be may assign an outpul value if only 1wo oftbe
1he isolated plao1 or cwo from previously four adjacenl cell values are most frequenl,
deposited seed in these annual crop rota· or use the Sor 24 nearest cells to calcula1e a
tions. che patches alnlos1 never approach 30 majority. The dependence of output on algo-
meters in size. The embedded ce.lls are most rithm specifics should always be recognized
often mis.classifications due to canopy thin- wbeu applying auy ras1er operation.
ning or perllBps weeds below cbe crop. and
are often below the minimum mapping wiit. f igure 10.21 shows an example of a
mean calculation using a mo\ling window.
The illustta1ed majoricy filter councs cbe The function scans all nine values in the
values of1he four cells sharing an edge wi1h window. It swns them and divides the sum
any given cell. ff a majori1y, me.ailing 1hree by lbe number of cells in uie window. thus
or more cells. are of a type. cben Ille cell is calcula1ing the mean cell \'l!lue for the inpu1
output as chis maj oricy cype (lop. Figure 10- window. The multiplication may be repre-
Chapter 10: Raster Analyses 455
seuted by a 3 by 3 grid containing the value Note that when the edge of the moving
one-ninth ( J/9). The melll\ value is then window is placed on the ma.igin of the origi-
stored in an OUlJ)\11 da1a layer in the locarion nal raster grid. we lose tile outside rows and
corresponding to the center cell of the mov- columns in our output raster. This is illus-
ing window. 1'he window is then shifted to tnned in Figure 10-21. right. The moving
the right and 'lhe process repeate<l. When the \\~ndow is shown in the upper right corner of
end of a row is reached the window is rhe input raster. Jf we place 'lhe center cell
returned to lh.e leftmost colulltlls, shifted for tl1e window 111\y ttigher or further 10 the
down one ro,v. and tl1e process repeated umil right tllen pan of the kernel will fall outside
all rows have been included. our input data. Output values are not defined
The moving window for many simple for tl1e cells along tlie top. l>onom, 111\d side
mathematical functions may be defined by a margins of the output raster when using a 3
kernel. A kernel for a moving window func- by 3 window. Each neighbo:rhood operation
tion is the set of cell constants for a given applied to successive output layers may
window size :and shape. These constants are erode the margin f\artber. For larger kernels.
used in a function at every 010\'ing window e.g. 5x5 or 7x7. we loose a wider margin.
location. The kernel in Figure 10-21 speci- We commonly address ibis margin ero-
fies a mean. As the figure shows. each cell sion by defining an oversize<! snidy area.
value for the l npU1 kJye-r at a given window Data may be Josi at Lhe margins. bul these
position is mwltiplied by dte corresponding data are not importam if they are outside tlle
kernel constant. TI1e result is placed in the area of interest.
Output layer Different moving windows 111\d kernels
may be specified to implem.ent many differ-
92 10.4 10.2
92 10.l 9.8
flJurf' 10.21: An c.:ctunp!c of II mc-111\ full<:tion 11pplicd ,-i:1 a n\O\'in3 w11idow. ln~n fayer cell ,-ah.ic:s f:ro1n
a 3 bv 3 mo,ing window (upper left) IIR 1m.dtipllcd b)' a kcmd to sp«ify conupoudmg oulpllt cell val·
uos. Thi, process is ~pc.filed for each «U in t.bc input layer.
456 GIS Fundamentals
ent neighborhood fw1c1ions. We mighl be 1he center cell. Spatial structure such as an
interested in the differeuce in a variable abn1pt change in ele,•ation may "be detected
across a laudscape. for example. changes in by this kemel. Tiie keniel in 1he cen1er-bo1-
canopy height 10 detecl tl1e boundary of a tom of Figure I 0-22 may be used 10 detect
recem forest fire. diJferences in the v direc1ioo.
Edge derectiou may be based on c.om- There are functions known as lu'gh-prus
paring dilferences across a kernel. The vaJ. filter. witb kemels that accenn1ate differ-
11es on one side of the kernel are s11btrac1ed ences between adjacent cells. These bigh-
from the values on the other side. large dif- pass filler kernels may be usefuU in identify-
ferences result in large output values. while ing lhe spikes or pits that are cha.racteris1ic
small differences resuh in small 0111pu1 val- of"noisy" data. Cells identified as spikes or
11es. Edges are de:fined as tl1ose cells with pits may then be evalua1ed and edited as
ou1pu1 values larger tl1an some 1hreshold. appropriate, removing the erroneous values.
Figure I0-22 ill11strates the application High-pass kernels generally con.rain both
of an edge detection operation. The kernel in negative and positive values in a panern that
the cemer 1op of Figure 10-22 amplifies dif· accenlual'es lac.al differences.
ferences in the x direction. The values in the Fig11re 10-23 demonstrates 1he use of a
left of three adjacent cohunns are subtracted high-pass kernel on a dala set containing
from the value in the corresponding right· noise. The elevation data set shown in tbe
hand row of cells. This process is repeated top portion of the figure contauis a number
for each cell in the kernel. and d1e values of anomalous ce.lls. These cells have
smnmed or averaged across all nine cells. extremely high ,,.Jues (spikes. s.hown in
Large differences result in large values. black) or low values (pi1s. sh0\\11 in white)
either positive or negative, saved in the cen- relative to nearby cells. If uncorrected, pits
ter cell. Small dilferettces between die left and spikt! will afftct slope, aspo!CI. and
and right col11mns lead to a small n11mber in other terrau1-based calculations. These
A
0 40 l~O 0 0
Input Kernels
layer 0 ,o •O 0 0
0 0 0
0 80 80 0 0
980 980 940 940 940 honzonto l 0 ·I
difference 0 0 0 0 80 80 0 0
980 980 940 940 940
0 80 80 0 0
980 98C QOC 900 900
980 980 qoo 900 900 B
-980 980 -90C 900 900 vert1co 0 t 0 0 0 0 0 0
difference 0 0 0 0 0 40 40 40
l 0
0
0 0 <O 40 ,o
0 0 0 0 0
0 0 0 0 0
Figure 10,22: There is ,i llltJc uumbcro( kcmds used with ruo\'inJ; wiudows. The kc111d ou cop amplifies
cliffttcnees ill die horitont11I dirct1ion., while tile kc-md on tile bouom lllnplifics diffcmic.ct ll~ the \'crtic:\.I
dircctio11. ThcK and other kernels may be '"ed to detect sp:c:ific fc.nt\U'C$ in a cbtn layer.
Chapter 10: Raster Analyses 457
r-- .
.. ~· ..•• ·~ ••,,. ·,,.,.,.~ ,,....
••• t,Cl ! 1.1. r~
"' ...
'" "'
1:1
"'
,. .... m
""
"' • ~
-
- lt0'1 ltJl1 tCU
kernel for
high~poss filter
.,., " "" .... .. ... -"' "' "' ... .,.
. J:C!
"'
..' ---- ""' "'
11r,11 l1co, tCQI'
~.'
'"
,;,
"'
·l n,
m•
'" "' "'
1;:.;E ... ... .. ....
·l
,,...
---
... ,:,
~ >R
rn
... '"
1. . . . . .
• •
-
·~· ...
,, "~ ,., ••• .., ,,. "' "', "' " , "'
...,.., .,... '" ... .,"' "' '" -. ..., l I.JC.
...
..
I•"- ""
•1: ;r• :oc1
..
... ... ... ,,, •«• ... = .., "' "'
'" "' "" ..., ""' '"' '" ... .. -
'"'
t.l~ M
..,
"'
..,
9
"""'"'
'"
.-,
,,., ;1.,1 •••
··•• [1<m
,n
,,., ""
1,0
... "" ... ... "" ....
i·~
•.. ''" ,,-~ 1•11 ... ... •= -- \\JO
.. OU 1>0
!ClO _11111
m
sp,kes • pits 0
/ Ouput layer
l. 35.7 ~ t;,,tj--1-j--j-1,-r-!-~--+--;--t-,-+-;
~ n ,Ul !fl 1n 1(0 tll IW 1'!4 ~ ,> •I ., .. , ''°
- -""' -
u. n ... « u, "
\I<) I
l¢J l(I'
1...
1t>
~ '6' al
~
1,. 1•;11 16: 10, IM
1111 1»
'"
.,
f ia:o~ JO. ;u; An e-AAmplc of a mo,;ng wiodow f\lnctioo. Ras1crd.,1a oftC"U contain ~001112lou, '":noi" ..
(dad. and Li$111 cells). A high·pau filter ,ind kcmcl. shown at top fdl. highlights ••noisy" cells. Local diffCJ·
enc« ate 11ru3lilificd M> lh111llllomalous «lb ntt c:u ily idt1nificd.
458 GIS Fundamentals
locally extreme ,,,,lues should be identified shown here recorded as integers. Output val-
and modified. ues may be real numbers or integers.
The high-pass kernel shown contajns a depending on the. progranuning .algorithm
value of 9 in the center and •I in all other and perhaps the specifications sel by the
cells. Each value is divided by? 10 reduce user.
the range of the 01ttpu1 variable. The kemel The mean filter (Figure I0-24) both
returns a value near the local average in "Sm<>Olhs.. data and increases sparinl co,•o,f.
smoothly changing ams. The positi\•e and once in lhe output da1a set. A\ler.aging com-
negative values balance, rerurning small bines nearby cells. bringing hjgh or low
numbers in flat areas. values nearer the local mean. Tlte nearby
The high-pass ke.mel generates a large values become more similar. meaning 1bey
positive vah1e when centered on a spike. The have a big.her spatial covariance (discussed
large differences between the center cell and in greater depth when in the spatial predit·
adjacent celJs are accentuated. Conversely. a tion section of Chapter 12). Large numbers
large negative vnlue is generated when a pit are found near large nwnbers. and small near
is encountered. A:11 exan,ple shows the appli· small. Low spatial covariance means nearby
cation of the high-pass filter for a cell ne.,r values are unrelated - knowing lbe value at
the upper left comer of the input data layer one cell does not provide much Unfommtion
(Figure I 0-23). Each cell ,,,.Jue is multiplied about the values at nearby cells. High spatial
by the corresponding kernel coefficieo1. covariance in the ''real world" may be a
These numbers are summed, and divided by good thing. lf we are prospecting for miner-
als, then a sample with a high value indicates
9. and the resttlt placed in tlte corresponding
output location. Calculation results are we are probably near a larger area of ore·
shown as real numbers. but cell values are bearing deposits. However. if the spatial
au1ocorrela1ion is increased by 1he moving
7 9 j 10
!
u
·~ .... .........
9 ~ Ou1pU1
layer
8 9 9 u 8 \
102 JI 3 IL i
9 '\o 12 10 8
__..,--
~ -----
92 10.4 10.2
m eon
kernel
- 92 10.J 9.8
f i.JUJ" 10-N : A ruq\-ing ,,-inc!Qw m11,y increllK s.patia.l «worinucc.. Adjacent output «111. .slmc :mnny iupul
<:c.U values. In 1:bc m~ fiw<:tiou shown bC-1'1:. tbis ricsuhs in similar outptll ccll ,'lllu«.
Chapter 10: Raster Analyses 459
,.,,
,_. ......,
Morrl,on Mltlct L..at-s K,1n.tb«
W1t:ter
DevNoped
For-en
Shr't.ib/Gr.lSS
a .a
2.1
28.?
U
13.6
S..6
<O.l
1.6
1.1
3.•
l?.7
LI
39.I
L6
•••
n.,
1.7
...•
o.•
,.,
L9
36.8
0.6
Agrlgutt...-e
VIC!'tlllnd
1.1
>2.)
U.9
21.0
11.7
.,.. 1a.a
34.I
<&4.4
2A.O ,..,
27.7
,...,
32.7
~. A.i t,~n
•
Fl;ttff' 10.2:S~ St111i&tiC1 al'C' often cakuJmcd by zones, be.l'C c:ociraetint limd C:O\"c:t C31etorics from :. rattCT
l.:lycr-, ~ 11,cd on iona. dcfinc:d by :1 , -cct~c:ounty bo.1.nd:u-ics (blnek. linu:).
d.1.1a
460 GIS Fundamentals
ferent iOQes; for example, we may be creal· Zones may be defined in vec1or or ffi$ter
ing an eleva1ion data set from many sources. layers, depending on the software. If raster.
and we may wisl1 to use the highest-quality zonal f1mctions may require compatible
data in zones where it exists. and use succes.. coordinate systems. cell sizes. and orienta-
sively poorer data in other zones. Zonal 1ions. although sometimes software will
func1ions give us these capabilities. automatically cooven via projection and res -
Figure I0-26 illustrates the application ampling. as needed.
of a zonal fimction. In Utis example. the
function calculates the zonal average for Cost Surfaces
ht.Lo,.,, based on zones defined by
zonuaver. The syntM here is Many problems require an analysis of
travel costs. These may be mone-tary costs of
travel. such as the price one nms1 charge to
Out_Layer • profitably deliver a package from the nearest
ZoneA"9(1n_Loyer, Zone._Loye:r) ( 10.8) distribmion cemer to au points i:n a region.
There is no standard syntax across soft,v11re., Travel costs migb1 also be meast.1red iu 01_her
so the specific order and interpretation of units, for instance, 1he time it lakes to travel
operands depend ,on tile software used. from a school to the nearest hospital. or as a
likelihood, such as the chance of a noxious
Note that the output here is a raster. with foreign weed spreading out from an intro-
identical values in all the cells of a given duc1io11 poi111. These analyses may be per,
zone. This is a rather c-0mmon version of a fanned with the help of cos! sr11foces.
zonal functions. Many GIS softwares may
create a table witb zonal identifiers and sum- A cost surface contains the minimum
mary values. with or without an 01.11put data cost of reaching cells in a layer from one or
layer. more source cells (Figure I0-27) . The sim-
plest cos1 surface is based on a unifom1
Chapter 10: Raster Analyses 461
travel cost. Travel cost depends only on the culaled from e.ach source cell, and the lowest
distance covered, with a 11,ed cost applied cost is typically placed in tlte ou1p111 cell.
per uni1distance traveled. This cost per unit Note that distance is commonly mea-
distanc.e does. not cJ1ange from cell to cell.
There are no !barriers, so the straight line dis- sured a1 least two ways - • s1rnigh1 line
(Euclidian) distance, as shown in Figiare 10·
tance is convened 10 a cost. First, the dis- 27, or as a row-column distance. A row-col-
tance is calculated from our source or urnn distance is me-.asured arong the row and
stoning location to each cell AS illustlllted column axes. and is by defin,i1ion longer 1Jian
in Figure 10-27. 1he distance is calcula1ed 1he straight-line djstance. Straight line dis-
based on the Pyllmgorean fonuula. Dis- 1ances are preferred in mos, applica1ions,
tances to tacb cell in the x and y directions al1hough they are more ditr.cuh to imple-
comrib111e 10 lbe total distance from a so1u-ce ment.
cell or cells.
Travel cosis may also 1"" calcttla1cd
The distance from a source cell is com- using africHon s,nfnce. The celJ values ~fa
bined with a fixed cost per unit distance to friction surface represent lhe cost per urut
calculate travel cost. As shown in the right travel dis1ance for crossing each cell. Fric-
side of Figure 10-27. each distance value is tion surfaces are used 10 represeo1 areas with
multiplied by the fs,ed cost fac1or. Tins a variable travel cost. Imagine a large mili-
re.suits in a cost surface. a raster layercon- 1ary base. P31t of the ba~e may inclu~ flat.
taining the ,ravel cos, 10 each cell. lf1here smoo1h areas such as dnll f.elds. parking
are multiple source cells. travel costs are cal- lots or parade grounds. These areas are rela-
1ively easy 10 cross. witll correspondingly
= 22.4
,, /
source source
20 ' '-.. -..to cell
40 20
cell
.... ./
[/'
/
22 4---
• 14.1 10 44.8 28.2 20
Flaun I0-21: A co~t 1\uf.1.ce ba,cd Oll ll f-t., cd co,t ~r unit disb11oi:. Minimum dista.occ from n s.ct of
,oon:c «lh• it multiplied by a fixed coo *tor to )'lcld a ~st surf<Kc,
462 GIS Fundamentals
low travel times per unit distance. Other The travel cost required to r.each each
parts of 1he base uiay be covered by open cell is lhe minimum accumula1ed 101al of the
grasslands. Wbile 1be s11rface may be a bil cost times the distance to a source cell. We
rougher, travel times are still moderate. specify a minimum accumulated cost
01ber ports may be composed of fores1s. because if 1bere is more lhan one source cell.
These areas would have correspondingly 1here is a large munber of po1emial pmhs 10
high travel times, as a vehicle would have to each of these.source cells. Distance across
pick a pa1h among tl1e trees. Finally. tliere each cell is muhiplied by the friction surface
may be areas occlllpied by wa1cr. fen~s. or cos1for 1lu11 cell and s11mmed for a palb 10
buildings. The.se are.as would have effec- ac.cumulate the total travel cost. The lowest
tively infinite 1.ravel times. cos1 p.1th from a source location to a celJ is
Each eel.I in d1e fric.tion surface contains usually assigned as 1be crave! cost 10 1ba1
the cost required 10 traverse a ponion of the cell.
cell (Figure I0·28). A value of 3 indica1es i1 Figure I 0.28 shows an example of cal·
costs 1hree units ( of time. money. or other culations for the friction cost along a set of
fac1or) per unil distance in die cell. If a cell paths. These are straight line palhs thal
is IO wide and costs 3 w1i1s per uni1 distance, lta\'el eilber p<1rallel 10 1be cell boundaries
and 1he cell is crossed aloog 1he width. then (purely in an x or y direc1io11) or a1 some
the cost for traversing the cell is 10 limes 3, angle across cells.
or 30 uoi1s. Sample calc11la1ion oflhe fric1ioo coses
The aclual cost for trave.rsing the cell fora path parallel to the x-axis is shown at
depends on the distance ,raveled lhrough 1he lhe 1op middle and on 1he left side of Fig11re
cell. When a cell is traversed parallel 10 1he 10.28. NOie 1ba1 wbeo traveling parallel to a
row or column edge, then the distance is cell boundaty, one half-cell width is lrn-
simply tlie tell diilleoSioa. Wilen a cell is ve~ed ill tlie s1a,1ing And Mdittg cells. lmer.
1raversed a, any otlier angle. lbe discance mediate cells are crossed a1 a full cell wid1b.
will vary. It may be greater or less than the \Vhen moving from the slarting cell to the
cell dimeusion. depending on lhe angle and adjacenl left cell, a friction surface value of
loca1ion of lbc pa·1b. l is eoco11n1ered. lhen a friction s11rface
value of 3. One-half the disrance, 5 units. is
~· /.,/
'9 ,,,.~ "·
. ($ • 3)
20
50
2
~ ~
~..... I 1
\ Cost •
..___ (5 6 • I)
39.2 14.l 10
(5.6 " 3)
2 1 1 (5 6 • I)
. (5.6 . 2) 42.3 22.4 ,!O
39.2
10 10
...,_. urtti --+ +-- uni•s -+
Fi&u1·t- 10-18:A co1,t surfnc:c: b,u cd 011 s~1i.11I\' \"1.riablc t:nn'Cl C:05'1s. A fric:1io11 ,,lffac:c spcc:-ifics the fP:Uit Uy
varying co5t of tm,·cling through rn.&t¢rceu.._ The diitn.zw;-c tra,·mcd tbrougb each cell is multiplied by the C:0$1
in the 6icrion swiacc. The ,·alues arc sunuutd for och path 10 yield a total cost.
Chapter 10: Raster Analyses 463
through the top-right cell at~ per-un_it fric- In general. the cost of a11y path is
tion cost of 1. One,balftbe distance is expressed as:
through the adjacent cell to 11te left. at a per-
uni1 friclion cos1 of 3. The total cost JS then
the distance traveled in each cell multiplied
by the per-11ni1 (ric1ion cost of the cell:
where d, is the distance and c, isthe cost
across eaclt cell of a patlt.
5 "1• 5 ' 3• 20 (10.9)
Many sofrwares calculate the C\U11ula-
1ive cost for the most d,rect pmh usmg a
The friction cost when traversing cells at slightly different approach, called dte n>w-
an angle. is illustrated at the bottom le~ ~d column disrm,ce. Rather than travel along a
bonom center of Figure 10-28. The fr1cuon straight line path. the row-coh1mn distance
cost is the sum of the cell cost per unit dis· travels from cell center to re.II center (Figure
tance multiplaed by the distance traveled in J0-29). Calculations are much easier
each cell. The path begins a1 the source cell because 1he leng1h of the pa·th wi1bin each
and ends rwo cells 10 the left and one cell cell is constant with row cohmm distance,
down. Each intervening cell is traversed for and for square cells this distance equals the
a distance of 5 .6 cell units. The distance tra- cell width (or height). The distance Ill each
versed in each cell is multiplied by tile fric- c-ell , 1aries wben using the straight line dis-
tion value for each cell. The total cos1for tance. and so the time required to calculate
this leg is: the accmnulated distance is substantially
increased. The row/column distance gives
5.6"1, 5.6 "3 , S.6' 1 • 5.6 '2 • 39 2 (I0.10) the same relative cosls for travel from a
source cell to each target cell. but the abso-
hne costs change (compare ·11_1e costs on the
right of Figure 10-2$ to the nght of Figure
I 0-29).
..-
2 l I • (5 • l)
.. •
•
(5
(5
' I)
• 2)
55 40 10
2 I J 55
45 30 zo
_ ,o -+
Uf'll1$
fii:ur~ IOi.29: Cakulacions of th,: lr:m:I cost from a sourc,: c<"U Co ~b other cell usU~g ro,,·<olunm
distaucc C;11kulatioos.
464 GIS Fundamentals
Many implementations of a friction sur· 1he output that originate from distinct input
face or cos1 function allow you 10 search for combinatious.
the minimum cost to travel 10 a cell from a Raster analyses can be locaU. neighbor-
set of source cells. The straight line distance hood. zonal, or global. Local analyses are
may 001 be 1he "leas1 cosily." and so al1ema-
very conunonly us«i because they are eS$en·
1ives may be examined. There are many tial to many lasks. Neighborhood operations
routes from any siou.rce cell to any de.stina. are particularly common in raster analyses.
1ion cell, thousands of distinct routes in most and may be applied with a moving window
instances. Software iypicaUy implemems approach. A moving window is swept across
some optimization algorithm to eliminate all cells in a da1a layer. 1ypically mulliplying
rou1es early on and reduce search time, kernel values by da1a found aronnd a center
thereby arriving at the cost surface in some cell. Window size and shape may be modi-
acceptable time period. fied a11he edges of the da1a layers. Moving
No1e 1ba1 barriers may be placed on a windows may be used 10 specify a wide
cost surface 10 preclude travel across por- range of combiuatorial. terrain. and statisti-
tions of the surface. These barriers may be cal functions. Zonal functions can mimic
specified by seniog 1he cos1 so high 1ba1 no some elements of vector analysis by identi-
path will include ·rhem. Any circuitous rou1e fying regions as 1he basis for work. Global
will be less expensive than traveling over the functions. examined above in the context of
barriers. Some so:ftware allows 1he specifica- map algebra. typically draw on all cells in
1ion of a unique code 10 idemify barriers. and 1he raster 10 come up with sin.gle values.
this code precludes movement across the Finally. 1bere are general analyses 11ta1
cell. may be applied using ras,er data selS. Com-
mon.ly used are buffering and overlay. but
Summary 1he1e A~ joined by mo~ sophis1;ta1ed forms
such as cost or friction surfaces lhat are an
Raster analyses are essemial 1ools in imponant subse1 of proximity aualyses that
GIS. and should be 11nders1ood by all users. may be easily applied in raSler ruialyses. A
Raster analyses are widespread and we11 cos1 surface identifies 1he iravel coslS
developed for ma:ny reasons. in part d11e 10 required for movement from a specified set
1be simplicily oftbe daia sirucmre. the ease of locations.
with which contimuous variables may be rep·
resemed. and 1he long bis1ory of rasier anal-
yses.
Map algebra is a concep1 in which ras1er
data layers are combined via sunuua1io11 and
multiplicalion. Values are combined on a
cell-by-cell basis. and may be added. sub-
1rac1ed. 11111hiplied. or divided. Care must be
taken to avoid ambiguous combinations in
Chapter 10: Raster Analyses 465
Suggested Reading
de Smi1h. M.1.. Goodchild. M.F.. Longley. P.A. (2007). Geospalial Anal)'sis, a Com·
pre.l,eusfre Gufde ro P;iuciples. Tec/111iq11es. and Software Tool~·. Leicester: Mata-
dor.
Eashnan. J.R.• Jin, W., Keym, P.AK., Toledano, J. (1995). Raster procedures for
mutci~cri1eria/muhi--0bjec1ive decisions. Photogmmmern·c Eugiueen'ug and
kemore !ienni,g, 61 :S39- S47.
Eastman. J.R. (1997). idrisifor lf7ndou·s. Worcester: Clark University.
Hengl. T. (2006). Finding the right pixel size. Co111p111ei·s and Geosciences. 32: 1283-
1298.
Mitchell. A. (1999). T/,e £SRI Gnide 10 GJS .4nalJois: Geogmpl,ic P1111erns and Rela-
rionsbips. Redlands: ESRI Press.
Study Questfons
10.2 - Why musl raster layers have compatible cell sizes and orientations for most
raster combination operations?
10.3 - \Vhat is a null value in a raster data set? How is 1his mill value typic.ally treated
in a raster operation?
(I) 5 6 8 3 13 16
19 17 9 ti @ 7 15
8 5 [I 8 3 13 16
19 17 9 ti @ 7 15
Chapter 10: Raster Analyses 467
10.6 - Wbal are the values in cells CI. C2, C3, and C 12 in tl1e output layer?
Con(Loyerl < 2. o. 1)
Loyert
2 N ! 2
! N ! 3
• I 2 0
Outpur
N 2 N I
Cl Cl CJ C4
C5 ,. C7 C8
C9 c,o en c,2
C,J C15 C,6
'"
I 0.7 - Wbal are the values in cells CS. C7. C IO, aud C 13 in tlte oulpul Uayet1
Con(Loyerl < 2. o. 1)
Loyer I
2 N ! 2
1 N J 3
• I 2 0
Oufput
N 2 N I
Cl Cl CJ C4
C5 ,. C7 C8
C9 c,o en C!2
10.8 - What ar-e the cell values for cells Cl, C3, C4, and CIO in the output layer.
below?
Ou1pul • CON((loyerA .. N), t, loyerA)
loyerA.
N N I 0
I N 2 N
N 4 N N
Ou1 w
N
0
' I
Cl CZ CJ c,
cs C6 Cl cs
C9 CIO Cll C12
Cl3 C14 Cl5 C16
10.9 - Wbat are 11te cell values for cells C2. CS. C7. and C l I in the ompm layer.
below?
Output • CON((loyerA=cN), t, loyerA)
loyerA
N N I 0
I N 2 N
N 4 N N
Output
0 t N I
Cl C2 CJ c,
cs C6 Cl C8
C9 CIO Ctl Ct2
10. 11 - Whal are the values in output cells C9. CIO, Cl I, and CI2?
Output = CON(ISNULL(loyerA). 1. N)
loyerfl
I N N 0
0 N 2 I
N l 5 0
Output
N l N I
Cl C2 C3 C4
cs C6 C7 CB
10.12 - What are the values in output cells C7, CS, CI3, and CI6?
Output = CON(lSNULL(loyerA). I. N)
loyerA
I N N 0
0 N 2 I
N I 5 0
Output
N I N I
Cl C2 C3 C4
cs C6 C7 CB
10.14 - D<>es a NOT operation applied 10 a raster cell value containing a, NULL value
rentm a ~VLL value. a zero value. a I. or some other non-null value?
1 I 0 0 0 1 0 1
0 0 0 1 1 0 0 1
and =
1 1 0 0 1 1 1 0
1 0 0 1 1 0 1 1
1 1 0 0 0 1 0 1
0 0 0 1 1 0 0 1
or =
1 1 0 0 1 1 1 0
1 0 0 1 1 0 1 1
0 0 N 3 0 5 0 1
1 0 0 3 1 0 0 1
and =
1 7 1 0 1 0 1 0
0 N 0 1 1 0 1 1
0 1 0 0 0 1 0 9
0 0 0 I 1 0 0 I
or =
3 0 0 0 1 1 N 0
1 0 1 1 0 0 6 0
Chapter 10: Raster Analyses 471
10.18 - Describe how Jocal arithmetic functfons can be used to apply a ,cJip function
in a raster euvironmeut.
10.19 - \Vhat is a kemel in a moving window operation'! Ooes the kemel size or
shape change for dilferem ponions or the rnster data se1? Why or why not?
10.20 - \Vhat moving window operation would most likely use the kemel below?
·2 ·I ·2 ·I ·2
-1 0 0 0 ·I
·2 0 25 0 -2
.J 0 0 0 .J
-2 -1 ·2 -1 -2
10.21 • \Vha1 moving window operation would most likely use the kemel below?
I 2 I
0 0 0
-I -2 -1
10.23 - Calculate the cost of travel between A and B. ru1d A and C, over the cost sur·
face below. both by straigh1 line. and by row-<:olumu paihs.
Source/target cells
A
B c I 10 units
Cost surface
3 5 6 8
4 1 7 5
1
2
2
5
4
1 6
I 1 [SJ1 remember. the
diogonol of o square
is 1.41 x t he edge
10.24 . Calcula1e 1be cost of travel be1ween A and B. aud A and C. over the cos, sur-
face below, bo1b by s1raigh1 line. and by row.column paihs:
s ourceItarqet cells
A
B c [ 10 units
Cost surface
2 4 7 8
3 I 7 9
I
5
1
1
4
4
1
7
2
[SJ1 remember. the
diogonol of o square
is 1.41 x t he edge
473
11 Terrain Analysis
Introduction
Elevation and related terrain variables tion influences soil moisn1re and hence food
are important at some point in almost every· production. Terrain in large part affec1s
one's Life. Elev.11ioo and slope cl\ange across wa1er qualicy 1brougb sedimeo1 generation
1be Landscape (Figure 11 -1 ). and •his varia- and 1ranspor1. 11 sirongly influences irans-
tion detemiimes where rivers flow, lakes ponation networks and the cost and methods
occtir. and Ooods are frequeol Terrain ,,aria- of building cons1ruc1ion. Terrain variables
fian~ 11· 1 : Au e:<rui'11c of .t fcou.i.n·baKd ii~gc ofdie western Vniti:d Stales. Shnding, b:i.s,c,cl oo loci!
clc:,.-tioo. c:::inphuiz« tcm.in shape:. Topographic featttrcs are dcari)' ickotified. inchidua,g 1.bc Cc.-ntral Val-
ley ofCali.ion,ia a1 tbc left oftbc imaj:c. and to lite center and ri3bt. the pamJJd mowua.1os aud ,-alleys o(
the B!ltill !llld R.·•ugc tcgion (courte.t{USOS).
474 GIS Fundamentals
are frequently applied in a broad range of measurements are slow. and e.s1in1ates are
spatial a11alt, es (Table I l -1). likely 10,.,,ry among human analysts. In
Given the imponance of elevation and contrast. d igital slope calculations are easy
other terrain variables in resource manage· to program, consistent. and have proven to
mem. and the diffic11lties of man11al 1emin be as accurate as field measurements.
analysis. it is not :surprising 1hat terra in anaJ. Both data and melhods exisl to extract
ysis is well develcpe<I in GIS. lndee<I. it is imponant terrain ,.,,riables via a GJS. Digital
often impractical 10 perfom, consis1en1 1er- ete,•ation models (DEMs). described in
rain analyses without a GIS. for example, Chapters 2 and 7. have been developed for
slope calculations over large areas based on most of 1he world using methods described
manual methods are slow. error prone. and in Chapters 5 and 6. and DEM renewal and
inconsistent. Elevation change over a hori- improvement cootinues.
zontal distance is difficuh to measure. these
upslope area watershed area Sotl moisture. water runoff volume and
above a point timing, poUutlon or erosion
hazards
NormQl
direcilon
!la:ure ':?~~: A d~ictiou of., stuf:.m: rcprcscmcd by a ras1cr DEM. aud cbRn,tes io z ,,atucs (or cdls med
,M C31C'\ 1.1lll1Ut \':U10US tC1Tllm llltn'butc:S..
4 76 GIS Fundamentals
..
-··.
.. ~·.-/
.
Higher elevalions occur al the lower right where s is slope, aton is I.he inverse 1ru1gen1
corner. and lower elevations occur tow3rd l\uiction. z is elevation. x and y are the
the upper left. The direction of steepest slope respec.tive coordinate a.«es. and dZ/dx and
trends from one cornertowards tJ1e.other, but dZ/dy are calculated for each cell based on
does not pas~ directly through the center of elevation values surrounding a given cell.
any cell. How do we obtain values for the The symbol dl./dx represents tile rise
rise and nm? Which elevations should be (change in Z) ove.r lhe nm io the x direction.
used 10 t.1lcu!a1e slope? Intuitively we and dZ/dy represents the rise over the run in
should use some combimuion of a number of they direction. These fonnulas are combined
cells in the vicinity of the center cell, per- to calculate !he slope for each cell based on
haps all of them. I.he combined change in elewation in the x
Elevatioo is often represemed by the let- and y directions.
ter Zin l'errnin functions. These terrain func. Many different fommlas and metl,ods
tions are usually calculated with a have been proposed for calculating dZ/dx
symmetrical moving window. A 3 by 3 cell and dZ/dy. The four ue(lre.st cells method.
window is most conunOIL aJthougb 5 by 5 shown in Figure 11-6 and at the top of Fig-
aud other odd-numbered windows are also ure l l-7, is the simplest. and uses the four
used. Each cell in the window is assigned a cells closest to Zo:
subscript. and the e.levation values found at
window loca1ions referenced by subscripted
Z values. dZ/d• , (Zs· Z,l/(2C) (11 Zl
Figure I ll-6 shows au example of a 3 by
3 cell window. The central cell bas a value of
44, and is referred to as cell Zo· The upper f!Z/IJ; • (Zi • Z7)/(2C) (tlJ)
left cell is 1eferred 10 as Z1• lhe uwei center
cell as 22• and so on through cell Z8 in the
lower right comer. Here. C is the cell dimension and the Zs are
Slope at each ceoter cell is most com· defined as in Figure 11·6. T.his method uses
monly c.alculated from the formula: the four cells that share an edge with the
focal cell. z•. 25 . Z2• and Z7 • in calculating
dZ/dx and dZ/dy. This four nearest method
2 is perhaps the most obvious and provides
s = o,on (~ )' + (~) (11 t) reasonable slope ,~Ines tUld:er mmiy circum-
s1ances.
y ,, ' z,
42 45 47 tor Zo:
'• • ,, dZ/dx • (49 • 40)/20 • 0.45
dZ/dy • (45 - 48)120 • -O.l5
40 44 49
x
fiam't' 11-'; Slope ~kul_.ation W.sed on cells ndjau:ent to 1be ceotcrcdl.
4 78 GIS Fundamentals
40 44 49
•
-1 " 0
z,
I
•
0
~
0
.. 0
'• • ,, ,. z, z,
44 48 52 0 0 0 0 -l 0
OZ/a,,.• (Z, • Z, )12C dZldy, (Zz • Z 1 )t2C
dZ/dx * dZ/dy .
(49 • 40)/20 • 0 45 (45 • 48)120 • -0 15
40 44 49
~
-2
~
0
.. 2
z,
0
~
0
.. 0
z, z, z, z, z,
'
44 48 52 -1 0 l -1 -2 -I
- c-10-- dZ/dx . dZ/dy.
((Z3 • z, ) ·2<Zs · z,) ((21 - Z6 ) •2(Z, - 2 7 J
·(Z. • Z,, ))/SC ·(Z3 • Z8 )J/8C
dZ/dx • dZ/dy.
[ (47 42) • [ (47 • 52) •
2 (49 • 40) • 2 c,s . 48) •
(52 • 44) 1180 (42 • 44) )/80
• 0.39 • -0 16
Fi&u1"' U -7: F0\13' near«t cells met.bod (1op} aud third order finite diff'crcocc rucdk>d (bottoitti
explained on the ucxt pa3c), used iu cakul.atl.ltf slope:. C isc:dl s12c :md dZ/dx :md dl!d,f :uc the
c:b:utg:n ifl cle\'lltion (riK) with d1111)@:c.C in honiont:il ~tion (nu1). Note tbll.t different i lopc \'alun
al'C produced by the difftte11l mcthod-,.
Chapter 11: Terrai n Analysis 479
plG"I C.Ul'\'Otute
?(Ct-11. EGl • FGM)
ci. H'
prof1k CIIW't\J't
·2(DG1 , £>i ,FGti)
c'. M2
Different softwares apply differeo, sign curvanire (Figure 11-9). Piao c1,rvan,re may
conventions, sometimes maki_ng concave also be envisioned as a vertical plane slicing
curvamre positive, somelimes assigning into the surface. and is measured in a hori·
them negative values. Raw values are zomal plane. The surface traces a patb on the
reported in some versions. while other soft- face of lhe plane, and I.he plan curvature is a
wares scale curvatures over a standard measure of the shape of Iha, path. Concave
range. e.g.. from O 10 I00. As with most spa- plan cu.rvamre values are small or negati\le
1ial analysis. the specific software imple- for sloping valleys or clefts. while convex
men1a1ion should be verified over known plan curvamre values al ridge and peak si1es
1es1 cases. are large or positive.
Pfau cw,·nn,re is the profile shape in the
local direction of level. at right angle 10 the
steepest direction. This me.ans plan cur\la-
ture is measured al a right angle to profile
Chapter 11: Terrain Analysis 481
tiaurt" u . lJ; Tcmiin m<>1pbometty, or n,orl)l1ome~c: feat\ll"CS. may be deri,·cd from dircction3I conn:xity
incas:ures. Atmllded map of a 11,ouutainous arc-.1 shows ,1lllcyt aud cbaw,cls (left). wbichi arc: idauificd visi
inorpl1omc1ric COT.till :uuilysc::s aud sbowu as unif01mly Wdcd arcat (right).
482 GIS Fundamentals
••
• ' . \
•,
·-
••
Fi&UN" 11-12: The waters.bed and drainage oetwork for tbc Brule Rin:Tin
uortlKtU Miunesota. Water cltains downslope from bia)ll.and lakes. at the leA
11.1~ l~p. 1ow11rd .Lllk~ Superior, ~11.be lower riiflt. ·me wtucrsl1cd 11rc11 eon-
tnbuhog lo tbe m"Cr , flow, tbc m ·cr path. ruid lnl:.c locn.liom o.o be m,,ppcd
from a DEM. .,,,,......
Chapter 11: Terrain Analysis 483
"! "'/
-":_.!,A ··," r\"
/
P'\ "\ ""t "'\ ..-.
-- ..- :, "\
"' "" "/ ~ flow direction
..-
...
-
-· "'; "\ "\.. " ...
"' "'\ "''-. '"'-.
••'-. "' "' ....
watershed from
flow direction
·, ...\, -
~
"'\
\ \
'".......
- °/' /
\
;"1"
droinoge network
from flow direction •
upstream oreo
,..
""'.. _,,,,. ·-~ -;-- -,
" "; ,.I ";
I -/
"';
-;
Fl~•~ U- 13: Flow dir«tiou (arrow. and nwubc:rrcfic:ctiujl a?Umub degrees). w1uc:nhcd. ind dnijnagc- uc-t-
wo1·k shown for• m51cr grid. Elc,'2tion d:u:11 arc used lO dcfu1c: d1c: flow dircc1ion for c:ttl1 c:dl. Tock flow
d.ircc1lon1 IUC' thcu used to <ktmni:i,c: 11 mimbcr ofimpor1~ot hydrolog.ic ti.mclious.
buried storm :sewers. and oLher built features but this leads 10 more complicated raster
alter flow directions in ways that aren't rep- s1n1cmresand calcuJa1ions.
resented by terrain. However, subsurface \Vhen ihe flow diredlo11 arrow from one
drainage and built features are oflen based cell does 001 point exactly a,1 the adjacem
ou modified ilow directions 1ba1 are first cell, we may distribute 1he Oow to more than
derived from surface shape. one adjacent cell. There are various ways to
Flow direc1ious may be envisioned as au dis1ribu1e flows among adjacent cells. The
arrow from a single cell 10 a single adjacent D8 method is common. and assigns all flow
cell, and stored as compass angles in a raster from a cell to the cell with the steepest
data layer (Fi,gure 11 •13). Acceptable values
are from O10 360 if1be angle is expressed in
degrees azimuth. Alternately. flow direction
can be stored as a number i11dit•1ing the
adjacem ceU ·10 which water flows. taking a
value from I to S or some other unique iden-
lifier for each direction towards cells..
The use ,of a sin_gle Oow direction is an
incomplete represen1a1ion in many
instances. ceals often exhibit divergent flow.
in muhiple di.rec.tions out of a cell to muhi·
pie adjacent cells (Figure I l-14). Flows may
also be convergent, with m11l1iple cells con-
tributing 10 a cell. The most common flow
direction methods provide a single direction
for each cell. so divergent and some conver·
gent flows are no, represented. One solution
involves recording sub-cell flow directions, fianN' U -H : Au C:<llmplc of simple (fl), co1ivcr·
gcut (b), aod divcrgciu (c) flow~
484 GIS Fundamentals
downhill grndienl (Figure 11-15, lefl). The nOI been widely implemented. A more com-
OS is simple 10 understand. program. and mon option is 10 use higher-resolution raster
store. but is panicularly poor at representing data such that raster cell size is ssnall enough
divergent flow and now in low-gradient to make within-cell divergence or conver-
areas. This can cause large errors in deri\'ed gence impacts negligible.
measures such as upslope contributing area Flou·1ux ,mwlmi011 m~a, conrt-ibuting
or soil moisrure indexes, and lead to atypical oren, or upslope area are otbe.r important
drainage networks in nearly Oat areas. Out· bydrologic characteristics. A Oow accumu-
put flow direction rasters derived from the lation area function is based on a flow direc-
OS method may be represented with only S tion surface. The flow accumulation
codes, allowing a simple and compact data function places a value in each cell that is the
layer. area uphill that dra.ins to that eel I.
Altemative flow direction methods may Watersheds may be identified once a
assign Oow 10 multiple cells. and hence rep- flow direction surface has been detennined.
resent some forms of divergent now. One Flow direction is followed "uphjll'" from a
common method. known as D-it!fiuio,. dis- point. until a peak is reached. Each uphill
tributes flow 10 oae downslope cell when the cell may have many con1rib111ing cells. and
flow direction is exactly toward the center of the flow into each of these cells is also fol-
the cell. and otberwise assigns a portion of lowed uphill. The uphill list is ace11mulated
the flow 10 each of the two adjacent cells in recursively umil all cells contributing 10 the
the downslope direction (Figure 11 -15. slarting ceJl have been identified. and thus
right). The split is proportional 10 the a11gles the watershed is defined.
be1weeu tbe steepest downslope direction
and the respective cell cemers. This reduces Flow direction in Oat areas is difficult to
!he main shortcoming of !he OS method, calculate and prone 10 error. ASpect is wtde-
while slightly increasing complexity. fined in a 1ntly Oat region. because there is
zero gradient. Flow directio!ls i11 these cases
While perhaps more accura1e in many may be strongly influenced by small height
conditions, multiflow direction systems have errors. so flow directions are sometimes
..
: : 2 . 2
...... .... .
. ... . ..
. , , ,oouoo,o, . . , .. . . . . . . . . . . . . . . ... . . . .. . . . ..,,•••• • .
............
. ..
.................................... ..........i
~
manually specified. or the aspect calculated (Figure 11-13, Figure 11-16 ). A drainage
using a larger cell size or neig)lborhood. The ne1wod< may be defined as any cell that bas
neighborhood may be successively a contributing uphill area larger than some
expanded until an unambiguous flow direc- threshold. These drainage netwod<s are only
1ion is defined. approxiruaiions. because the melhod does
Vector incision is another common not incorporate soil texture. depth. porosity.
melhod for prescribing flow diree1ion in flat subsoil water movement. or other properties
areas. A vec1or Oowpali1. e.g .. a digi1ized that affect surface flow. Nonelheless. a
stream segment. is overlain with the raster, drainage ne1work deri,•ed fr-0m terrain data
and raster val ues modified downstream alone is often a useful first approximation.
along 1he vec1or 10 specify an appropria1e The uphill area for each cell may be calcu-
fl.ow direction. The most common approach lated. and 1be area compared 10 the threshold
lowers elevations along the flowpath, taking area. The cell is marked as part of the dra.in·
care 10 noc crea1e a sink aloc1g or at the end age netwod< if the area SUIJ>asses the thresh·
of the flowpalh. old.
A dro;nage network is the set of cells A drainage ne.twork may have discontin·
1hrough whic.h surface wa1er flows. Streams. uous lines when local small dams or sink
creeks. and rivers occur where now direc- areas capnire Oow. where all surrounding
tions converge. Thus. a flow direction may cells point into. and none out of. a location
be used 10 produce a n•1p of likely stream (Figure 11-16). This may crea1e cells imme-
location, prior to field m..,pping a stream diately downhill from the sink that bas a uro
I J I I \ ..... • - • • • •I
/~~
I
• I \
• -- - l.\ •
I ,r . /
' I
'· • -· - .- - ./
.. ".. --
• •
. ,... .....
' /
• \,
'· ./ •
"
- .-
-
/ / I I
•
\
1
I
I•
•
I
I
I
I
/
/
/
/
,•
~
.-
- .,..
I I
!
.,.. / I I I I I I / .,.. / . -· • .-
_.,
-· __. / I I I / / ...... .,.. _., • / /
(don«t. blue Lil el«trouie edition) alou~ 111 coi1,·e1ge:su zone. wi1b ,·alue, above I.be suzface: accumulation
threshold. On rtJ1c left of the fif;'Ul'C is. ti ocal still.:. with aU Oow ditcctiocis. poinlina, inwnrd. 11-c flowlinc
froin 1bc 1q,pa lcfl cnlet1 the ,ink nJd doci n<>t e."(.il.
486 GIS Fundamentals
upstream are.a. The stream will end, bul then plowed field may retum spol heigh1s for
may begin again f\1nher downhill. Namral local mounds and furrows. incomple1ely bar-
sinks may be qui1e common in knrsr regions. ves1ed crops. and fami machinery. A log or
where sinkholes occuron the surface due to dense sJm1bs in a steep.sided ravine may l>e
collapse<I subterraneBll caverns (Figure 11· miside111ilied as 1he ground surface. creating
17). Hydrologic sinks are also common a borrier in 1be data 1ha1 doesn'1 represem
along drainage ways in dry areas where tme conditions. Pits can be artifacts of inter.
check dams are built 10 reduce Dash flooding polation methods lha1 are used 10 fill in lhe
or store water. Pits are also common in areas grid values in unsampled loea1ions. Pos1 pro-
of deranged topography, for example, in lite cessing aims to remove these spurious read.
relatively Oat. recently glacia1e<l 1errai11. In iugs. but 1hey are commoo nonelheless.
mos1 other areas. pits are often data ar1ifac1s Pi1s may cause problems ov·er locally
and do not represenl real geography. Pits flat surfaces, often along drainage ways
represented in DEMS should be evalua1ed (Figure 11-18). Flow direc1ion and flow
earlier during processing 10 de1ermine if1bey accunmlation func1io11s oft'en return errors
are real. and how processing alters results. due to spurious pits. particularly near water·
Random errors in DEM elevation values courses. These low areas are shown as white
often create spurious pil.s (also known as pa1ches in 1he figure. Tiiese appare111 ponds
false sinks). Because our technologies for do not exist in many landscapes~ in that an
crea1iog DEMs ace imperfect. DEMs often erroneous pi1in a stream course-creates false
contain these pits that aren't on the Earth's basins.
surface. Pi1s causes errors in subsequen1 hydro,
Spurious pi1s are found in mos1 DEMs logic calcnla1io11s. Drainage networks are
due to small elevation errors. For example, incomplete, flow accumulation values are
DEM dara collected wirli LiDAR often lidve 100 Jow. and warershed~ may be impr6jlerty
a small ground foo1prin1. and may sample iden1ified when pi1s are encoumered (Figure
small features that are above the surrom1ding 11-19).
ground level. A laser image over a recentJy
Flgurt t t. t 7: E.xrunples of 11. discom.inuous iltcam nctw·ork, wi1b downslope Row s1oppi.ng 11.1 a link (A}.1t!ld
then rcs.t.)11ing downllill (8} wl1cn the Qow a~cu1nnlation threshold is ~Jilin s,~ucd. ibc san1c kind of
break is o~en·ed 31 a roaid cl'O$$.ing (C aud D). although th<rc is likely t1 cuh·c11. uol rqlfekotcd in the 0 1:M.
Chapter 11: Terrain Analysis 487
f luurt- U· l3: Examples of crrotm>us pits ca11Kd by DEM cmm. The liiht-colorcd areas along dntinagc
wa)'I fflO'I',· local depressions that arc anifacts ofdata cnon. aod do1ft extst on the landsc<11pc, Drnin:agc net·
worb acid wucnhcds based oo uufiUcd OEMs wiJJ be- i.n «ror. because t1w: Row ditectioos based oi, the
DEM• will be inw11rd ZII 11.ll pill.
Filled
' - water- I!
shed ~~-:-' :,..~~{.;;
1-streom -stremn '·
:..,-_.c:;- 1
r ~ /',..
!.<""
Fl,a:utt 11-19: \\'iltmbcdi aud stream networks dctiue11tcd from uucouditiom:d DEMs (lc·ft) often rcsuh in
ouu ini strcntn flC@n\OltS (:u o. b, d, :ul<l e), atid incoo.1plc1c \\'lttmbcds (upsltdtru from 0t and b, :n)d :u c).
488 GIS Fundamentals
Fia:u~ l1·21: A DEM (top} with b1~c siub al A through O a, a rc-.suh of highway bcnm1, with scnta.1
s.unlltt si.nla in 01htt loc11t1ont. SU:W: a-c ttmo,·cd by either:. f'ill process (middle), or btt3Chin,: (bot·
10111). Breaching reiuh-.: in an output ,,ufi11c:c 1m1 i.l morc acci.1r.11c for1110i.1applic:21ious.
490 GIS Fundamentals
Conditioned Elevation
f i&llr" ll-22: An e;ump!c of die stcpi. required tocrcalc watmbed and clrnin.igc uctwork fen.hares from a
digi1al elevation 1nodet
Chapter 11: Terrain Analysis 491
rainfall required to saturate an area and pre· These and others are described in the refer-
dic1 lhe likeliltood and io1ensi1y of flooding. ences a1 lhe end oflhis chapter.
The specific cntclm,em area (SCA) is
defmed as lhe 101al area draining 10 a poin1 Contour Lines
relative 10 droirulge widtll, in ra11er d.11a sets
calculated as Co111011r li11es. or 1opog.rapbic contours.
are co1lnected !lites of u11ifonn elevation that
nm at right angles 10 the local slope. Contour
lines are a common feature on many map
SCA , AAEAJC (11.5)
series; for example. they are depicted 011 1he
USGS I:24.000 scale.nationwide series, and
Bri1ain·s 1:50.000 Ordinance Survey maps.
where AREA is the accumulated surface area The shape and densiry of coniour lines pro-
upmeam from a poim. and c is 1he ras1er vide detailed infonnation 011 terrain height
cell dimension. Srrenm pou-er index (SP[) is and shape in a 1wo-dimensionel inap. with·
defined as: out lhe need for con1i11uous 1one shading.
Both color and continuous tone printing
were importan1 limi1a1ious for pa.st canogra-
SPI • SCA • ton (bl (11.6) pbers. Comour lines could be easily drawn
with simple drafting tools. Although contin-
uous tone printing is much ress expensive
today. contours will remain ,commonas they
where bis tlte slope ma poun. and SCA is as have entered the culture of r.n,,p making and
defined above. SPI is used 10 identify the map reading.
po1en1ial erosion a1 a point which depeuds
both on the upstream area and hence ability Several rapid. efficiem methods have
to accumulate water. and the local slope. been developed for caJcuJating contours,
which drives the erosive energy in water ei1her from points or from grid daia (Fig11re
flow. 11-23). Early contour maps and DEMs were
Perhaps the mos! commonly applied
wetness inde1< is calculated by:
Contour placement
d \,...----Pomr with he19h1
w - ln( -SC~
ton
(11.7)
Ht,
. '. ·- - H I
. ·- ···
of He
d2
where w is the weiness index at a cell. SCA is H,
the specific catchment area. and ~ is the A contour passes through a height value
slope a1 lhe cell. This index bas been shown He at a point on the straight line be-
to effectively represem the increased soil tween known points with heights
wetness due to large upslope areas and low Hb. Hf (see above). Here, we ensure
slopes, parliC'Ularly when combined wilh Hb <Ht. The point Is at a calculated
plan curvature and profile curvamre mea- distance d2, as shown in the diagram
above. according to the formula:
surements. These factors sort terrain along
ridge-10-s1ream and conve~·to-concave gra· dz = d i , Hr - He
dieo1s. H, - Ht,
There are m.any other topographic iodi· FI11urt u ..2J: Contour liuc locatio,u att oAe:it
ces. e.g.. for estimating total solar radiation. cs.1un1ucd from poi111 hc:ig!11 loc111ions. a• a lindr
surface air drainage. orsurface roughness. proportion '?f the be-igl1t and d~a.111:e diff'crc,,ces
l,ct\,·«o pouus.
492 GIS Fundamentals
camera ~ otion
flat area (the .foreground of~,e photo, al bot· tive travel <tis1a11ce along lhe specific profile
tom). where elevation does no1 change path. Profile plots are common on 1he edges
much. ru1d there are few contour lines. Steep of maps. particularly maps of off-road. bicy-
areas and cliffs are depicted by an incre.ase cle, or cross.cowitry routes.
in contour density. as shown at poin1 s. with
changes ln steepness depicted by changes in
Profile plots otkn have some level of
vertical e.x..1ggeration bec.ause horizontal dis-
density (above and below point C). Peaks, tances are usually much larger lhau elevation
such as Ille lop of Washington's column. o. gain. Ver1ical e,,aggeration i·s a scaling factor
and Nonh Dome. E. appear as concentric applied to the elevation data when shown on
rings. Note that contours may succinctly rep· the graph. For example. Figure 11-25 shows
resent complex terrain stnK:tll!es, such as the a square graph tha1 depiclS approximately 31
curving arches in 1be cemer of1be photo- km across the Earth's surface. The vertical
graph, and siJ.Own below point C. and the eleva1ion axis spans approximately 2.5 km
overhanging cliff, to the left ofpoinl F. over Ille san1e dimensions on the graplt This
is a ver1ical exaggennion of approximately
Profile Plots 12 (from 3112.5).
Profile plots are another common deriv,
a1ive of elevation data. These plots sample
elevation along a linear p,vfile path. and dis-
play elevation against distance in a graph
(Figure 11-25). Elevation is typically ploned
on they a~d$, and horizontal distance on 1he
x axis These profile plots are helpful in visu-
alizing elevation change. slope. and cumula-
Elevation Profile
_3000
-"
1'!
~ 2500
~
c:
:B 2000
j A
w
1500
1000 +-~~~~~~~~~~~.--'
Fiaur. 11.H :An e,uirnplt of, pro6le plot The pl'bfile p:i1h i, thcwn on !ht di,dtd relief in~~e (lef1). with
the ;tarting point A aod ending point 6 . The profile plot is ;bown on the rigl1t, with corrcsponclmg mirting
aod cod.in; poimtt. The ploc shows the chai1;c io clc,'ati:oo 11lollfil the path. Note tha1 the ,·Cl1lical cxaag,crat100
hett ia n.ppt0xir1u1dy 9 10 I.
494 GIS Fundamentals
Vlews hed s
- visible terrain
The \Tewshed for a poin1 is the collec· - hidden t errain
tion of areas visible from that point Views
from many locations are blocked by terrain.
Eleva1ions will bide poin1s if lhe eleva1ions lines of sight..__~ /
are bigber chan the line of sight between die
viewing point and target point (Figure 11-
26).
viewpoint
Viewsheds amd visibility analyses are
quite important in many instances. High- Fi1,urt 11-26: Mechanics of de6i:iio3 a ,'ie\\,.
volt1ge p-0wer lines or cell 1owers are often sbcd.
placed atl'er careful coosideration of their
visibility, because mos.t people a.re averse to
viewing lhem. Communications antennas. less titan lhe slope 10 a cell closer 10 lhe
large iudustrial comple.'<es. and roads are viewpoim along the line of sigh1. then 1be
often localed al least partly based on their target cell is 001 visible from the viewpoint.
visibiliiy, and viewsbeds are specifically Specialized algorithms have bee n developed
managed for many parks and scenic areas. to subs1an1ially reduce the time required 10
calculate viewsbeds. bm in concept. lines of
A viewsbed is calcula1ed based on cell-
sighl are drawn from each viewpoin1 lO each
10-cell imervisibiliiy. A line ofsight is drawn cell in 1he digital elevation data. lflhere is
belween the view cell and a potentially visi-
no intervening terrain. the ceUis classified
ble 1arge1 cell (figure 11-26). The eleva1ion as visible. The classification ideutifies areas
of this line of sight is calculated for eveiy 11ta1 are ,~.sible and areas that are hidden
illlervening cell. If the slope to a target cell is (Figure 11-27). Viewsheds for line or area
features are the accumulated viewsheds from
all the cells in !hose features.
Chapler 11: Terrain Analysis 495
Shaded Relief Maps incoming light ray and the s.urface nom1al,
shown as 0 in Figure I 1-29. The surface nor-
A shaded ,~liefmap. also often referred mal is defined as a line perpendicular 10 the
to as a hU/shode map, is a depiction of 1be local surface. Direct beam sunlight striking
brightness of terrain refleclions given a ter·
rain surface and sun Joca1ion. AJ1bough
shaded relief maps are rarely used in anaJy.
ses, lhey are among the most effective ways
10 conumu1ica1e the shape and sm,c1ure of
1emiin fearures. and many maps include
relief shading (Figure I l-28).
Shaded relief maps are developed from
digiL,I elevation da1a and models of ligh1
ref1ec1ance. An anificial sun is "positioned'"
a1 a Jocaiion in 1he sky and ligh1 rays pro-
jecied onto 1he surface depicted by 1he eleva-
tion data. Light is modeled that strikes a
surface ei1ber as a dire<:1 beant. from 1he sun
10 1he surface. or from background ..diffuse··
sunlight. Diffuse light is scattered by !he
aunospbere. and illuminaies "shaded.• areas.
ahhough 1he ;Jh1mina1ion is 1ypically much
Jess than that from direct be.am.
f'ia11~ u .,?9: Hi.1.lsmde IU3J>$ $li,ow n-Oectauce a, II
The brightness of a cell depends on the fom,ion of the angle.. 0. between sunbeams and mr·
local incidence angle, 1be angle be1ween 1he lice oomlal.t.
N A
Fli,u·t 11-28: Relief shad.in; is often added as a backjround ''\uidcr" o ther mapped data to pro,iidc a ~ o{
1crr11i11 UL,pc and i!cep1\cn . This sl1J1din1 pro\'idcs :1 truff·d.nnctisiorul pcrtpccti\'c (or~ nU:ppcd Atu, .u
<.kmoruiratcd in tJiis: rcltcf $li.,clirig of ll U.S. Oeologic1tl S\1n·ey 1:24.000 sc11lc q\aand~g;le mnp.
496 GIS Fundamentals
the surface at a right angle (9 = OJ provides and this may be shown as various shades of
the brightest rerurn. and hence appears light. grey in a hillshade surface.
As 9 increases. the angle between the direct A shaded relief map also requires a cal-
beam aud the ground surface deviates from culation of visibility, often prior 10 calcnla1-
perpendicular. and the brightness decreases. i11g the rcflecL1nces. Visibility 10 the s110 is
Diffuse sunlight alone provides a relatively de.termined; if a cell is visible from the sun,
weak return.. and hence appe.ars dark. Com- the slope and aspect values are used to
binations of direct and diffuse Light result in assign tlte cell brightness.
a range of gray shades. and tliis range
depends on the terrain slope and angle rela·
1ive to the sun'slocation. Hence, sub1le van. Terrain Analysis Software
ations in terrain a:re visible on sbade<I relief Terrain anaJysis and DEM data manage-
maps.
ment and analysis are imponant enough to
Calctllating a shaded relief surface be include(! in most general-purpose GIS
requires specifying rhe sun's position. usu- packages, including ArcGIS. GRASS.
ally via the solar zenith angle. measured ERDA$, ldrisi. and Manifold. \Vhile they
from vertical down to l11e sun's location. and suppon the most co,mnon se1of terrain and
the solar azunuth angle. measure(! from hydrologic analyses. none of tl1ese packages
nonh clockwise 10 Lhe sun's position (Figure include.s the broadesl range of terrain pro-
I l-30). Local slope and surface azimuth cessing and analysis functions. Specialized
define a surface oom,al direction. An angle analyses are often performed using software
may be defined between the solar direction with a specific focus on terrain analysis.
and the surface nonnal direction. shown as 9 These include tl,e Whitebox GAT. from tlte
in Figure 11 -30. As noted earlier. the amoum University of Guelph. and commercial tools.
of reflected energy decreases as Oincreases, such asthe Watershed Modeling System
(WMS) by the Scientific Software Group.
Whitebox GAT con1ains what is likely
tbe most comprehensive sel of 1errain analy-
,- sis functions in a freely available package.
/ Suppon is panicularly strong for hydro logic
surface and stream link processing and anal-
N ysis. witlt functions for calcnlatiJlg various
flow direction. accumulation. and watershed
delineation methods typically not supported
by 01her packages. Basic terrain modifica-
tion. LiDAR da1a input and processing. and
ArcHydro is a set ofbydrologic analysis Profile curvanire and plan curvature are
tools wrineu :as an extension 10 ArcGIS. 11 1wo other impor1an1 terrain analysis fimc-
suppons a falrly complete set of hydrologic tioos. These functions measure the relative
and watershed delineation fi.mctions. convexity or concavity in the terrajn, relative
Then: are many other packages avail- 10 the downslope direction for profile cuf\'ll·
able. inch1di11g SAGA. TAUDEM, Surfer. nire and the cross-slope direction (or plan
TAP, and MitroDEM. which provide various curvature.
specialized capabilities. and may be wonh Terrain analyses are also used 10 develop
investigating for users interested in term.in and apply hydrologic functions and models.
and hydrologic analysis. Watershed boundaries, flow directions,
flowpa1hs. and drainage ne1works may all be
defined from digital ele,•a1ion data.
Summary
Viewsheds are ano111<,r commonly
Terrain analyses are commonly per- applied terrain analysis function. Intervisi·
fom1ed witlti11 the framework ofa GIS. bility may be computed from any location on
These.analyses are important because l'errain a DEM. A line of sig)l1111ay be drawn froan
governs where and how much water will any point to any 01her point. and if there is
acc1u11ula1e <m the landscape. bow much no intervening terrain. then 'the two points
sunlight a site receives. and the visibility of are intervisible. Viewsheds are often used 10
human acth~ties. analyze the visibility of landscape alterations
Slope and aspect are two of the most or additfons. for example, when siting new
used terrain variables. Bolh are commonly roads. powerlines. or large buildings.
calculated via trigonometric functions Finally. relief shading is another com-
applied ina moving window to a rnsrer mon use of terrain data. A shaded relief map
D£M. Several kernels have been de,-eloped is among the anost effective ways 10 depict
to calcula1e changes of elevation in x and y terrain. Terrain shading is often derived from
direc1ions. and these component gradients DEMs and depicted on maps.
are combined to calculate slope and aspect.
498 GIS Fundamentals
Suggested Reading
Ali, G., Birkel, C., Tetzlaff, 0., Soulsby, C.. McDonnell, J.J .• Tarolli, P. (2014). A
comparison of weu,ess indices for 1he prediction of observed coUJ1ected saturated
areas under contrasting condi1ions. Em1l, Swfnce PTQ,:e,s.t and Ltwdforms.
39:399-413.
Ayeni. 0.0. (1982). Optimum sampling for digital terrain models. Phorog,.nmmerric
Euginee,·ing anti Remore Sensing. 48: L687-1694.
Baud, LE. (I '>86). Topographic partition of watersheds with digital elevation models.
Ware1· Resources Research. 22: IS--24.
Baral. OJ.• Gupm. R.P. (1997). Imegration of satellite sensor data with OEM for the
sn1dy of snow cover distribution and depletion pauems. lme11,mio11al Joumal of
Remote Sensing. 18:3889-3894.
Berry. J.K. (1986). A mathematical structure for analyzing maps. Em-ironmental
Mauagemelll. JI :317- 325.
Berry. J.K. (1987). Fundamental operatioJts in computer-assisted mapping. lmerna·
1io11al Jounwl of Geographic !11/ormariou SJ-stems, I: 119- 136.
Beven. K.l .. Kirby. M.l. ( 1979). A physically-based variable contrib1ning area model
of basin hydrology. Hydrological Sciences 8111/eti11, 24:43-69.
Bols1ad. P. v.. Stowe,T. ( 1994). An evaluation of DEM accuracy: eleva1ion. slope and
aspect. Ph.otogrammetric Eugiueeriug and Remore Seusing. 60: 1327- 1J.32.
Bonham.Carter, G.F. (1996). Geographic Juformariott Systems/or Geoscieu#sts:
,11odelling ,vith GIS. Ottawa: Pergl\Jllon.
Burrough. P.A .. McDoJtOell. R.A. (1998). Principles ofGeogrop!,;m/ Informarion
SyS1e111s (2nd ed.). New York Oxford Universiiy Press.
Collins. S.H.. Moon. G.C. ( 198 l). AJgorillllllS for dense digital terrain models. Photo·
grammeufc Engineering and Remote Seusillg, 47:7 1- 76.
OeFloriani. L .• Magillo. P. ( 1994). Visibiliiy algorithms on 1riang11la1ed digital terrain
models. lmen,ational Joumal of Geographica/ /11/om,arion Systems. 8: 13-4 1.
Dozier, J.. Frew, J. ( l990). Rapid calculation of terrain parameters for radiation mod·
cling from. digital elevation data. IEEE Transaclious 011 Geoscience anti Remote
Sensing. 28:963-969.
Dubayah. R.. Rieb. P.M. (1995). Topographic solar radiation models for GIS. /111emn-
tio11al Journal of Geogrnpl,ica/ bifomwtion Sys rems. 9:405-419.
Fisher. P.F. (1996). Reconsideration of the viewshed function in temin modeling.
Geographical Sysrems. 3:33-58.
Chapter 11: Terrain Analysis 499
Hodgson, M.E. (1995). Whal cell size does the compu1ed slope/aspec1 angle repre-
sem? Pltotogramme.oic £11gineel111g and Remote Sensing. 61 :513- 517.
Horn. B.K. (198 1). Hill shading and 1he renectance map./£££ Proceedi11gs 011 Geo-
sciences. 69: 14-47.
Hutchinson. M.F. (l 989). A new procedure for gridding elevation and stream line da1a
with automatic removal of spurious pi1s. Jo11runl Of Hydrology. 106:21 1-232.
Hu1chinson. M.F. (1993). Development of a cominem-wide DEM with applica1ions 10
terrai11 and climate analysis. In: M.F. Goodchild et al. (Eds.). Emfro11memal Mod-
eling with GIS. New York: Oxford U1uversi1y Press.
Jain. M.K.. Ko1hyari, U.C., Ranga, RK.G. (2004). A GIS based dis1ribu1ed rainfaJl.
mnoff model. Jo11mol ofHydrology. 299: 105-122.
Jenson. S .K. ( I991). Applica1ions of hydrologic infonna1ioo automaiically extracted
from digi1al eleva11on models. Hyd,-ologic Processes. 5:31-44.
Jenson. S.K., Domingue. J.O. (1 988). Extrac1ing 1opograpbic s1n1cnire from digi1al
elevation data for geographic informationsystem analysis. Photogrommetrk
Eng;nee,.ing and Remote Seusiug. 54: 1593- 1600.
Jones. N.L.. Wright S.G .. Maidment. D.R. (l 990). Wa1ershed delinea1ion with trian-
gle-based 1errain models. Journal ofHydrnulfc E11glueerfog. l 16: 1232-125 l.
Julian. J.P.. Ehnore. AJ.. Guinn. S.M. (2012). Cbaru1el head locations in forested
watersheds across the mid-Atlantic United States: A physiographic analysis. Geo-
1110,pl,ology. 177/78: 194-203.
Lindsay. D.B. (2005). TI1e 1errain analysis sys1em: a 1001 for hydro-geomoiphic appli-
catioos. Hydrologicol Processes. 19: I 123- 1130.
Lindsay. D.B. (2016). The practice of DEM siream buming revisi1ed. Earth Swfoce
Processes n11d Ln11dfo1111s. 41 (5): 658-668. 001: I 0.1002/esp.3888.
Loubaiclti. M.. Bormru1. M.M.. Johnson. A.L.. Johnson. D.E. (2003). Crea1ing Jow-
cost nigh-resolution digitaJ elevation models. Jourunl ofRange iWm,ngeme111.
56:92-96.
Manz. L. ,v.. Garbrech1. J. (1998). The rrearmeor of Oai areas and depressiollS in a1110-
mated drainage analysis of raster digital elevation models. Hydrologicnl Pro-
cesses. 12:843-856.
Maune. D.F. (Ed.). (2007). Digital E/ew,ri011 Model Tec/1110/ogiesn11d .4pylicnrio11s:
The DEM User's Mo1111nl (2nd ed.). Be1hesda: A.merican Socie,y ol Pho1ogram-
metry and Remote Sensing.
500 GIS Fundamentals
Moore. LO., Grayson. R.B. ( 199 1). Terrain-based catclm1en1 partitioning and runoff
prediction using vector eleva1ion data. Wnrer ReJOW'teS Research. 27: 1177- 119 1.
Moore. l.D.. T'umer. A. Jenson. S.. Band. L. ( 1993). GIS and land surface-subsurfltce
process modelling, ln M.F. Goodchild et al. {Eds.). £,iviromnentaf Modeling with
GIS. New York: Oxford Universi(}' Press.
Skidmore, A.K. (1989). A comparison of techniques for calculating gradient and
aspect frorn a gridded digital elevation model. J111e111mio11nl Joumnl o/Geo-
grapl,;ca/ Jnfon11atio11 Systems. 3:323.-334.
Soutl1ee, F.M ... Treitz. P.M., Scott. N.A. (2012). Application ofLiOAR terraiin sur-
faces for soil moisrure modeling. Pho1ogra,m11e1,.ic £,,g;nee,·ing and Remote
Se11si11g. 1:2:1241-1251.
Strahler. AN. { 1957). Quantitative analysis of watersl1ed geomorphology. Trm,sac·
rious ofihe .American Geophysical Union. 8:913-920.
Tarboton. D.G.. Bras. R.L.. Rodriguez-Iturbe, I. (1992). A new method for tbe deter-
mination of Oow directions and upslope areas in grid digital elevation models.
Wa,e,- Resources ReJ·eard,. 33:309-319.
Tomlin. C.D. (1990). Geographic J11formntio11 Systems and Cartographic J\f,odeliug.
Upper Saddle River: Prentice-tfall.
Wilson. J.P. (20 12). Digital terrain modeling. Geomorpl,ology. 137: 107- 12 I.
Wilson. J .. Gallant. J. (Eds.). (2000). Ten·ai11 A11alvsis: P1i11ciples a11d Applicatio11s.
New Yorlc Wiley. •
Wood. J. (1996). 111e geomorphological characterization of digital elevation models.
Ph.D. thesis. Uoiversi(}' of Leicester, UK.
Wood. R., Sivapalan. M.. Robinson, 1. ( 1997). Modeling the spatial variabili(}' of sur-
face runoff using a 1opograpilic index. m,,.,.
Resources Research. 33:1061-1073.
Zevenbergen. L.W.. Thome. C.R. ( 1987). Quantative analysis of land surface topogra-
phy. Eartlr Surface Processes cmd Landforms, 12:47-56.
Zhou. Q.. Liu, X. (2004). Analysis of errors of derived slope and aspect related 10
OEM data properties. Co111p111ers & Geoscie11ces, 30:369-378.
Ziadat. F.M. (2005). Analyzing digital terrain anributes 10 predict soil anributes for a
relatively large area. Soil Scie11ce Sociery ofAmerica Jo11111nl. 69: 1590-1599.
Chapter 11: Terrain Analysis 501
Study Questions
U J - Whal are digital elevation models, and why are they used so often in spatial
analyses?
11.3 . Write tl1e definition of slope and aspect and the mathematical formulas used 10
derive them from digital elevation data.
11.4 . carculate dZ/dx and dZ/dy for the following 3 x 3 windows. Elevations and the
cell dimension are in meters.
b) 6, 63 62 dZ/dx • dZ/dx •
65 64 64
70 68 66 dZ/dy, dZ/dy •
c) 18 23 17 dZ/dx • dZ/dx •
21 2, 19
20 22 18 dZ/dy• dZ/dy ,
502 GIS Fundamentals
11.5 - Calculate dZ/dx and dZ/dy for die following 3 x 3 windows. Elevations and the
cell dimeosio!l are in meters.
b) 68 63 61 dZtdx • dZ/dx:
69 67 66
70 71 72 dZtdy• dZ/dy •
c)
15 19 18 dZtdx = dZ/dx •
19 20 19
21 23 24 dZl<ly= dZ/dy •
11.6 - Calculate 1be slope and aSJ)<lCI for 1be underlined cell values. using the four
nearest cell method.
11.7 - Calculate the slope and aspect for the underlined cell values, using the four
11eares1 cell me,hod.
11.8 - Calculate lbe slope and aspecl for !he underlined cell values. using 1he 1hird-
order fiui1e difference me,hod.
11.9 - Calculate the slope and aspect for the underlined cell values. using 1he 1hird-
order finite difference method.
"
710 706 704 700 702
-
•>
708 705 705 697 700
-
711 709 705 l96 694
714 712 708 703 698
11.10 - Plot a graph of slope in degrees (on x axis) against slope in percent (y axis).
Which is usually farger. slope as degrees. or slope expressed as percenr?
11.12 - Draw the approximate location of contours for the following set of points.
Stan contours at the 960 value and use a 30 unit contour interval. For this ex,ercise. it
is pennissible 'to estimate the contour locations visually: you do not have to calculate
the distances between points 10 place the contour Jines.
• U:16 lOU
10,0
tOtO
• 1092
•
10Ja
!O~
1010 ml 1014
Ill? 107J
IIUi' 10;13
.... ""
, . .1
10 . .
.
10<.3
11011 1081 1. .,
lOl<
lOll
.
.., ,... '"''
1009 1000
10-4 7
....
1038 1010
•
1041
10.tO
•
!:,)Cl
10)) 101~
•
.. ....
100.a
•
Chapter 11: Terrain Analysis 505
U .13 - Draw the approximate location of contours for the following sel of points.
S1an contours at the Ovalue ruid use a 200 uuit coniour interval. For this exercise. it is
pennissible to estimate the contour locations visually; you do not have to calculate
lhe distrui.ces be1weeu points 10 place lhe contour lines.
563 1078
766 429
160
• • • 306
•
• • 381
571 •
912 • 588
578 •
235 • 632 640 •
1102 • 313 940• •
• • 395 • 10<0
795
• • 8:15
J242 • •
• 384 937
•
942
•
377 • •
128 ;i1267 317 925
1312
930
• 512
• 1281 • •
• 1302
• • 530 • •
1306
• 1306 1317 823
1136
• 596
•1318 •
• 674 30!
956 • • • •
• ii16 401
• • 884
547 •
•
73! 1062 • 1188
•
• 391
•
~90
1013 • 5'8
706 •
• • 983
5,2 •
969
• • 582
• •
1110
662
492 592 644
•
• • 1032 • •
987
• 900
739
•
1!3&
• •
1049
1379
753 • •
663
• !248
• 1417
•
U.14 - What is the fonnula to calculate a contour height from two measured eJeva.
tions?
506 GIS Fundamentals
11. 15 - Using the figure below. calculate the distances to the listed contour line along
the shoncs1pa-tit between points. The example shows the distance calculation from
Point A to the c.ontour wit.h value 250, along the straight line from A to B. wa1en the
values of A and B are shown. and the distance from A 10 B is 148.
Discance from B 10 the 300 concour on the line B • D. when d80 i~94
Distance from E to the 250 contour on the line E · D. when dEo is 115
Distance from C 10 the 200 contour on the line C • D, when dco is 188
Distance from E to the 300 contour on the line E. G, when dee is 248
·o .310
c.1ao ·
G.340
E.220
11.17 . \\~1a1 are the plan curvan,re and profile curvature. and how do they diffe(?
Chapter 11: Terrain Analysis 507
U.18 - Define the waters.bed boundaries and possible stream locations in the digital
elevation daia depicted below:
JIJ Joo .JbY J8l
)7!1 17~
)18
508 GIS Fundamentals
11.19 - Define the watershed boundaries and possible stream loc.ations in the dhdtal
elevation data depicted below: -
604 639
742 666
11.20 . Define the foUowing: solar zenith angle. solar azimuth angle. and solar inci-
dence angle.
11.21 • Omw a diagram illustrating the solar incidence angle. and identify ,vhat site/
terrain factors affect the solar incidence angle.
11.22 . What are viewsbeds. when are Ibey used. and bow are they calculated?
11.23 - What is a shaded relief map? How are the values for each cell of a hiillshade
surface calculated?
509
_..._
Fi.v;urt' 12· 1: Air-qu.,lity ,.tries acrou !Pace 11nd time, iu,d is
only oi~s:urcd in a fcwlocalious. Spabal estOuatiou methods
are couunollly us~ to prcdic1 air quality at wsam:plcd loc-t1·
1io11s (counc.1r U.S. NPS).
510 GIS Fundamentals
the analysis and. if so, the missing values iu ideutifyi.ng these core areas. These meth-
es1ima1ed. FinaUy. es1ima1es may be ods typically draw from a set of sample
required when changing to a smaller cell size poims that consrinue even1s.. such as an
in a raster data sel. The ..sampling.. fre- observation of an animal. a busi11ess loca-
quency is set by 1he original raster. and vaf. 1ion. or a crime thin has been committed.
ues must be estimated for the new. smaller Spatial prediction typically 1ransla1es
cells. from lowerspatial dimensions to the same or
Spatial i111e1polatfou is the predic1ion of higher dimensions. This means we typically
variables at u1uneasured locations. and based generate poims or lines from po:int data, or
on a san1pli11g of the same variables at areas from point, line, or area data. Predic-
known locations. Most interpolation metll· tion meihods allow us 10 extend the illform,1-
ods rely on the nearest points to estimate tion we have collected, most ofieo 10 "fill in''
missing values, and use some meas.ure of between sampled locations, bul .also to
distance from known 10 unknown values. We improve 1he q11aLi.ty oflhe data we ha,-e col·
might have meaSt1red air pollution at a set of lected.
towers across a region. but need estimates Spatial prediction methods may also be
for all loca1ions in tha, region. lmerpolation used to 1ransla1e infonnation from a higher
is routinely used Qo estimate air and water order to a lower order. that is, 10 estimate
temperalu.re. soil moisture. elevation. ocean poim values from data collected or aggre.
productivil)•. pop,.1a1ion density. and a host gated to area or lines. We may have pop11la-
of additional variables. 1ion data reported far an area. a11d we may
Spar;a/ prediclion also invol\'eS the es1i~ wish to estimate popula1ion for a specific
matioo of variables at unsampled locations. poin1 within this area. This may be affected
but differs from inlerpolation in that esti· by the modifil1ble aret:1/ ,mir problem. a com-
mates Me based a.t least iii pMt on other vafi. mou baz:11d in spatial estimation method!
ables. and often on a total se, of described in Chapter 9.
measurements. \Ve may use elevation to help Wha1e,-er !lie methods used. spatial esti·
estimate temperamre because it is often ma1ioo is based on a set of samples. An indi·
cooler at higher locations. A map of eleva- vich1al sample consists at leas1 of the
tions may be coo1bined with a set of mea- coordinaces of the sample locatjon and a
sured temperatures lo estimate temperatures measuremem of llte variable of illteres1 a, the
at unknown J()(:ations. sample location. We may also 01easure addi-
Acorenma is characterized by high use, tional. related variables al the sruuple loca-
density. in1ensily. or probability of oceur· tion. Coordina1es should be measured 10 the
rence for a variab le or event. Core areas aJe bjgbes1 accuracy and precision that is prac1_i-
defmed from a se·t of samples.. and are used cal. given cost and time cons.trainls and the
10 predict the frequency or likelihood of imended use of 1he data . Sample ,'llriables
occurrence of an objec1 or event. Home should be measured using accurate. stan-
range.s for individual animals. concenlra- dardized, and repeatable methods.
tions of business nctivity. or centers of crint.i·
nal activity are all examples of core areas.
There are several methods that may be used
Chapter 12: Spatial Estimation 511
Sampling
Estima1io11 is based 011 a sample of Sampling Patterns
known points. The aim is ro estimate the val·
ues for a variable ai w1knowu locations 1'here are several commonly applied
based on ,,atues measured at sampled loca- sampling patterns. A systematic snmpli11g
tions. PlaMiog will improve 1he quality of pn11em is che simples, (Fig,ire t2-20).
the samples. and usually leads to a more effi· because samples are spaced uniformly at
ciem and accurate iu1erpolatio1L fixed X and Y illlervals. The intervals may
001 be 1he same in both directions. and che x
We control two main aspects of the sam· and Yaxes are not required ·to align with 1he
piing process. First. we may control the loca- northing and easting grid directions. The
tion of the samples. Samples must be spread sampling t)Qttem often appears as poims
across our ,vorking area. However. we may placed syscemarically along parallel lines.
choose among different paaerns in dispers·
ing our samples. The pattern we choose will Syscemacic sampling has an ad,,amage
in tum affect the cost of our samples and the over other sampling panems in tenns ofease
quality of our imerpolacio11. A poor discribu· in planning and description. Field crews
tion of sample poincs may increase errors or quickly understand bow co lay out the sam·
may be inefficient, resulting in unnecessary pie panem. and there is little subjective
coses. judgement required.
Sample size is the second main aspect of However. systematic sa.mpling may
the sampling process we comrol. One might have disadvantages. It is usually not the
believe the correct number is "as many as most statistically efficient sampling pauem
because all areas receive the same sampling
you can afford:· however. ibis is not always
the case. A law of diminishing returns may intensity. If there is more interest or varia-
be reached. and fiinher samples may add rel· tion in ce.rtain ponions of the study area. Ibis
acively Unle information for substancially preference is not addressed by syscemacic
increased c.os,lS. Unfortunately. in mosl prac· sampling. The difficulty aod cost of travel-
tical applications. the available funds or time ing lo the sample points is cLOt considered. It
are the main limiting factors. Most surfaces may be difficul1 or impossible to scay on line
are undersam.pled.. and additionaJ funds and between sampling points. R,ough rerrau,.
samples would usually increase che quali1y physical barriers. or Jack oflegal access may
of the interpolated surface. To date. there preclude sampling at prescribed locations.
have been relatively few studies or well- In addition. systematic sampling may
establisbed guidelines for decenoiuing che introduce a bias. particularly if there are pat·
optimum sample number for most interpola- terns in cbe measured variable that coincide
tion methods. ,,~th the sampling interval. For example.
There are thnes when we co01rol neither there may be a regular succession of ridges
the distribution nor 1be number of sample and valleys associated with underlying gco-
points. Th.is often occurs when we are work- logjc coodition.c;. If the systematic sampling
ing with "found .. variables. for example. the in1e.rvaJ coincides with this pauem. there
distribmion of illness in a population. We may be a bias in sample values.
may identify the households where a family Rnndom sn111pli11g (Figure 12-2b) may
member has contracted a given illness. avoid some. bui nol all. of the problems that
Allhough we can control neither the number affect systematic sampling. Random sam-
nor the distribution of ill people., we may pling entails selecting point locations based
wish co use these "samples.. in an incerpola· on random numbers. Typically. both 11\e
tion procedure. easting and northing coordinates are chosen
by independent random processes. The.se
512 GIS Fundamentals
• • • • • •
~
•
0
• • • • • • •
• ••
• • • • • • • •
-· ~·,, E: .
0
• • • • • • •\
'• • • 0
,;:::.
.,. \
:, ~ o
'·
• • ,. 0 ,p ~
-,-
'•"W• •
• • .;.r.
, ·-•,~
•••
•
• •• .,.
..I
• • • • • • ., . • •
• • • 0
~ • ,
"',,,, ~
•• A
•
•••• •• .,8Jt
9
.
• • • • • • 0 0
• • • • • • o ..,,.g,,._
0 ""'-"~-•
• i>
Ooi
•
•
••
0
0
0
•
,• '
,:.,JI .
~·i#t:;.,
..• O I
•• -~~1 ~~
o) b)
••
a!Ji ;fJJ,.t/i,•• • 0
o~ ··••· •
• ~ • • ,I,
~;!o
~f' ,
, .·i··. -
/.$' -:3 • • •
'?
~
J1E
•
\
•
;,,t• •
;i,.Jh
P{>:J.,I ..
-.;:o>
I . . . -·"••
, o ~t'o.'°
} 0,0
••
•
0
d"
~
_-,o ooo- o
J ~ j;,.. '°1_ o H· ,:_,
~ ~,,t ~~,.·,,
4i .•• . . r...• o's. ' 0 0
•• •• .
• • ·e .,
•
·~ a1. oot J
• 0
, . 1,
0
• •
o<;oo
~o
·•_,..d ,
••
•'t
·f
I I
g•~ O o .. 0 l).r
fi&i.a•T I ?-2: E.umplc:s or o) 5)'$tc:m.itic:. b) r.mdom, t) c:l\lster. and d) ndapth-e sa.inpl.ing poucms.
Sample points arc shown as solid circle, . Coutoun Cor lhc: sur6ioc are shown as line,. Sampling mc:th·
ods diff« ii:i the. dll'ttibutiou of t:1unplc: po11\1s.
Chapler 12: Spalial Eslimation 513
may be plotted on a map and/or listed, and surveys thal entail significant off.road travel
1hen visited wi1h !he aid of a GNSS or other because of the reduction in U3Vel times.
positioning technology 10 collect 1be sample. There are several varia11ts of cluster
The points do not have to be visited in the sampling. Cluster cemers m.ay be located
order in which they were selec1ed. so in randomly or systematically. Samples "ithin
some instances. travel distances between a cluster may also be plac.ed al random or
points will be quite small. On average. the systematically around the clus1er center.
distances will be no shoner tban with a sys.. Both approaches have merit. although it is
tematic sample. so travel costs are likely 10 more common to locate cluster c.enters at
be al least no worse than with systemalic random and distribu1e samples within a clus·
sampling. ter according 10 some systemaric panent.
Random samples have an advantage This approach is used by the U.S. Forest Ser-
over S)'Stematic samples in tluu they are vice to conduc.i national sur"eys of fores1
unlikely 10 n1a1cb any panen, in the land· conditions. and by many prospectors during
scape. HeJ1ce. 1he chances are lower for 111ineral exploration.
biased sampling and inae-curate predictions. Adnpti"e sampling is a final method we
However. like systematic sampling. ran- \\ill describe. 11 is characterized by frequent
dom sampling does nothing to distribute sampling in variable areas and sparse sam.
samples in ar<l:ls of high varia1ion. More piing in uniform areas (Fig,ire I 2·2d). Adap-
samples than necessary may be collected in tive sampling greally increases sampling
unifom1 areas, and fewer samples than efficiency bec.ause smaJl.scaJe variation is
needed may be collected in variable areas. In bener sampled. large. relat,vely homoge-
addi1ioo. random sanipling is more compli· neous areas are well represented by a few
cated and hence more difficult to unde.rstand samples. reserving more samples for areas
1hi!il sys1ema1i¢ l.'\Jiipli!lg. More lftiiniug wi1b higher spatial ,•11tiation.
may be required for sanipling crews when Adaptive sampling requires a way to
implementing random sampling. Random estima1e feature varia1ion priorto field visits
san1pli11g is seldom chosen when sampling or while in lhc field. or repe-a.1 visi1s to the
over large areas. due 10 these disadwrncages sampling areas. Sample density is adaptively
and relatively few advantages over alterna- increased in areas of high variation. Some·
1ive sampling strategies. rimes ii is quite obvious where the varia1ion
Cluster ~·amp/iug is a technique 1bat is grea1es1 while in the field. For example,
groups samples (Figure 12-2c). Clustercen· when measuring elevation. i t is obvious
1ers are chosen by some random or system· where lhe terrain is more va,iable. Sample
atic method. with a cluster of samples density may be increased based on field
arranged arotu1d each center. The dislances observalions of steepness.
between samples within a cluster are gener· lfchere is no method of identifying
ally much smaller than 1he distances where the fe.atures are most ,,ariable while in
between duster centers.
the rield. then sample densi1y ca111101 be
Redtieed crave! time is the primary increased "on the spot." Samples may
advantage of cluster samples. Travel times require office or lab foranalysis to estimate
withing a cluster are shone,. A sampling variation. Sample locations are tlieo selected
crew may 1rn,·el several hours 10 reach a based on local variation. T11e list or map of
clus1er center. buJ only a few minu1es coordinate loca1ions may be generated and
between each sample within a cluster. Cius. used as a guide in collecti.ng subsequent
tersarupling is often used in natural resource samples.
514 GIS Fundamentals
•.
•
•' . .• .. •
••it
• •
-... • ~ • •
,.,..
• '·
••
~
••
•
••
• • .'re, ••
•
'II
~
• •• •
•
.1'• · •• ·•••t
• •~ 0 • • /f •
~
.. ...
•
• • of•••.".
m
•
0
~
•
I
(
I '
&'
• ~ •• ,...,
••
•
.
1%·N
71.a,c
&.c,U&
Fla,m~ 12-J: Mc:asurc:rucnl points and ,·:dues for indicator ozone co1m:ntratioo for 2014. HiiJtc:r<on¢CU·
tNuioo.t Mmt publi,c hc11hh. wi1b ~:idings O\'a' 85 p11t1:s per billion auocillted with d:un.'1.Jc 10 lung: fi.uic,
tion. pnrtic:uJarly i.n the ~ldcrfy, c:~.itdrrn. :ind J1stlnnn1ks, These 5tnnplc cbt., will be used ln!cr l111h.i.s
ebnpter to dffnotu1Jn;te uiterpobhOn methods.
ing one data ~ int. The error is es.tima1ed a1 independent set of sample points that are
the withheld point as tbe observed minus the withheld from the interpolation process.
imeipolated values. The sample is replaced. Their measured values are tl1e11 compared 10
a new sample selected and withheld. and the the interpolated values. and the mean error.
surface fit and erroragain detennined. This ma."imum error. and perhap.s other error sta·
is repeated for each data point A less effi. tistics identified.
cient testing method entails collecting an
516 GIS Fundamentals
FlJtUT 12-4: A l.brtt-dUncnsjonaJ pcr$pcC"ti\'c rcprcscutalion of Thiessen (ncatnt neighbor) polnons. a,c11-
cr.111cd b>• i.iucrpoL11ion for:\ sel of Slt1l!Plc poi11t1 (bl:\d: dou). Nole that MOS IU'e 11Ui3,t!cd lhc \'11fue of die
nearc:$1 4llmplc p<>inl. cn:.;ating a $c1 of UR"gi.dnr polygon;. Vt.lucs dw1gc 111.mapdy !llong the pc>olygon edge;.
Chapter 12: Spatial Estimation 517
•
• •
• •
0 •
Figure 12-5 shows our ozone sample Fixed Radius - Locall Averag ing
points and Tbiessen polygons based on the
sample points. Note that sampling is denser Fixed radius interpolation is more com-
near some urban areas. panicularly dte Pltil· plex than nearest neighbor interpolation. bul
adelpbia-New York City co1Tidor in the Less complex than most other interpolation
upper right, Denver on the left, and St. methods. In a fixed radius imerpola1io11 a
Louis. Dallas, and Atlanta along the mid 10 raster grid is specified in a region of interest
Lower porrioos of the figure. Thiessen poly· Cell values are estimated b•sed on 11te aver-
gons are sn:i.aUer where sampling density is age of nearby samples.
highest. The satnples used to calc11Ja1e a cell
Thiessen polygons provide an exac1 value depend on a search radius. The search
imerpo/01or. This means the interpolated radius defines the size of a circle that is cen-
surface equal;; the sampled values at each tered on each cell. Sample points fow1d
sample poinL The value for each sample inside the circle arc averaged 10 interpolate
location is preserved. so there is no differ· 1be value for that cell (Figure 12-6). Points
ence be1weeo 11te true and interpolated val- outside the circle are ignored.
ues at the sample points. Exact inierpolators Figure 12-7shows a perspective view of
have this admirable quality. but often are not fixed radius sampling. Note thal there is a
the best interpolators at unsan1pled points: sample data Layer. shown a, '!be top of Figure
for example. the Thiessen polygon method is 12-7. venically aliSJled witb the interpolated
usually in error at nonsampled locations. surface. This surface is a raste.r data layer
often more so than other inexact interpol•· \\itb interpolated values in each raster cell.
tors. A fo~cd radius circle is cemered over a ras1cr
cell. The average is calculat.ed for all sam-
ples comained "ithin the sa:mple circle. and
thi.s average is placed in the appropriate out-
518 GIS Fundamentals
• obs.e.rvot,on
+ inrerpolotion PQlflt
Input Output
•
6 • .s·
••5 7
..
, 10
. 15
• 9
•(;, 5
·•·.s
6
••3 •
• -:·
Fla:ui•t 1~6: A dia.pam and c11::uuplc of a fixed r:ldiut i.ittC!}'Olatiou. Values with.in each ~mp,lioi;: ci.r<lc
-att l\\'a'a$,Cd to ntmutc an Oll1puJ ,·:'lluc for 1.bc corrcspo11din3 poinL
output value
is based on Interpolated
one sompte surface
Flaui'f' 12-7: A ~ t i\'c di11pmn of fixed nidiw umpling. A circle it centered on c~cb i'tl.i-l'crcdl
loc.n.lion.. S,unplef ·within tbc cirdc c0111ri\mtc lo die \'aluc a,i.igocd 10 ucl1 com,pot1ding nutcrccll
(adapted froru Mitichcll. 1999).
Chapter 12: Spatial Estimation 519
. . 1-- • .
•
• •
•••• • •..
• • •
•
••. •
l ~
- ,.• ' •.
••
• ....• " • ,11?./'oft . .~,
•
d> "'•c /:,, too
!~,1:
. ·.;·=
cif> o ' I
.....
O o•
••• 6 "o .;p . o-.
. if
• 0 r O,Ofle 11i;ti
• • • ..
•
, IB'
•
....• . ,,
~ t~M,S
• .. • • •::/ ~8 11
71.7,1
• • '
\
-.• -. ~
•
i
•\
•
u.7e
1$-84
14-ll&
Flgnrt 1.2~8: Ol'i~lUll cbt11 ai,d -..,upk pointt, nnd ft fl.iced rndittt llltcrpol111ion. Note lh111 this method
inny lcrwc pps in ~reu of '!).vie tiutiplct, iind 11\'cr.\Jc O\'crdc1:tSC 11nmplc areas.. 1.t'lcre:i~iog the IWcrng•
lug radlUs w;ill d«rc\llSc g11ps. but iocrQsc data Sl))oothiog Ill highly s;arupled IU'C.I&,
pu1 rasier cell. The process is repe,ned for may smooth lhe data too much. In the
eacil raster cell in the surface. Tile fixe<I extreme case. a search radius may be defined
radius circles are shown corresponding 10 that includes all sample points for all cells.
three raster cells. contauliog three. zero. and resulting in a single in1erpola1ed value for all
one sample points. respectively. Circles may cells. Some intermediate search radius is
contain no points. in which c.ase a zero or no chosen.
da1a value is place<! iu lhe rasier cell. The Fixed radius interpolators are not exac1
radius for the- circle is typically much laiger io1erpolators because they may average sev-
than 1he raster cell's widtb. This means cir- eral points in the vicinity of a sample. and so
cles overlap for adjoining cells. causing 1hey are unlikely 10 place the measured value
neigllboring cell values to be similar. al sample points in lbe inle!Jlolated surface.
The fixed radius: interpolator tends to
smoolh tlte sample da1a (figure 12·8). Large
or small values sampled at a given point are Inverse Distance Weighted Inter-
maintajned when only that one sample point polation
falls wilhin a search radius for a cell. Values The im-erse distance weiglued(ID\V)
are brought toward the overall sample mean interpola1or es1.i1ua1es lhe value at unkllowu
when averaged \\rjthin a search radius. points using the sampled vatues and distance
The search radius affects the values of to nearby known points. The weight of each
the interpolated surface. Too small a search sample point is an inverse p.roponion to the
radius resuhs in many emp1y cells. with no distance. thus 1he name. The fanller away
data or null ,•alues. Too large a search radius 1he point, the less weight the point has in
520 GIS Fundamentals
helping define the value al an unsampled weighted by the inverse of the distance from
loca1ion. Values a,e estimated by: the unknown. intel]lolated location. These
weighted values are added. The result is
divided by tJ1e sum of lhe weights 10 "scale..
Z; the weights to the measurement units. This
Ld"IJ
j
produces an es1ima1e for the unsample<l
location.
(121)
I IDW is an exac1 interpolator. lmerpo-
i::d~
j IJ
lated values are equal to the sampled values
a1 each sampled point. As a <\! becomes very
small (srunpte points near the interpolated
loca1i_onl: the 1/dij becomes very large. The
where ZJ is the estimated value for the contnbuuon from the nearby sample point
unknown point at. loe-at:ion j. d, is the dis- dwarfs the contributions from aUI other
tance from known point I to un~own point j, points. The values l/dtj are very near zero for
Z, is 1he value for the known poim 1. and n is all i values except the one very uear the sam·
a user-defined exponent Any number of pied poim. so 1he values a1 all other points
points greater than two may be used, up to are effectively multiplied by zero in the
all points in 1he sample. 'Typically. some numerator of the lDW equation. The sum in
fixed number of close points is used: for the denominator reduces 10 11\e weight Vdi.
example. 1he three ne.arest sampled points The weights on the top and the bottom of t~e
will be used 10 es1imate values a1 unknown row equation become more similar, and the
locations. Note 1hat n controls how fast a fraction approaches 1. Thus. a1 a sampled
point's influence ·wanes with distance. The point. the !DIV intel]lolation fonnuta
larger then. the smaller the weight (1/d"ul- reduces to:
so the less influe10ce a point has on the esti·
mate of the w1known point. ( 12.2)
Figure 12-9 iilus1ra1es an IDW interpo-
lation calculation. The three nearest samples
are used. Each measured sample value is
1
kno.....n p()in I knowt"I po1rH
50 30 d"u
·~~· 1nterpo101ed
By simple division this is reduced matht·
point matically 10 Z;, the value measured at the
sampling location.
lnverse distance weighting results in
smooth interpolated surfaces (Fi,gure 12-10).
The values do not jump disconti nuously at
edges. as occurs with Thiessen polygons.
and sometimes with fo<.ed radius interpola-
• 3945 tion. While IDW is easily and 1Yidely
52
k~po101 applied. care must be taken in evaluating the
Fi&Ul'C' l l-SI: An example: c:Jilc:u.lation for a lincM values of n and i. The effects of d1anging n
llm::ri;c dis.tao~ wd$,ht<:d intc:rpolator. Tbe values and i should be tested in an ove~ampled
with
l1f cac1t lroov;-i,poi.i•1(50, 52 , 30) 11tc 11.\'<~gcd case or using retention and repeat fitting
wcifhts based on 1J1c di.s12nccs (di, d.2. d3) from
the mtc:rpob1tcd poim. methods. described later. where adequate
Chapter 12: Spatial Estimation 521
..
• ojo
•
•
• .
••
• .. ......
• •
•• •
I' •
•
.
~. ,,
•
'
le,
•
• •• •
•
., 0" • •
.e. .
: ·..
•
.,,
8oA.u."
,.. ,··~
•
• •
ti •J.; ':'.'...V. • 0
0
;,.;o -.~)
•
'
..i;..,
.,
.-----i
• ,.f A• •
• •:•: • <>
..
O»'IC. #0
• ••
•
... 1:1·6~
~!-t!
l •?/'
• '
;< •• ...n 71.74
... i
•
• •
•• 71·~
1, Ti
• • • " lib
.•,ofo;)
,• ..
'
••
•• •
• •
•••
.
•
•
•• •
•
• •
'
• ••
0
8
'
•
• .,
Flautt IZ...10: lnwrsc di&tancc wci;btcd U1tclpolariou typically r«ulls in a SlllOOtb. ;ap-~c SU1face.
E~poncm 01'dcr :a:1,d u mple ti.u 11.ffttt tbc iln'ftlce &b11pc (ot 1.hi-t i1ltc:rpol111or. The top intcrpol~1io1, relics o,,
lhc 12 nurqt J>Qinl.s, uitb an exponent of two, \\•bile 1hc lower intcrpobtion relic, Oii the fO\lr nc:nrC$1
pointt. and au cxpo1lC1ll of three. Lexa.I infloenccs arc stronger H the e.:qx>tieru incrcaws and 1be nwnlxr Qf
$81l.lpk poictt dc<tt3tCS.
522 GIS Fundamentals
withl1eld points can be compared to interpo· S_pline fw1c1ions are conslru:cted from a
lated points. The 'lDW, and all otl1er in1e,p<>- set ofjoined polyno,nial fianctions. Line
la1ors. should be applied only afttr the user functions \\~ll be described here.., but the
is convinced the method provides estimates principles also awty 10 surface splines.
with sufficient accuracy. In the case of IDW. 1>olynomial functions are fit to sJ1ort seg-
this may involve testing the interpolntorover ments. Ao exact or a least squares method
may be used 10 fit the lines 1hrough 1he
a range of n and i values. and selecting the points found in the segment For example, a
combination that most often gives accept· third-order polynomial may be fit 10 a line
able results. se~nent (Figure l 2-11). A different 1bird-
The size of the user-defined exponent, n, order polynomial will be fit 10 tbe next line
affects 1he shape of the inte,pota1ed surface segment. These polynomials are by their
(figure 12-10). 1Vhei1 a larger n is specified. namre smooth curves wi1.ltin a gi\'Cll seg-
ment
the closer points become more influential.
Higher exponents resuh in surfaces with Splines are typically first. second. or
higher peaks. lower valleys. and steeper gm- third order. corresponding to 1be 1n.1xum1111
die111s near the sampled points. Contours exponen1 in 1he equation used to fit each seg-
ment (e.g.. second order for x2. third order
become much more concemrated near sam· for x3 or x2y). Segments meet at knots. or
pie points when n • 2 (Figure 12- 10. top) join points. These join points may fall on a
than when n . 3 (Figure 12-10. bottom). sampled point, or they may fall between
These changing shades reflect steeper gmdi· sampled points.
ems near the known data points.
The number of points. i. used 10 estimate one polynomial equotion
an interpolated point.;. also affects the esti· IS fit 10 po.ill$ Of'lt:
mated surface, but effecls are often complex 1tro"9h five. v • f(s)
and diflieult to ge,1erali2e. b~au~e they
depend on the dis1ribmion and magnin,des
of the specific sample points. A larger num-
ber of sample points lends 10 resuh in a 01 1he jOll'l Poir'l1
smooiber interpolated surface. f(x) • 9(x) smoothed
slope of f{x) • slope of g(x) lln•
c.h0n9t lf'I ~loi,t. of f(x) e~uols
Splines the: c.hOnge in slope of g(x)
Constraints are sel on spline functions lo points) and show a smooth tran.silion (Figure
ensure the einire line or surface remains 12·I2). Stricllyenforeingex:act i111erpola1ion
smooth at the join poims. These constraints can sometimes lead 10 artifaccs at the knots
are in<:orporaled into the mathematical form or between points. Large loops or deviations
of the futietiC>n for each segment. They may occur. The spLiue functions are often
require tbai tbe slope oftbe lines and the modified to allow some etTOr in tbe fit, par-
di.1nge in slope oflhe lines be equal across 1icutarly when fitting surfaces rather lhan
segments on either side of the join point. lines. This usually removes the artifacts of
Typically. spl.ine functions give exact inter- spline fits, while maimaining die smooth and
polation (the splines pass through the sample cominuous interpolated lines or surfaces.
'.
•
~;-
•
• . •
•
. •... •
••
• •
•
•
•
.
.
'. .,
~•
•
•• •
•
•
•
• ••• • •
•
Spatial Prediction
Spatial predictions arc based on mathe- and sample coordinates to estimate the tar-
matical models. often built via a slatlstical get variable.s at unknown locati!ons, while
process. These statistically based models spatial prediction usually inc-orporates
use coordinate l<>eation and measured or additional variables.
observed independent variables to predict Spatial predictions are often improved
values for impor1an1 but unknown depen· due to spatial autoccrrela1io11. which is is
dent variables. Spatial pcediction is differ- the tendency of nearby objects 10 vary in
ent from interpolation because it uses a concert. High values occur together. as do
statistical fitting process rather than a pre· low values. Explanatio,ts of this commoo
defined algoritlun. and because spatial pre- condition often refer 10 tbe observation of
diction uses independent variables as well Waldo Tobler. that "everything in the tuti-
as coordinate locations to estimate verse is related to everything else. but closer
unknown variabl.es. \Ve admit lhat our dis. 1hings are more related." However. the
tinction between spatial predic1'ion and nature of the correlalion may d1ange from
interpolation is artificial. but it is useful in one variable 10 the next. or it may change in
organizing our discussion. and highlights space. Correlations may l>e strong in one
an importam distincc.ion between ourdata- region bm poor in another, or positive in one
driven models and our fixed interpolation area a.ad negative in another. We 013)'
me«hods. improve our predictions if we sn,dy the SPQ·
Spatial predictions a.re a special case of tial autocorrelation and incorporate the cor-
general predictiv,e modeling. a major focus relation stn1c1ure into our models.
of applied statistics. TI1erc is a rich litera-
In addition 10 spatial autocorrelation.
ture devoted to spatial statistics in general. 1here may be. cross..corre.lario11 between dif·
and spalial predictive modeling in particu· ferent variables: the tendency f<>r 1wo vari-
tar. We will only scratch the surface of th.is ables to change in concert. This means two
field: the reader ls referred 10 the introduc- different variables at the same OT nearby
tory spa1fal sta1is1ics texts Jisted at the end locations may be lligb or low together (posi-
of tl1is chapter. tive cross-correlation). or highs in one vari-
Our discussions will be re.stricled to able correspond to lows in another (negative
predicting contin.uous spatial variables. cross-correlation). SPQtial prediction metli-
These variables arc conceptualized as sp11- ods may incorporate auto-and cross-<:orrela-
linl fields that occur across an area. are tion in predictions.
measured on an imervaVratio scale, and Surfaces with tow and high SPQtial auto-
t)'J)ically have values that vary in concert correlation and with strong cro$S-correlation
- that is. they are spatially correlated. This are shown in Figure 12-13. Figtcre 12-130
is in contrast to discrete objec:1s. such as shows two surfaces. Loyor l. with a high
point. line. or polygon features. While the autocorrelation, and Loyer 2. wi•h a low
occurrence and propertie.s of discrete fea- autocorrelation. Scatter diagrams of sample
lllles may be predicted using spatial mod- pairs separated by a unifontl sb<ln tag dis-
els. this is less common. and most discrete 1ance are shown to 1he righl of ~ch corre-
object predictio1.tS use a different set of sponding layer. mgher autocorrelation, as
tools tlmt will nol be discussed here. shown in Layer l. indicates lbai points near
Spatial prediction may be considered each other are alike. Asample from • sur-
more general than interpola1ion. Bot11 are face with high autocorrelation provides sub·
used to estimate ,>alues of a target variable stamial information abom the vaJues a1
at unknown locations. l.nterpolation meth- nearby locatioos (Figure 12-1 30. top). Sam-
ods use only the measured target variable ples from a surface with low autocorrelation
Chapter 12: Spatial Estimation 525
o) spotiol outocorrelalion
Loyu1
,.
~
g
..... ... ::~.:
..., • •_ · ; , ·
'Jtlf..~t-}." ••
-1 1100 ,.
"'O
~ ""'
-
~ -
.00
S:.-.·~~r·'
~\,··
Loytr 2
N
--
.3
~
~
J-
,:.., •.'•''t. • '\ l I •
";"- .....
" "" -~··:.......,.,?;·:
~ . • ..• ~
.... ,.~ ..:,. . . . .
~ ~
"'
"'
b) spofiol cross-con-dotion
'""' ~ -----------~
'. ......
••
''"'
.. !100
lJ 1000
·' .. ·: .
• . . .. •• t•
t,.... •• .,.•
... .If,~
.9 '°"
.. ~-----------"'
""'
flaurt- 12- 1.l: PM o lhO\\'!I spati:11J1,, :ntr<>Cotrd111cd (l ~ r I) ruld sp:ui~lly uoeo1nln1cd (Loyer 2) <bt!l
l~yc:r$, Plots of i.1mplc ~in wi1b .t fag distn.:rn;c h • 1 sbow ,imifar , ·~lun for- lbc nutoconcl.3tcd Loyer I,
and unrelated ,·alues for uncorrelated Loyer 2. Panel b show$ two eross-com:latcd layen. Loytr Ahas:
hiJb« ,·aluc5' on a,·mi..3< than toyer 8 , but the two laym \'IU)' io cou«r1. Both fe"acb luab aud low ,·111-
ucs in siruilnr 11re.u.
526 GIS Fundamentals
do not provide much information al values in Figure 12· 14 shows a Moran's I c.alcu.
the vicinity of the sample point (Figure 12· lotion applied to a raster data set. Here a
130. bonom). "rook's case" nlle is applied in that neigh-
1\vo cross-correlated raster layers are bors are only cells tl1a1 share a full edge.
shown in Figure I l· lJb. Positive cross-.or- A rather sparse weiglu table may be
related layers have values 1hat tend to both constmcted to organize lhe wu (Figure 12-
be high in some regions and be tow in other 14). Each row is derived from the rook' s
regions. Many feamres are posith•ety cor- adjacencies centered on a given cell, idemi-
related. such as housing prices and average fied in 1he firs:1 column of the 1able. There is
income. or donut shop density and number a non-zero entry in a row for each rook's
of security guards. Negative cross-correta- case adjacency. and zeros otherwise. For
1ion occurs when varfables change in the example-. the rook's template cemered on
opposile sense - areas with high values for cell d in Figure 12-14 yields o, e, and gas
one variable are low for the other. for exam- neighbors to d, so these columns have non-
ple. low temperanire-s at higher elevations. zero entries i.n row d in the weights 1able.
The Moran's l statistic is an established Each row is nom1alized so that d:te summed
measure of global. or average level of cor- row entries equal I. yielding the sum of all
relation across a data set: weights 10 equal the nllmber of fean1Ces. and
since wu • n. they cancel each other by divi.
sion in the Moran's l fonuuta.
n n
Moran's I approaches a value of ·l in
n L L w0(Z; - Z)( Z1- Z) data sets with positive spatial correlation. in
l= l j = I
which like values tend 10 occur together. The
1- ...:........:...,_:.~ ~~~~- (12.3) numerator in the equa1ion is 1.he cross prod-
n n n
- , Ult of fe.1m1eS ttMt Mtb other. Each time a
L (Z,- zr L L WJJ large value occurs near another large value.
the numerator calculation will be large. If
I= ! l= l j = I
small values also co-occur, they will both be
large negati\'e numbers. and mul1iptica1ion
where z, and z, are the variable Vlllues at will also result in a la.rge positiv,e number.
points I and J, respectively: z with a bar The swn of the cross-product will tl1en be
above it is the variable mean. The calcula- large for spatially correlated data secs. bui
tions standardize individual observations standardized by the denominator to be near
by subtracting the mean so that tl1e Moran's l.
J range is constrained.. and to ease interpre-
tation. lf lbere is oo sparial soniug i.o a data set.
large values a.re just as likely to be near
The wuare weight values that decline small values as elsewhere. with positi\'e vaJ.
wilh distance, laking a posi1ive value ifz1 ues tending to balance negative ·values. mak·
and z, are considered neighbors. and Oif the ing the sum in the Moran's I numerator near
values are 001. Di.stance declines can be zero. T11e spo1ial layers ha"e low spatial cor-
spec.ified in sever.al ways, with a specific relation because knowing a value at a loca-
distance threshold. for direct adjacency such tion does 1101 provide much info:rmation
as reqlliring shared edges for polygons or about values in adjacent locations - 1.bey
raster cells, or using the rook•s orqueen's are just as likely 10 be different or similar 10
case adjacencies described in moving win· the observed value.
dow analysis in Chapter IO. Weights are typ·
ically zero for all olher, more distant features Moran's l approaches -1when values are
or cells relative 10 any cell or polygon. anticorrelate<I - a large value is more likely
to be next to smaUvalues than next to other
large ,'llh1es.
Chapter 12: Spatial Estimation 527
Moran's I Calculations
Test aver Formula For our example
• • raster at the far left,
3c 5b 2c n 'L i: Wu(Z,. Z)(Zj. Z)
!·I J•I n,9
9d 11 4 I•" 11t
Z. 3.5.z . ....• a • 6.67
• r L (Z;. 'ZJ' 'L L wij 9
•l ••1 J•I
8. 10h 8k To make lhe
notation simpler
let 6 , • (Z1• Z)
If we adopt the rook's case adjacency, the~ e.g ..
the Wlj weights take the general form of: ~
6 0 -(3-6.667) •·3.667
Wij weights table, normalized ' a
0
'
- 3.667
0 b c d e f Q h k
0 0 0.5 0 0.5 0 0 0 0 0 b -1.667
b 0.333 0 0.333 0 0.333 0 0 0 0 c -4.667
c 0 0 .5 0 0 0 0.5
0 0 0 d 2.333
d 0.333 0 0 0 0.33 0 0.333 0 0 e 4.333
f -2.667
e 0 0.25 0 0.25 0 0.25 0 0.25 0
f
g
0
0
0 0.333 0 0.333 0
0 0 0.5 0 0
0
0
0 0.333
0.5 0
-
9
h
1.333
3.333
k 1.333
h 0 0 0 0 0.333 0 0.333 0 0.333
k 0 0 0 0 0 0.5 0 0.5 0 and
L• (Z,. Z)'
• • itl
Re-arranging. then n ·L L wu 6; 6 J
1
applyi ng the formula:! =- .-"-' ~J·- -, - . - - becomes
z
~ 6, :84
Li•J t.' ·Ii=J Z: w0
I )=I
25.2 = 0.300
=
84
Fla:u~ I ? 14: Mornn't I is a measure: of the com:L11ion amou11 neatby obse:r\-atioo1. This; example
0
sbows 1bc enl.:ul:nion of Moran·, If«" smnll Nl$terd.1tll f c1. The fomml11 is" ,vci_gbtcd combll1.,1ion of
rnlues in 3 ne:i!Wborhood. repeated aCfOss .ill cells. Positive Monm·s I v.ilues indicate po5itive spatUI
com:l.atioo. aod negative ,,.lucs indicate auti<0rrdatioo.
528 GIS Fundamentals
Note that cakulation using different TI1ere are many olher global and local
software may calculate different values for indices of spatial au1ocorrela1ion including
Moran's I when using the same data. TI1is Geary's C. or the Gi of Ge1is and Ord
is largely due to how the weight matrix is (1992). and 1hey perform jn a manner simi-
cons1nic1ed, and if and how row uonnaliza. lar to Moran's I. The indices vary in bow
tion is impleme111ed. Different weights will they estima1e die correla1ion and in the spe-
give a different answer, such that sublle cific c.alculations of relatedness and separa-
difference in crea1ion \\1U give slightly 10 tion. TI,ese and a number of additional
largely different numbers. Analysts should ,opics are quite well covered in the sug·
be careful in comparisons across softwares gested reading at the end of this chapter.
unless tltey can comrol all aspects of tl1e
calcula1ions. partkularly the weight matrix. Spatial Regression
The Moran' ~ I is tem1ed a global statis-
tic because ii idemifies the average or area- Spatial regression and other statisli-
wide spatial correlation. It doesn't identify cally based models typically 11se observa-
how correlation varies across the area of tions of dependent variables. other
interest. A local Moran's I. or LISA (local independent variables. and sample coordi-
iodicalor of spatial autocorrelation) is often nates 10 develop prediction equat ions. For
used to identify local cluslering. Spatial example. we estimate temperomre across a
correlation is mapped by calculating an region using a network of temperature sta-
index for each feamre. and then planing the tions. We may interpolate as described in
correla1ion at each feature. Under the typi· the previous section to estimate tempera-
cal standardization, local weights sum 10 1. mre. using only the station coordinates and
and the local Moran's I is calcttlated as: tbe corresponding temper.uttre measure-
mcms. However. we may note a s1rong
cooling 1rend with elevation. and combine
temperature measurements witli elevation,
Z-2 la1imde. and longitude in a siatistical model
I• = ' · L,
'<"' WIJ · (Z- 2)
l'I J that provides bener temperantre predic-
L (Z, -l:1' j .J•I tions. We would then use this model to esli·
mate raster 1emperamre layers for the
region.
We must remember that the Z·bar in Spatial predictions are often described
the above equation is t.he rue.an of all fea- mathematically by a general ftcnct ion. such
tures. and 21 is sununed overall features in as:
the data set in Ille de11omina1or of fraction.
Weigh1s w0 are defined as zero over most of
the data set. with nonzero values only
"near" the local feamre. and Stun to one.
Other neigJ1borhoods may be specified, 2 1 = f(x1, y1, a 1, ~J) (12.5)
changing the number of neigl1borsand their
weighlS for each focal feanire. A "quee11 's
case.. neighborhood may be adopted. with
all 8 adjacent cells panicipating and an
eq11al weight or distaoce-<lependen1 weight where Zi is the estimated oulput value. at the
set that sums to one. Weights forvector coordinates Xi: Yi at point i: <Xi are variables
data sets may be more complicaled, but measured at point i: and Jlj are variables
usually are specified by shared edges. measured at other locations.
nodes. or with centroids within a specified
proximjty.
Chapter 12: Spatial Estimation 529
Trend Suriface and Simple Spa· There must be at least one more sample
tlal Regression point than the number of est:imated o coefli·
cients due to statistical constraints. This isn't
Trend surface prediction is a iype of spa- a problem for most applications. because the
tial regression tbat involves fining a statisti- best polynomial models often have fewer
cal model. or !rend surface. lhrough !he thrut IO coefficients much fewer than are
measured points. The surface is typically a typical sample sizes.
polynomial iu the x and Y coordinate sys-
tem. For example. a second-order polyno- Trend surfaces are not exact predictors
mial model woold be: in that the surface typically does not pass
through the measured points. There is a dif·
fereuce between the interpolated surface and
lhe measurement at most locations. Trend
surfaces are often among the most accurate
Z • •o • o,x • o2y • •,>f · o,y2 • o5 ><y U2.6l methods when fitting smoodtly varying Slit·
faces. such as mean daily temperature. Trend
surfaces typically do not have ..bull's-eye..
artifacts due to excessive lo(al influence in
where Z is the value at any point X and Y, and io\•erse distance weigtued imerpolators.
each op is a ooefficient estimated in a regres-
sion model. Least squares methods. Trend surface methods often perform
described in most introductory statistical poorly when there is a highly convoluted
textbooks. are used to estimate 1he bestset of surface (Figure 12-15). Ozo ne as shown in
oP: values. The oJ) values are chosen to mini- the raw obse"'atioos tan change rapidly
mize 1he a\'erage difference between the over shon distances. as can .Precipitation
measured z values and the prediction Slit· from a single summer thunderstonn, or pop-
face. ulation deoSity lrt a nlixed-use neighbor-
hood: this type of abmpt variation is often
.. • •
•••
•
J • • • •
• .' 't
. ~
t,
•
•
•
•
• '
>{II
•
'9
• " i>
• ;i,
•
• • •
0
• •
(
,.o.&. .... ...
.• • • ,1,
•
f'i;1u~ U -15i; A third-orde-r trffld surf:Ke fit to the mup!c poiub,
530 GIS Fundamentals
poorly estimated with a trend surf-ace. Even Much like !OW i111erpola1ors, weights in
high-order polynomials may 1101 be suffi- kriging are used with measured :sample vari-
ciently flexible t<> fit these complex. convo- ables 10 estimate values at unknown loca-
luted surfaces. tions. Witll kriging. the weights are chosen
in a sca1is1ically optimal fashion.. given a
Trend :mrfac~& may be exiended 10
include independent variables that provide specific kriging model and assumptions
some help in predicting the variable of inter- about 1he trend, autocorrelation, and stochas-
est: tic ,.,,riation in the predicted variable.
Kriging methods are. !he ce11terpiece of
geos1a1is1ics, initiaUy de\'eloped in the early
1900s for use in mining. Ore samples may
be expensive to obtain or process. and accu·
rate mineral occurrence and density predic·
dons difficult and valuable. Kriging
estimators were developed to incorporat"e
where x and Yare lbe coordinate locations. trends, autocorrelation. and stochastic varia·
and Q and w are independent variables mea- tion and also provide some estimate of the
sured al the poinl (X. Y). and Z is the depen- local variance in 1he predicled v.ariable.
dent variable 10 t,,e predic1ed a1 the poin1 Kriging uses 1he concept of a lag dis·
(X.Y). The op values are coefficiems for the 1a11ce. often symbolized by 1he lener h. Con-
predictive equation. usually estimated sider the sample set shown in Fi,gure 12·16.
through a least squares sca1is1ical process. Each value for the variable z is shown plot-
The value Z may li>e predicted at any location ted over a region. Individual points may be
where we have values for x. Y. Q, and W. listed as z1. z2. z3. etc., to zk. wben there are
k sample p0ili1s. The lag dirnu;ee for a pair
ofpoims is the distance between them. and
Kriging and Co·Kriglng by convention is denoted by h. The lag dis-
Kl'igiug is a s~atistically based estimator tance is calculated from 1be X and Ycoordi-
of spatial variables. II differs from 1be trend nate values for the sample poims. based on
surface approach in tha1 predictions a.re tbe PytbagoreaJt formula. In our example in
based on regional ized variable dteory. which Figure 12-16, the lag (horizontal) distance
includes three ma in components. The first
component is the spatial trend. an increase or
decrease in a vari:able that depends on direc-
tion: for example, precipitation may
decrease 1owards the west.
The second component describes the
local spatial auiocorrelation. that is. the ten-
dency for points mear each other to have sim-
ilar values. Kriging is unique and powerful
because we use tine observed cbaoge in SJXI·
tial autocorrela1ion with distance to estimate
values at our unknown locations.
The 1bird componem in 1he prediction is
random, stochastic variation. These three
components are combined iu a rua1be1na1ical
model 10 develop an estimation function.
The fmtction is then applied to the measured Fiam"ll' 11·16: Lag dl'Stru,c:e, , used in c~lcubt·
data to estimate values across the study area. Uig scm.i,-ariau«s for kriging.
Chapter 12: Spatial Estimation 531
and increases 10 a. plateau as the lag distance flgurt U - 1.9": A \'arioa,am, a pl<>< o(calculated
h increases. This is the typical fonn of a ,,a.r- 2111d fit tc:mi,::uiance \'t. litg d.itltu)('c:.
iogmru. Tbe 1111gger is the initial semivari-
ance when the autocorrelation typically is
highest. The nugget is shown at the left of plus or minus the lag tolerance. The semi·
the diagram in Figure 12-18.1he semivari- variance is tben calculated for each lag dis-
ance at a lag distance of zero. This is the tance. An example is shown in Fig11re 12-19.
intercept of the variognun. The sill is the Remember. each of these points ~s calculated
point at which lhe variogram levels off.11tis from equation 12.7 for a given tag distance.
is the "background'' variance, and may be A line may tbeu be fit through the set of
thought of as the inherent ,,.,riatiou when semivariance points. and the variogrnm esti.
there is l!nle auto,correlatfon. The nmge is mated. This line is sometimes called the var,
the lag distance al which the sill is reached. iogram model.
The nugget, sill. and range will differ among Spatial prediction is among the most
spatial variables. important applications of the variogram
A set of sample points is used 10 esti- ~ux lel (Figure 12·20). There are many varia-
mate the shape of the variogn,nl First a set l(OOS and types ofkriging models, but the
of lag distances h 1. h2. h3, etc.. are de6ned: sun piest and most couunouty applied rely ou
each distance signifies a given lag dis1ance. the variogram to estimate "optimal" we.ights
for prediction. These weights are used to
estimate \lalues a1 unknown localions by:
n
,,.,..111
............. " :";;"cc', _._--;---- (12.9)
•.
.•
• •.,.
•• .
•
' {:
....
:/ >.i·· ·:
• •
• • ..•
• • •
•
'••
~ .a • •"
-, .,,·a e•
• ' ,! •~~'IJ· ',,. ..•!., I!
•
olJ,
• ''
• 1!'
• o O
·O
. ,,i
A
•
. .·' .,8,~
.
..,. .& • .... ... •
. .
0
• •• • • •
• • #O'O&f. ••
''*"""'•Oz
~
,e.. o*'° •
"°''.. '
ti>
.
0 • ,,. .,
a··,, •.•~.o. •
' '
• • • ,t .•"" .•
;
•
o •IP
.. . ..
•. •
.
°"""""
0
•
".,
....
~-11
0.,. • ·~ • •
••e ,....
11+1,
- .• .1 74.75
Flc:u1~ 12. 20: Sample points and indicted value:, from au application ofkriaing to die ozone data.
Th<"rc: :ire lllilll)' altc:nutti\•c: fomu of finint ~ krigi.i1, n,odd . <:h0$ct\ biscd on goOdnc:ss of fit of 11 ,·11rio·
gnun nK'Cld. and cm ll 5:llllplc de1cm1incd mtcipoln.11011 di31:,.1m:.
directional di.fferenc.es in the variance. These Error values are squared to remove the sign
considerations are beyond the scope of our effect. and then the square roo1 raken on the
present discussioo. and tbe interested reader sum to re1um 10 the measured uni1 scale.
is referred to more complete treatmenss. instead of a squared unil sc.ale. Predictions
such as lsaaks and Srivastava (1989) or either above or below the observed values
McKillup and Dyar (2010). listed under sug- are generally considered to be equally bad.
gested reading at the end of this chapter. and rhe error is averaged ove.r all samples.
There are more advanced spatial predic- However. squaring the errors magnifies the
tion methods. and spatial estimation is an influence of outliers. extremely large posi-
active area of research. with more complex tive or negative errors. so some .argue thal
techniques such as spatial Bayesian estima- 1his is an overly pes.simis1ic estimate of
tion and space.time models. These topics are error. or at least when there are large outU·
more appropriately treated in more advanced er>.
courses and texts- The meou absolute en-or is an alterna-
tive error metric. less often used but less sen-
sitive to outliers than the RMSE. The MAE is
Prediction Accuracy
defined as:
We often need to characterize the accu-
racy of our spa1ial estimations. This helps us
choose the best model and place limits on
model applicatio11. Model assessment is a ( 12.12)
well-developed field. and will not be thor-
oughly reviewed here, but a few main COIJ·
cepts are introduced.
Accuracy is measured at asses.smem II ~ubstimtes theabsolute vah1e operation for
poims. locationswhere we know both the the squaring/square root operations and so is
rme value and the estimated values for a less sensitive to outliers. but othel1\~se is
variable.We oflen describe a sample set with quite similar to the RMSE.
n points. with estimated or interpolated val- Another accuracy metric is the mem,
ues at any itb point denoted by Pt, and the bias error.
true or observed vaJue at tJ1e poiut denoted
by o1• Each assessment point provides an
eJTOr estimate:
(12.10)
(12.11)
Chapter 12: Spatial Estimation 535
Overall measures of agreement between or all ofour samples while interpolating, and
an estimated and tn1e surface have been pro· leave few or none for an accuracy assess-
posed. includ ing Willmott's index of agree- ment.
ment: . Exac, ime_rpolators are particularly vex-
A mg, As you nugbt reta.11. Thiessen polygons.
!nverse distance weighted. and some spline
mterpolators have zero error a, au sample
d= I - s •c,
- CL.._ __ (12.14) poims by definition. because !hey are fonnu-
' lated to exactly ren1m the observed values at
I <IP.I +loJ>' !he fined points. One might !hi11k Iha! we
o- I mus, hold a se, of poims in restr\'e in order
with to gel a 1rue estimate of the imerpolator
accuracy.
l11ere are related techniques. known
variously as Jea,,e..one·out, bootstrapping. or
P,' = P1 -0 (1215) cross-validation. which address botl> the
t~ndersan~plfog and robust accuracy es1ima-
uon requirements. Bootstrapping fits the Sur·
face as many times as there are sample
and points. eacl11ime withholding one of 1he
pomts. We fit the surface the first time, with-
holding !he firs, poin1. We C1lll !hen sublr8C!
the witl!held measured value (01) 10 the
(1216) mrerpola!e<I value (P1), and obtain one esli-
mate oftlie error. We tlien repeat tliis protess
for the res, of !he sample points. For n san1-
ples. we fit the surface n times. We can then
compare !he withheld poi111's true value. o
Prim!'IY cita1io~ of dlese and o!her accuracy
memcs are provided at the end of this chap- to the fi1 value P1. giving us o error values. ~.
ter, and in the considerable literature on We can !hen apply equations 12.10 through
interpolation and spa1ial estimation. 12. 16 10 charac1erize the accuracy of each
surface.
Assessing the accuracy of an interpo-
la!ed surface requires we collect bo!h Bootstrapping or similar accuracy esti~
observed and predi<1ed values a1 a se.1of mates should be used because the RMSE
points. In an ide.a1 asse.ssment, these would estin>ated from the fit points ofleo gives an
be independent of the samples we use to esti· optimistic eslimate of accuracy, particularly
mate the surface, bm this is rarely possible. when sample size is small U not provided
Samples are often e.xpensive, difficuJt to col- by the G!S software used in finu1g. 1heo
lect. and spar,se. and most interpola1ed sur· sample data should be exponed 10 a s1atis1i-
faces would benefit from additional cally oriented surface fining system. for
sampling. If each new sample can materially example, 1he op,,n source statistical package
improve our interpola1ion. we are hard. R.
pressed 10 ho.Id them in rt'Serve for an accu-
racy assessment. We are tempted 10 use most
536 GIS Fundamentals
c.alculate distance from sample points 10 the the standard error dislance should c,olllain
mean center. The largest distance. average 68% of the data. Previous e><perieuce may
distance. or tbe standard deviation of tbe dis- help: for example.•one migbt know 1hat in a
tance from points to the center then may be particular region.. 90% or more of a wolf
detennu,ed. pack core area is within L0.8 km of a mean
Mean cirdes have the advanlages of ceme-r.
simplicity and ease of constn,ction, but they In many c.ases. circular core areas are
assume a uniformly circular shape for the sul>optimal because 110,ny variables are
core area. Some measures of me.an center known 10 exhibit nonregular shapes. and a
may be biased by extreme poims; for exam- circular core area is identified when using
ple. tile maximum distance circle in Figure the mean cemer I mean circle methods.
12-21. Note that the outlier near X '"' J5 and While me.an circle methods .are often used in
Y • 17.5 results in a large ma~imum distance exploratory da1a analyses. other methods
circle. This circle contains substantial area have been developed 10 more effectively
with no points nearby.audit is probably an identify irregularly shaped core areas.
overestimation of the core area. It is not
cleartha1 the inean distance or standard error
dis1ance circles are bener at defining a core Convex Hulls
area. The core areas defined by these me.a- Convex hulls. also known as minimum
sures may be appropriate for some applica· convex polygons. are perhaps the simplest
tions. but they are often too small in others. way to identify core areas with irregular
Some multiple of the. mean distance or stan. shapes. A co,"'e." hull is 1.he smallest poly-
dard error ,ua.y be chosen based on statistical gon created by edges (Lines) that 00111·
assumptions. or past experience. For exam· pletely enclose a set of poiuts. and for
pie, if we asswne lhe samples follow a ran- which all eitterior aogles be1ween edges are
dom normal distribution. then a core area greaterthan or equalto 180 degrees (Figure
defined by a circle appro:<imately 1.8 times 12-22). An exterior angle is measured from
a) b)
~ .. . • .. •
·. . ..·,.
•' ..•.
. . ..
- .. . •. -
•
•• •• ·J
. ...., ~
c)
~ ..
f'i&ut'f 12-22: A iet of t'(>illl9 (a) Ul-3)'
be cucloscd by coon«.bng the ..outct·
most'' points i.u the «-t. 'flus "'hu.11"
.• ·' ddinct 11; poln~on. Hulls m:.y be cb:u-·
a-ctcrized as coi,c:iwc (b). when some
- ..
cxtcrioraug1« arc leu than ISO
degrees, Ol'COO\'<'.:< (c}, when :t.U c.'<tt'•
rior angles 1trc gi-ca1cr tbn.u Oii' cqulll lo
JSO deg:r«s.
•I "•
•
538 GIS Fundamentals
one edge or side 'lo another through the Convex bulls are widely us.eel because
region "outside" of a polygon. Squares. tri- they are simple 10 develop and interpret.
angles. and regular pentagons are all exam- and there is little or no subjectivity in their
ples of convex hulls. while stars and application. The shape of the convex poly-
crosses ate examples of aoncouvex bulls. gon is de!ellllined solely by the arrange-
While these geometric figures have regular ment of the sample points. and not by
shapes. most convex hulls derived from c,ontrolling parameters 1hat must be speci.
sampled points \Vill not. fied by the human applying the· method.
Convex hulls are often considered a
They represent the irregular shapes com-
natural bounding atea for a set of points. mon to most sampling.
This assertion is accepted by most analysts A convex bull may be easiUy created
when there are no outlying da1a points. far with a "sweep.. algorithm appli;ed to a set
removed from the rest. \Vhen outliers are of sample points (Figure 12-230). These
present the convex hull will often be are the locatioos oft11e evems of interest.
unreasonably large. for example. observations of a rnre animal
or crime locations. An extreme point is
identified from !be set, usually the sample
wi1h !be largest or smalles1 X or Y coordi-
a} b}
• ••• ,,, .
. . .. . . ..
.. .• ., .,... •.
..., ...
..,
• : .. , :
'
•
....·. .. .. ..
• • ···L ,
'
c} d}
.... ' •
~-.
. . .. . ...
•
•.
..., .....
. " v
.-., •.,.
••
• : ... I •
•
•
... .
':. .• ....
~
•
Fia:m•,r U-2.J: 11:ie conve;<. bull for a sec of poiuts (0) may be calculated from a 5\\'C<'P 3lgoritbm. Start·
ina: \\ith a.u c:'<tttmc point such as S (b). successive minimum dcO«tioo aua.J« ar¢ selected (C) until
die: srnrtin_g poinr it rc::.chcd (d).
Chapter 12: Spatial Estimation 539
. ' ..... .. . .
• • ..
•• • ••
• ": ..
•
... .
. .
. .-: . ..'..
~
•
-
. 'i· -..._.,· "-'·
,. ... . "I •Y
~
. :..
fia:n~ 12~24; An Q.amplc ofa core: 11ru dcfiucd by n conn~ bull (o). aud cbanM:tcristic bulJ polygons
(CHP. iu b). Th< Wdcd area is oJf<~ as a core a.rca. wid1 a smaller. higher sample dcruity. aud argu-
ably mote t1e<:t1tn1c ta'C'n ide1uiticd by the CHP.
540 GIS Fundamentals
large.st polygons:. however. the method is An example will help illustrate these
easy to apply and arguably provides a bet· ideas and the process of kernel mapping.
ter estimate of core areas when compared Consider samples to detect the density of
to a convex hull. particularly when outliers defects in a tile floor. Each tile is 0.5 in
are frequent. across. We tOlltlt the number of defeclS per
tile. l>cginning at one edge of the tile
mosaic. We will s.how the samples coJ.
Kernel Map pl ng
lected along a line. but the process and
Kemel mopping uses a set of sample principles are similar in two dimensions.
locations to estimate a continuous density Figure I2-25 shows the resul1s of a
surface. Kernel mapping is widely applied sampling along a line segmen1. One defe<:1.
be<:ause it is mathematically flexible, rela- or faull. is found on a 1ile located 2 in from
tively easy to implement. may be robust to 1he Sia.rt. and it is represemed by a rectan·
oulliers.. readily incorporates clustered gte 1wo units tall. Each fault represents a
samples_ can represent irregular shaped density of nvo units. because each tile is
core areas. and is often statistically based. 0.5 in across - hence l/0.5 • 2 faults/in.
Kernel mapping is based on a density We observe two fai~ts a1 2.5 in (four faults/
distribution that is assumed for each sam- in). one at 3.5 in. and addi1iona I observa-
ple point. These density distributions are tions until our las1 fault observed at 12.5 in.
placed over the s,imple plane, one for each Note that the densi1y is in 1he form of rect·
observation poin,. and vertically added to angles 1hat are "stacked" two u11its high for
detem1ioe the compos.ite density from the e.ach fault observed fo r a tile.
sample. This composite deusity may be Note two things about the density esti-
used 10 identify a core area. selecting the mates.. First. we assume a characteristic
densest areas fi~1. shape for the density derived ft6il! Meli
obser\laiion. In Figure 12-25. we assume
the shape of a rectangle for each observa-
tion, with a uniform density across the tile.
-;., 12 Eoch bot" represents Otl ovtl"09e denSlty. the l'lelgt,t of the bor t 2 for each
~ fouU observed w, o tlle. where eoch hie Is 0.5 inches 'Aldi? Th:s gf'/es a loco1
£ 10 eslimote of fhe density of faults per .nch of tile For example. 1he hie 01
.~ )( d,stonce • 5 5 hos fov foults. y1eld1ng on averoge <ltt1S1ty there of .4 I 0.5. Ot'
";::: 8 foull'S pe.r 11eh \
is _
§
S6
,:;- - -
·.;;
.," 4 - .... -
"'
,:
a!" 2 -
I 2
-I
'' 2 t
-~
2 3
-
I
0 .12 , - - - - - - - - - - - - - - - - - ,
o) two sample points. at x • 22 and
.a 010 x • 85, plotted individual
density distributions.
tc:: 008
-
withh·22
:,
0 .06
004
0 .02
0 ·00 +-
o_ .e:,.._2~0-...:,,_4To---6~0-.e:...._s~--'~100
sample location
012
co b) many sample points.
0.10 plotted individual
:;::: and composite density
g 008 distributions. with h=2.2
.2
>-, 006
~
(/)
c o.o•
(>)
-0
002
000
40 60
sample location
Fl11.11•t 12~27: lndi\'idual dc:utity di,tributions may be plotted for c:,ch sample obscn·ation. as. sl1owu for
IWO poir11i ia ll onc-dimcnl"ionnl samplini: (0, llbo\'c). OU.lnl>ution# for the nnirc &:1tl1plc set «re ploucd,
and ackkd to crc.;itc .i combined mimntc of lbc ckns.ity di.s.tributiou (b). We us.,i..,11y d100,c .i bandwidtJ,
parameter. b.1ba1 c:ontro~ the s.bap: of the iudh·idu.,1aod bcncccompos-i1c dc:osity distributions. Narrower
bandwid1.b1 ttsult i.n highc:1' aod n:irrowcr pcnb {G).
Chapter 12: Spatial Estimation 543
When all observed points are plotted, there used to estimate both density distributions,
is a collllllensurately large number of small. each observation is spread •cross a brooder
overlapping bumps. as shown by the thin in1erval when we choose a larger bandwid!lt.
lines in Figure 12-27b. These may then be We observe the same change in the
summed venically 10 create the cumulative
peakedness when we change the bandwidth
density distribution. shown by die thick line for continuous density distributions. such as
in Figure I 2-27b. I.be Gaussian distribution sbowa in Figure
We often choose oondwidtlt parameters. 12-27 and equation 12. 16. A sample is plot-
symbolized i>y h. that define the "spread" or ted using a Gaussian density function for
width oftlte individual density distributions each observation and a bandwidth of h • 2.2
(Figure 12-27c and Figure 12-28). Pcrt,aps in Figure 12-27b. Reducing ·u,e bandwidth to
the simplest way to unders1and the band- h = I narrows the shape for each individual
widlh is to thlll.k of the bi1ming inter\lal in sample and resulls in higher~narrower. more
our example in Figure 12-25. There. we peaked shapes in the cumulative distribmion
counted tile defects for each 0.5 in tile. and shown in Figure 12-27c.
plotted a rectangle corresponding to the Kernel mapping is generally a three-step
resultant faul1 density. Our bandwidth was process. as may be surmised from the pre-
se.t at 0.5 in. \Ve just as weU could use a ceding discussion. First, we collect samples
bandwidth of l in.. counting the number of and the co11comi1an1 coordinate locations.
defects per two tiles ( I in). along our sam- Second. we choose a kernel density function.
pling line. This would give a related. but f inally. we choose a bandwidtl~ h. apply tl1e
slightly different estimate or the density dis- kernel density distribution. and sum across
tribution of defects along our sampling line. eac1t sample area to achieve our composite
As sliown int.he right panel of Figure 12-28. estimate of density.
the estimated density distribution for the first
7 in or our sampled line is less "peaked" or
"spikey." Although tile same sample set is
12
bandwidth h • 0.5 bandwidth h • J
.s5-;:: 10 10
.2: 8 - 8
E
:,
.5 6 6
l:'
·;;
-
~ ( - - 4
-
'5 2
~ - 2
I 2 3 4 2
111 2 7 2
0 25 5.0 0 25 5.0
x • Distance from starting point (Inches)
Fltn,.. t2..28: We choose bclh 1bc ~c:imnJ f<nnl of l11c denslcy disuibutiou, rutd a bao.dwidtb
p:tr;unclcr ihat :iiffeclS the $hn1>c of tbc di.itribu1-i~n. Here. die f'C'CtanguJnr1,l~pc ofthe fauh dcuri1y
b«oruC5 broad.er and shorter as the bru,d";<bb cbang« from 0. 5 lo 1,
544 GIS Fundamentals
,......
'"
\·"'...~
OOtO
1::\
.i '...."'\
......
,..,
'""
b)
Chapter 12: Spatial Estimation 545
reel optimum. The motivations behjnd vari- bandwidth (b) relative to the largest band-
ous OJ)limum bandwidtlis are described in widtlt (c). These different bandwidths result
the books by Silverman (1986) and by Foth- in different core area polygons (d through O.
eringham et al. (2000). listed at the end of Empirical tests and experie11ce guide the
this chapter. choice of best bandwidth.
Core area delineation is a primary use
for esrima1ed density distributions. As
expected. the identified core areas arc
dependent on the selected bandwidth. Figure
12-30 shows vertical views of two-dimen-
sional densiry distribmious for optimum (a).
below-optimum (b). and above-optimum (c)
bandwidth. O.rker shades of gray show
higher densities. and note cbe narrower.
more concentrated distributions at the lowest
• • • • •
•
•
• •
• • •
• •
•• •
·~.. ..• .•..•
• • •
a) b)
t· .o
c)
• • • • •
• •
•
• •
• •
......, . . . . • ......' • •
• . •.
··~. . •
• • • • • •
•
• •
• • ..
• • •
• •
,-:,· ...
,..t ...
:-v; .(\
.,
,....-•
•
,.~-..
••
d) e) f)
Flautt 12.JO,: Kc1nd mappiu; may be used 10 idclllify core areas. ahhougb die areas th3it wilJ be dc(11icd
dqxttd on the mcl1.od t$cd. P2nds O 11tmt~b C sl1ow Gtluu iitu dcflfl1y dislnotltio1a for:. snmp le sci under
\'nrying baudwid1hs.. while d thrwgh f sbowcorresponding core (lfC..15 encompan i.ng 90t<+ of the de11$"ily
dlslributious.
546 GIS Fundamentals
Time-Geograp hic Density Estl· 9,1 • ( PjlOf P, P,) + O((P, P1) S M L) I} (12,21)
matlon
D~sity estimators have been devel-
oped for space util.ization by moving objects. where D(P. P1) is a distance between any
typically animals for home range analysis, poim P and the control point P1• and ML is
al!bougll sometio,es olher objects. An object dte maximum distance the object could pos-
may be observed periodically through SJ)ace, sibly travel between the successjve conLrOl
for example, whe.n a GNSS is attached to a points P1and P;, ML may be estimated by:
migrating penguin, nod the position relayed
10 a bases1a1ion. These positions are often
called control poilHs. because they establish
the location of the tracked object a, a fixed
112 22)
point in time. This sequence of control
points defines a path (Figure 12-31 ).
Much as when using kernel deosi1y t 2. Our task is to caJcuJate Lhe area of tbe
fuuctfons for es1imatiug a core area. a time· elliptic volume that represeot.s the occu-
geographic estimate of space use is a com- pancy probability. given our observat.ions.
posite of many obseniations. Here, e.ach pair Two paths are shown betwoon the points,
of observations ruay be considered a density one traveling diSlauce dm and auother d.,.
volurue. proportional to the probabiUty 1ha1 These two paths have the same length. by
lhe object occupied a location during 1he the definition of the boundiag ellipsoid. and
time illlef\'lll (Figure 12-33). A uniform den· Ibey are also each equal 10 the long axis
sity function .is the siruples110 understand. length of the ellipsoid. 2o. Geometric rela·
implying the objec.1 was moving at maxi- tionships between the interpoint distances
mum velocity between the 1,vo controlling and the dimensions of an ellipse allow us 10
observations. but aloog an unkn0\\11 path calculate a and b. rwo characteris1ic dim en·
within the ellipse. A unifonn density func- sions. which in tum allow us 10 calculate the
tion should have a vohune equal to the like Ii· area. nob. This may then be scaled by height
hood of occus>ancy. as with a standard kernel 10 assign an occupation probability (Figure
density estim:ator. For simple shapes such as 12-33. lower halO.
a three-dimensional u11ifom1 probability dis. The process is repeated for overlapping
1ribution. tbe volume is equal to the area point pairs across the set of observed co01rol
times height. poinrs. TI1e next two points in the se.quence.
Figure 12-33 shows two points. P1 Pz and P3. are paired. and 1be density ellipse
observed a1 ti.me t 1• and P2 obsef\1ed at time calculated. S\1m1ning the deasities where
P1 tit (x 1• y 1). and P2 al (x2• Y2) Fl2nre 12..33: The proccu of cakulat·
d,. ·a,·
ML · ( It· 11) ·v
ini :a gcodlip" dens ity bc1wttl'.I two
5eq\Kntial poioti.1 rq)ire&cuting tbc lil.c-
O• MU Z lilKK>d of occupation. Her¢. a uuifonn
probability is assumed 11crou the
cl • (.Xi - X1)2 • (v., • Y1) 2 ob$m1t1:ion, .
bl • oz • ,z
Area • 1tob
l
'" ···~ .••..• ,.,,,,dn"• ••·····
.......
1 • Two points. P1 and Pz, are rMasured
at time 1J and 12, for an ,object w;th
a, maximum \IOlocity v The point
locations, time inle,vaf. and v define
an ellipse.
• The ellipse area can be calculated,
with lhe ellipse heighl SC8ted 10
a density volume proportion.al 10
1he likelihood of OCCUPc'OOn
\ ..
....~-·'..
./··
Time Interval / ............ P,i'
1
2
... - ·,..}
• 5 j
!
• 10
Flam~ 12-.H: Au cltruuplc oftbc o,•tt111ppi.i13 sc-1 or$,COcllipscs used to c-rc3te a c:<nnpotitc dcoiity function
from a tirnc~gcogl'.lpb.ic d,,1a Set. The &cqucncc bcgi.us \\ith P1 i,1thc lower left, through P9 in the upper left.
Note that tJ.e eontrol point ~,ubol size: ckno1es the time i.utcr,al sioi.c the last ~otrol point ob,c:T\·ati011
(adapted &om Do"'llS ct al.. 2011).
geoelLipses overlap. The process is repeated much. relative to how far it miglu have
for points Pi, Pi.J until lhe last poiut is moved in the time interval between obsen.ia·
reached. Figure 12-34 illust:ra1cs the overlap- 1ions.
ping se.1of geoell;pses from a sample set. The composite time.geographic density
EUipses may vary in shape. depending func1ion is shown in equation ( 12.23). where
on time interval. distance between fe.atures. f(x) is 1he densiiy a1 any poi111 across a sur-
and the 111.1xim11111 velocity (Figure 12-34). face; n is t11e number of obsen.1ed control
Longer 1ime in1ervats be1w~n observations poillls: Is and '• are stan and end times.
result in larger ellipses. irrespec1ive ofihe respcc1jvely: 11 and tj are consecu1ive point
distance between subsequent points. As the pairs: vis the m.aximwn velocity and D(P. Pi)
inierpoilll distance approaches the maximwn is the dis1ance func1ion. as described in
se1 by the maximmn velociiy. v. 1be ellipses equa1ioo 12.20. This equa1ion is used for a
become longer and narrower. and reduce Lo a set of point.s to estimate the density across
line wheu the poin1s are sp.1ced al 1be ma.xi· space. The numerator sums the weighted
mum possible distauce. Conversely.11te distance ellipse functions for each pair of
ellipses approach circles when the time sampling poiuts. and the denominator sc~1les
interval belween points is long but the this by the Jllaximum dis1ance that may have
observed dis1ance be1ween poillls is smaU. b~n iraveted during 1ba1 lime imerval.
This occurs when the object has not moved
f( ) _ I
x - (n-l)[(te - t 5}·vJ2 ~H
' I [ D(P.P,} , D(P.PJ)
(ti - tjl.v
J (1223)
Chapter 12: Spatial Estimation 549
o) b)
Hours
s
10
Ot -- -< 10
""'
c) d)
Fi&OJ't U..JS: An e.umplc ti.mc-~ogrophic den.sit)' function for a sc,quc:111ial kt of contl'Ol points. TilC
path (a) s.bO\VS both the: 1r.1.jccl01)' and Ultm·al bctwcc:11 obsa<rati00$. S11<:«ssi\·c fii,'lfcS (b tbtou;b d)
l11owdiffc:ttn11tU!timum \'cloci1y, •ialuc~ wi1b velocities of 2 (b), 2.S (C), ruw! 3 (d) f.:ml110Ut(courtel)•J.
Downs) .
550 GIS Fundamentals
where the control. poinls are cluslered and spatial variability oflhe target fe.ature and
where the distanoe between observations is the intended use of the interpola.,ed surface.
short relative 10 ti.me period benveen obser- Sample values are combined with sam-
vations, which in tum is dependent on maxi- ple locations 10 estimate or pred ic,t values at
mum velocities. This implies the net object
movement was small between control
unsampled locations. TI1ere are many spatial
prediction methods, but the mos.I common
points, although there is a denser area of are Thies~en (nearest neighbor) polygon,
likelihood there. local averaging (fixed radius). [o\'erse dis-
\Vhile a maximum velocity may be tance weighted. trend surface, and kriging
established from observations or theoretical interpolation. Each of these me1l1ods has
values. the shape of the distance f1utc1ion advantages and disadvantages relative to
often is 001. and must be carefully devel- each other, and there is no methc:xt that is
oped. A unifortn !function may be more uniformly best. Each method should be
defensible if the object is moving at near the tested for the variables of interest. under
ma.><imwn speed for most of tbe duration. conditions in the study area of interest The
However. a linear dec.ay function may make best tests involve comparisons of interpola·
more sense when the sampling interval var- tor es1ima1es against wi1llheld sample poims.
ies in frequency. and the object is often mov- Measures of c.are area are commonly
ing much more slowly than the ma.'<imum identified from spatially dis1ribu1ed observa-
velocity. TGDE is a developing field. and tions. This fonn of prediction identifies
the interested reader should refer 10 the regions of high probability for a.n object or
papers by Downs and colle~gues listed at the event Mean center or mean circle are simple
end of this chapter. measures. A convex bull. defined as the min-
imum area polygon encompassing all points
Summary and with tonvex exterior angle!.. is toil\,
mouly applied. More sophisticated measures
lmerpola1ion and spatial prediction include kernel mapping. based on centering
allow us to esrim.ate values at loca1ions scaled distribution functions aver e~1c.h
where they have not been measured. These observation. and vertically summing the dis-
methods are commonly used because our tribution functions.
budgets are limited. samples may be lost or
found wanting, or because time has passed
since data collec1io1t We may also interp<>·
late when coovening between data models.
for example, whe:n calculating a raster grid
from a se1 of contour Lines. or when resam-
pling a raster grid 10 a finer resolution.
Spatial prediction involves collecting
samples at known locations and using rules
and equations to assign values a1 unsampled
locations. There are many ways to distribute
a san1ple. including a random selection of
sample locations. a systematic panem. clus-
tering samples, adaptive sampling_. or a com-
binatioo of these. TI,e sampling regime
should consider tile cost of travel and col-
lecting samples. as well as the nature of the
Chapter 12: Spatial Estimation 551
Suggested Reading
Andersoni, 0.J. {19$2). The bome range: a new nonparametric e.stimation technique.
Ecol<>&)•. 6: I 03- ll 2.
Angulo-Martinez, M., Lopez-Vicente. M., Vicente.Serrano, S.M., Begueria. S.
(200!>). Mapping rainfall erosivi,y at a regional scale: a compariso11 of intell)Ola-
tion methods in the Ebro Basin (NE Spain). Hydrology a,;d Earrl, Sysmms Sci-
e11ce. 13: 1910-1920.
Anselin, L. ( 1995). Local indicators of spatial association - LISA. Gecgmpltical
Aunlysis. 27:93-l l 5.
Anselin. L. (2002). Under the hood: issues in 1be specification and interpre,ation of
spatial regression models. Agricultural £conowics, 17:247- 267.
Anselin. L.. Rey. SJ. (20 14)..41ode111 Spatial Econometrics in Pmc1ice, a G1dde to
Geod'a, Geodaspace a11d Pysal. Chicago: GeoOa Press.
Anselin. L.. Syabri. I., Kho, Y. (2006). GeoOa: An introduction to spatial data analy-
sis. GeograpMcal Analysis. 38:~22.
Ayeni. 0 . 0. ( 1982). Optimum sampling for digital terrain models. Pltorogrammetric
Engineer;ug and Remote Se11si11g. 48: I687- L694.
Bmgess. T.M.. Webster. R. (1984). OJ)limal smnpling strategies for mapping soil
,ypes. l. Distribution of boundary spacing. Jo11r11al ofSoil Science. 35:641-654.
Cressie. N. (199 1). Statistics for Spmfal Data. New York: Wiley.
552 GIS Fundamentals
Dale. MR.T.. ifor1in. M.J. (2014). Spatial ,lualysis, a G11ide for Ecologists. Cam·
bridge: Cambridge University Press.
DeGruijter. J.J.. Tcr Braak. C.J.F. ( 1990). Model-free estin,ation from spatial samples:
a reappraisal of classic.al sampling theol)'. Matliemarical Gealog)', 22:407-415.
Downs. J.A .. Homer. M.W. (2009). A characteristic-hull based method for h-ome
range estimation. Transactions i11 GIS, 13:527-537.
Downs. J.A, Homer, M.W., Tucker, A.O. (2011 ). Time·geographic density estimation
for home-range analysis. A1111als of GJS, 17: 163-L71 .
Du brute. 0. ( 1994). Comparing splines and kriging. Co111p111ers a11d Geoscie11ces.
I0:327-338.
Fotheringham. A,, Bnmsdon. C.. Charlton. M. (2000). Q11a111i1a1il'e Geogmph)•: Per·
specrives 011 Sp(l{ia/ Dara Analysis. London: Sage Publications.
Gelfand. A.E .. Diggle. PJ .. Fueates. M.. Guttorp. P. (201 0). Handbook ofSpatial Sta-
tistics. Boca Raton: CRC Press.
Getis. A., Ord. J.K. (1992). The analysis of spatial association by use of dist,u1ce sta-
tistics. Geographical ,l11alysis. 24: 189-206.
Goovaerts. P. (1997). Geostatis1ics for Nn111ra/ Resource E,·aluntion. New York:
Oxford U11iversity Press.
Griffith. D.A.. Layne. A. ( 1999). Casebookfor Spatial Statistical Data ,l11alysis.
Oxford: Oxford University Press.
Hutchinson. M.F. (1995). Interpolating mean rainfall with thin plate smoothing
splines. lmemnrioual Journal o[Geograpl,;cnl Jnfonumiou Systems, 9:385-404.
lsaaks, .E.H.. S:rivastava, R.M. (1989). ,l11 /111rcd11c1ic1110 Applied Geosratlsrlcs. New
York: Oxford University Press.
Lam. N.S. ( t 983). Spatial interpolation methods: a review. ,lmerietm Canograpl,er,
I0:129-l<fl9.
Lege11dre, P. (1993). Spatial autocorrelation: Trouble or new paradigm? Ecology.
74: 1659- 1673.
Mark. D.M. (1 987). Recursive algorithm for determinatio11 of proximal (Thiessen)
polygons in any metric space. Geogrnpl,;cal A11nlysis. 19:264-272.
McKillup. S.. Dyar. M.O. (20l0) Geos1n1l.s1lcs £xplni11ed, nn /1111'()(/uct01}' Guidefor
Eanh Scientists. Cambridge: Cambridge University Press.
o· SuUivan. D.. Unwin. O.J. (20 IO). Goog,.aphic h,fonnalion Analysis (2nd Ed.). New
York: John Wiley and Sons.
Silvem1an. B.W. (1986}. Density Esti111ntion. London: Chapman and Hall.
Varekamp. C.. Skidmore. A.K.. Burrough. PA. (1996). Using public domain geosta-
tis.tical and GIS software for spatiafinterpolation. Phorogrnmmetrlc E11ginee1iug
and Remote Sens ing, 62:845- 854.
Wi llmott, C.J. (1981}. On the validation of models. Physical Geography, 2: 184- 191.
\\Cillmon. C.J .. Matsuura. K. (2005). Advantages of the mean absolute error (MAE)
over the root mean square error (RMSE) in assessing average model performance.
Cllmate Resen~h. 30:79- 82.
Wonon. B.J. (1987). A review of models of home range for animal movement. Eco·
logical Modelling. 38:277- 298.
554 GIS Fundamentals
Study Questfons
12.2 - Describe four different sampling patterns. and provide the relative advamages
or disadvantages of each. Which do you think is used most in practice. and why?
12.3 - Draw a :systematic sampling pattern on the area below. left, and an adaptive
sampling paneru on the area below, right. Use the same number of sample p<>ints.
e.g.. approxim:ately 50. on both. Which do you think will give a bener estimate ofter-
rain localions a1 unknown points? \Vhy? \Vould increasing the sample number
change wltlch :sampling design you would think is best?
12.4 - Draw a duster sampling pattern on the area below. left. and an adaptive sam-
pling pattern on 1he are.a befow, right. Use the same numbe.r of sample poin1s. e.g.,
approximately 50. on both. Which do you think wiU give a better estimate of ten-ain
locations at uu:knowu points? Why? \Vould increasing the sample n\lmber change
which sampling design you would think is best?
Chapter 12: Spatial Estimation 555
12.5 - Draw tl1e TI1iesse11 polygons (nearest neighbor interpolation) for tl1e set of
poims below.
12.6 - Draw the TI1iesse11 polygons (nearest neighbor interpolation) for the set of
points below.
• •
• •
•
•
•
• •
•
•
•
556 GIS Fundamentals
12. 7 - Calculate the cell values indicated at the crosses below. using fixed radius sam-
pling size with the shown circle.
6
•
18
•
18
•
• 29
c+ •+
10 22
8 • ,5
• •
5• . 31
•
• 13
•
. 11
+d 3 +t
•
56 14
• • 12
•
•0
9
•
12.8. Calculate the cell values indicated at the crosses below. using fi.~ed radius sa,n.
piing size \\ath tbe shown circle.
c 6
b+ + •
18
•
18
, 29 •
o,
10
8 • ,5 22
• •
d+
5•
4 . 11 3
• 13
• 31 •
•
•+ •
56
f+
14 12
• •
0
•
• 9
Chapter 12: Spatial Estimation 557
12.9 - Calculate the Z values for the unknown points listed below, using au inverse
distance weighted approach. Use the three nearest known points (use i - 3. o ~ I).
Known points are sbown in map as filled circles aud corresponding coordinate and Z
values in the table at right.
x y z
...
10
0 153,95! 9 .t,478,7J,t 6 20406
11.10 - Calculate the Z value, for ihe unknown points lisied below, using an inverse
distance weighted approach. Use the three nearest known points (use i • 3. n • I).
Known points are shown in map as filled circles and corresponding coordinate and Z
values in the table at right.
JO x y z
0 153,9519 4,478,714 6 2040.6
·"• .t .o
.• l 151.280.9 4,479,647.3 1863.0
2 148.228 5 4.472.1~34 1992.I
·" .~
3 1•8.906 8
152,3832
4,46•,978.6
4,466.9288
2540 I
21063
'5 158.8273 4,475.746.9 2283.2
.a
.' c.10 .9
6 !60.607.9 4,475.874.2 1933.5
·' d ,12
,13
7
8
!63.02• '
164.<165 8
4.477.3579
4,-481,1735
1836•
1838.3
9 166,416.0 4,468,285 4 2523.9
Unktlown points'. !O 164.!69.I 4,468.370 4 2!388
ID X y z !I !60.6927 4,468.709 3 1854 2
0 155,8596 4,477.159.0 12 !56,537.9 4,464,724 2 18669
b !47,580.6 4.478.884 2 13 167,306 3 ,,, 62.8!6.5 24538
< 162.5357 4,469,960.2 !4 !<3,946.6 4.482.445 4 !8379
d 151,7149 4,463.755.2 15 144.7097 4 475.3230 19128
558 GIS Fundamentals
12.12 - \Vhat i:s 1.he primary difference between a treod surfac.e interpolation and a
kriged interpolation?
12.13 - Describe die variogram. What does it represe111 on the X and Y axes. and what
are the important regions/points of the plot?
12.14 - Draw the approximate mean cemer. standard deviation circle. and m:aximum
circle for the following data:
•
•
•• • ••• • • •
•
• • • ••' ..
• :• • •
• •• •• •
• • • • • ••
•
• •
•
12.16 - Draw the convex hull for the points depicted below:
..
..
•
Chapter 12: Spatial Estimation 559
12.17 - D:raw the convex hull for tJ1e points depicted below.
•
•
•
•
•
•
•
•
• ••
• •
•
•
•
• • •
•
•
•
12. 18 - O.escribe/define a kernel deusiiy map. Include bow tl1e values are based on
the samples.
12.19 - \\1ri1e down and de.scribe at least one equation used to generate a density sur-
face.
560 GIS Fundamentals
12.20 - Which image below illustrates a wider bandwidtl1? If you used the same data
for both plots. give two reasons for your answer. assuming the color scale ha.s ct,e
same upper and lower bounds for both surfaces.
A) Bl
..
12.21 - Wltich image below illustrates a larger bandwidtlt? If you used the sacne data
to generate both plots. give two reasons for your answer. assuming the color scale has
the same upper and lower bounds for both surfaces.
A} B)
••
•
•
-
••
. ..
'
561
Introduction
A model is a description of reality. ing, while simple spatial models often
Some model;; are analog. as in how a phys- apply sets of equations to predict a specific
ical globe isa model of the Eanh. Here. our continuous variable across space. Carto-
inleres1 is restric1ed to digital models, com- graphic model outputs are often nominal
puter-based represeuunions of spatial phe- (suitable or unsuitable) or ordinal (low.
nomena. These models desc.ribe the basic. medi1101. or high s11itability). Figure 13-1 .
properties or processes for a se1 of spatial for example, shows the ordinal suitability
feat11res. and. help us understand their fonn of locations for septic systems as a function
and beJ1avior. of soils. elevation. we1Jands, and water-
Many compUter-based modeis use spa· course proxunity. In contrast. the outputs
tial data. and are developed and nm using from simple spatial models are often imer-
some combination of GIS. general and spe- val/ratio measures sud1 as population den·
cialized computer programing languages. sity, accident frequency. or soil erosion
and spinial and non-spatial analytical tools. rates.
Spatially explicit models are a primary Corrographic models combine spatial
benefit of GIS technologies. and many spa- layers via overlay. b111Ters. reclassification,
tial models a.re based on data in a GIS. and other spatial operations. These models
These models may be run in the GIS. or the often employ the concepts of map algebra.
spatial data may be prepared in a GIS and describe<! in Chapter 10. b11t:may include a
then exported 10 a model that is developed broader range of operations. Sflitability
and nm outside a GIS. tmalyse.r. defined here as lhe classification
While there may be as many classes of of land according to its utility for specific
models as there are modelers. here we split uses. are among the most common carte,.
spatial models into three broad and over- graphic models but there are many others.
lapping categories: cartographic models. Most cartographic mode Is are tempo-
simple spatial models. and spario-remporr,I rally static because they re1>resent spatial
models. Joseph Betry. an early and well- features at a fixed point in time. Data in
known developer and proponem of spatial oose layers are mappoo for given periods.
modeling, deicribed canogrophic models These data are the b11$ii for spatial opera-
as automating manual map anaJysis and tions that may create new da1a layers. For
processing. while spatial models focus on example. we may be interest<?<! in identify·
applying mathematical relationships. Car- ing the land that is most suitable for siting
tographic mode.ls are most often applied to septic systems. Costs of installing the sys-
identify areas in suppon of decision mak- tem may depend on the slope (steeper is
562 GIS Fundamentals
costlier), soil type (some soils are harder to relationship for specific calculatfons of aver-
move or Jess useful for filtration). current age costs: for example. we can characterize
land cover (covered by concrete is unsuit- the relationships between soil types and cost
able. forests are expensive to clear). or dis- to excavate din, and we may use a cost-per-
tance 10 roads (heavy macllinery is mile for trauspon based on local rates. These
expensive 10 move). Spatial data on eleva- spatial data are combined in a canographic
tion. soil propenies. currenl land use. and model to assign a land value to each location
road proximity may be combined to catego- in a study region. The model is 1emporally
rize sites by their suitability for installing static in 1l1a1 the values for the spatial vari-
seplic systems. \Ve may use a mathema1ic.al
s
•
- /
':5~ ·
'
f'iao1·t 13+1: An ex3.mplc of a w il~bility ni.ap. produced by combining soill. clc\11tion. wctl;ind. ~nd
w11tCfCOW'Sc data. lucse anal)'$¢$ arc often automated vi.a a canogrnpbie mode:! to produce suitability
or otbct uoi.niu.11 ot ordiull.l milkii1gs O\'Cl' l.tflC :ll'CM (roun:csy state of DclawlU'C').
Chapter 13: Spatial Models and Modeling 563
ables. such as soiJ type or distance to roads. virus is a sometimes fatal disease that varies
do not change during the analyses. in prevalence through space and time. in part
Cartographic models may be used to because mosqt1itoes transmit 1he vims
analyze change e,·en if they are genera Uy not among birds and people.
temporally dynamic. For example, we may Cooke and his associates developed rel·
wish to analyze vegetation change over a 10· evant input variables. They compiled data on
year period, based on vegetation maps pro- the frequency of bird and hmnan infections
duced at the stan and end of the period. Each within each zip code in Mississippi over sev-
data layer represents the vegetation boundar- eral years. Human and bird ,cases were clus-
ies at a fixed point in time. Tite model is 1ered. with outbreaks concentrated in ru.ml
static in the sense tltat the polygon boundar- areas. and so road density ,vas used as a Slit·
ies for a giveJl layer do not change because rogate for ruraVurban laoduse. They also
the vegetation boundaries are mapped as compiled spatial ,~riables related to mos-
found at each poim in time. Our cartographic quito habitat quality. including stream den-
model is 'temporal' in how it compares veg- sity. vegetation type. remperanire. and
etation change through time, but lhe model precipitation surplus. These were combined
does not generate new boundaries at new with vims infection frequency at specific
locations. lt does crea1e a composite of exiSl- locations 10 fit a predictive statistical model.
ing li.nes thal exist in the input d.11a layers Mapped spatial variables were then applied
and notes. as attributes. the kind of cbMge in the model to predict outbreak risk across
that occurred. Most GIS-based spotial mod- the state.
els are cartographic models tbat are tempo.. Simple spatial models are co1nmo1l with
raJJy static. hundreds of examples found across a range
Simple sparia/ model, typic~lly apply a of disciplines. They typically 01clude a
set of equations to spatially rtsolved vari- model derived tlllougb sampling ruid a s1A1is-
ables (Figure 13-2). They often rest on equa- 1ical fining pro<:ess. a model that is subse-
1ions that are estimated from data at a set of quently applied across space to estima1e
observations :at poin1s or sub areas, and then important events. densities, or othtr chnrnc·
applied across broader geogniphic areas. teristics.
For example, William Cooke and col- Spalio-rempoml models are dynamic in
leagues reported on a model of West Nile both space and time. They differ from cano-
vims infecticr11among birds and the related graphic or predictive spatiaU models in that
risk of 1ransmission to humans. West Nile time passes explicitly within the running of
f{A,B.C.D.E) \
fil{u1·~ 13· 2: An c.umplc of a .-implc $patial model. wbctc a function is "pplicd to spatial inputs lo
cstuna1c an unpoltant spatial output.
564 GIS Fundamentals
the model, and changes in linte·driven pro· an oil spill may vary as the mod.el pro-
cesses within the model cause changes in gresses.
spatial variables. :Spatio-temporal models Spa1io-1e111pornl models are typically
often anempt to explicitly represenl pro- more Jimi1ed 1ban other modeling
cesses within the model.
approaches in the range and number of spa·
The dispersion of oil after a spill is an 1ial !hemes analyzed. bul 1bey pro,;de a
example of a process that may be analyzed more mechanistic represen1a1i01l of dynamic
via a spa1io-1emporal model. Currents. processes. S11bs1an1ial effon goes in10 devel·
winds. wa,•e ac.tion. and 1he physics of oil oping submodels of important processes.
separation and ev.aporation on exposure to Model components and structures focus on
air migb1 be combined in a model 10 predict one or a few key ou1pm spatial variables.
the changing location of an oil slick. The and input data themes are included ooly as
actions of objects as 1hey move across an Ibey are needed by 1bese subprocess models.
environmem may also be represeuied in a These 1empornlly dynamic models explicilly
spatio-temporal model. calculate the changes in the outpu1 spatial
Spatio-temporal models include time- variables through time. Feature boundaries.
driven processes within 1he framework of point feamre Joca1ions. and anribme vari·
the model. These processes are typically ables 1ba1 reflect 1he spacial and aspa1ial
quite detailed and include substantial com- characteristics of key output variables may
puter code to represent importam subpro· change wi1hin 1be model run. 1ypically mul-
cesses. Our oil evaporation example tiple 1imes. and with an explicit temporal
demonstrates 1he subprocesses represen1ed frequency.
in a dynamic spacial model. Oil evaporation Simple spacial models a.ud spacial s1alis-
rates depend on many factors. including oil 1ical analyses are often used as precursors to
visc~i1y. COil!j)Omettr oil ft.lcrioos. wii1d spario-reiil))(>ral n\odell. By Uttcovei'ing key
speed. 1emperature. wave height and ac1ion. processes or races. 1hey can guide further
and sunlight intensity. These processes may analysis. For example. in our oiU spill exam.
be modeled by su i1able functions applied 10 pie. the specific relationship bet, .reen wave
1
spatially defined pa1ches of oil. The sub- beigb1 or frequency and oil separa1ion may
mode I !llay estimate evaporation of various be represen1ed by an equa1iou. but 1be spe-
componenls of rhe oil in the palch. and cific parame1ers 1ba1 defme 1be shape oflhe
upda1e Ille characteristics of oil in 1ba1 pa1ch. rel31iousbip may be es1ima1ed via a s1a1isti·
Oil c-hemisrry and viscosi1y may change due cal proce.ss. Experiments or observations on
to more rapid evaporation of lighter compo-- separation rales at various wave heiglus may
nents. in nim affe<:1ing fun,re evapora1iou be collecied. and 1be specific model parnme-
calcula1ions. SpatiaJ feanires may cbirnge 1ers es1inia1ed. These may 1ben be included
through time due ·10 the represented dynamic as a component ofthe larger spatio·te.mporal
process: for example. the bouudary defining model.
Chapter 13: Spatial Models and Modeling 565
Cartographic Modeling
A cartographic model provides infor- in wc1lands. The model uses three input data
mation through a combination of spa rial data layers, containing lakes. roads. and bydric
sets. functions. and operations. These func- status for a common s1udy area. Spatial
tions and opc1ations often include reclassifi· operations arc appLied 10 the spatial da1a lay-
cation. overlay. interpolation. terrain ers. including reclassification. buffering. and
analyses. buffering, and oilier functions. overlay. These result in a sui1ability layer.
Mnhiple da1a. layers are combined via these This s11itabiLi1y layer can then be used to nar-
01>erations. and !he infomiation is lypically row sites for further evaluation. identify
in !he form of a spatial data layer. Map alge· owners, or otherwise aid in park site selec·
bra. described in Cbap1er 10. is often used to rion.
specify canographic models for raster data Cartographic models have been used for
se.ts. a variety of applications. These include
Suitability analyses are perllaps 1be most landuse planning. transponation ro111e and
common examples of cartographic models. corridorstucties, lhedesign and developmeru
These analyses rank laud according 10 its of water distribution systems. modeling the
utilily for various purposes. Suitability anal- spread of hwuru, disease or introduced plam
yses often involve the overlay, weighting. and animal spec.ie.s. building and business
and rating of mulliple data layers 10 catego- site selection. pollution response plalllling.
rize lands imo various classes. Relevant data and endangered species preservation. Cano-
layers are combined an<l the resultant poly- graphic models are so extensively used
gons are classified based on the combination because they provide infom,atioo useful to
of attribu1cs. :Figure 13·3 illustrates a sim- managers. the public. and policy makers.
plistic canog_raphic model for the identific.a- and help guide decisions requiring the con-
tion of potential park sites. Suitable sites are sideration of spatial locatio11 across multiple
those that are near lakes. near roads. and not themes.
.I ____ ~
Fl;u,. 13..J: An cn,.npk of l'I c-ttrtopphic model. llic inodd idcmific~ mi1J1blc p:i.:tk tit·« b11scd on the
proxU11i1y to :ro:MU ;md Likes. 11,1~ 1ltc :ibsencc of wcll~ods.
566 GIS Fundamentals
meant by "too sleep.·· General or qualitative men ts must be made for each criterion. We
cri1eria may be provided and these mus, be mus, quantify o,e range and any relative
converted to spedfic. quantitative measures. preferences for southern aspects. relative
The conversion from a qualilative to quanti· soil suitabilities. whal defines a main road,
tative specification is often an iterative pro· and wba1 consiin11es short a.ud long dis·
cess. wi1h repealed in1eractio11 be1w<en 1he tances.
analyst developing and applying the carto- A second key consideration involves the
graphic model and the manager or decision· availability and qualiry of data. Do 1he
maker who will act on 1lle resultant informa- required data layers exist for the study area?
tion. Are the spatial accuracies. spatial resolution.
We will use a home-site selection exer- and attributes appropria1e for 1he imended
cise to demoos1ra1e this process. The prob- analysis? How will map ge1Jeralizations
lem consists of ranking sites by suitability affecl the analysis: for example. will inclu-
for home con.s1ruc1ion. The area 10 be ana· sions of diffcreu1 soil iypes in a soil polygon
lyzed has steep terrain and is in a seasonally lead 10 inappropriate results·? Is the mini·
cold climate. There are four criteria: mum mapping unil appropriate? If not 1ben
a) Slopes should no1 be too steep. S1eep slopes u,e requisite darn mus, be obtained or devel·
may substintially increase costs or may oped. or 1he goals and cartographic model
preclude consLruc1iou. modified.
b) Southern asJ)('cls are preforr«t. to enhance
solar warming.. Weightings and Rankings
c) Soils suit:tb le for on-site septic sys1ans nr.e
rtquircd. There is a range of soil types in
While some canogmpbic models are
the sn1dy area. with a range of suirn.bilities simple and restricti\'C. many more a1110-
for i:tp1ie SJ'Sltm insta1Ja1io11. grapbic mooels require the combination of
criteria that vary across a range of vaJues,
d) Sites should be far enough from a main road
to offer some pr~'.lcy. but not so far as to
and require an explicit ranki11g of the relative
be isolated. importance of different classes or iypes of
criteria. Asimple, restric.tiveexampJe might
These criteria must be converted to require us 10 idemify parcels greater than a
more specific rules prior to the development certain size and wi1hin acenaio dis1ance of
and application in a canographic model. Tbe water. We may clearly identify areas that
decision.maker must specify what sort of mee1 Uiese desired condi1ioas.
classification is required. Is a simple binary
classification needed. with suitable and A much more common ,class of prob-
uosuitable classes. or is a broader range of lems requires us 10 integrate multiple criteria
classes needed? lf a range of classes is sped· that are qualitatively differe.m. For example.
fied. is an ordinal ranking acceptable. or is site suitability for haumkms waste storage.
an intervaVmrio scale preferred? These ques· depends ou a number of fac,ors. iocludin~
tious are 1ypica1ly answered via discussions distance 10 population centers. 1ranspona:
between 1he analyst and the decision-mak· tion, geology. and aquifer depth and type.
ers. Each criterion can tben be defined ouce We must rate si1es across a range of values
the l)pe and measurement scaJe of the for all of these ,•ariables. Ooce criteria are
results are specified. II may be fairly simple precisely defined. we must obtain appropri·
10 establish the local slope limit 1ha1 prohib· ate data. develop a flowchart or plan for our
its construction. For example, conversalions analysis. and address the more difficuli prob-
with local building expens may ide111ify 30 lem of assigning rankings \\fithin each crite-
degrees as a threshold beyond which con· ria, and assigning the relative weigh1ings
stmction is infeasible. Further work is among criteria. Note 1ba1 in 1he following
required 10 quantify how Jess-steep slopes discussions we use 1be word "rankings''
affect constmction costs. Similar refine· when describing the assignment of relative
568 GIS Fundamentals
values within the same layer, such as how value of l. Before we can assig11 a value to
we rank a sandy soil vs. a si11y soil in a soils any site. we must first oblau1 (or make)
layer. We use 1be word ·~veiglnings" wben floodplain nmps and intelJ)ret the codes in
assigning the relative values or different lay- the maps to deline.ate the most flood-prone
ers. for example. l1ow we weight the values areas. This process allows us 10 rank areas
in an elevation layer vs. 1he values in a based on the likeHbood of flooding.
landuse layer. Keep in mind that weightings We mus1 explici1ly fonnali.ze our rank·
for "•'Y differen1 cri1eria are made compara· ing for each layer used to represent a crite-
ble by considering how well they satisfy the rion. One early decision is whether ranks
goal of the analysis. should be discrete or continuous (Figure l 3·
S). Rankings are discrete when inpm data are
Rankings Within Criteria interpreted such that the criterion data layer
is a map of discrete values. Soils are either
Each criterion in our cartographic model good or bad for cons1ruc1io1t slopes either
is usually express.ed by a data layer. or "cri- 100 steep or acceptable. and the final map
terion layer." Each criterion layer is a spatial defines two or more discrete cla·sses: for
representation of :some constraint or selec- example. sites are categorized as either sui1·
1ion condition: for example. the criterion we able or unsuitable. Ranks are continuous
build outside a aoodplain may consist of a when they vary along a scale: for example.,
set of numbers in a layer identifying nood- soils may be rated from I 10 l00 for con-
plain loca1ions. Floodplain sites may be struction suitability.
assigned a value ofO. and upland sites a
.,.
r production < 66 Own
rank vol~ • O
ror,k vo!uc • 1
Discrete Ronk Loyer
2'4t.810
ro~k voli>C
¢()nfir'IU()U$ rank r'nOy be
oss.1nea bo,tQ on progucrton.
votues ore scol-ed linearly
be-:ween ,~ lowest end
1ii9~.ut to os:s1igl"I o roiik (r()(l'I o ,o JO
Fiau"" 13.5: luokiilp within n layer an: of1e11 discrete (IQJ> right) or c01uinuous (bou.om).
Chapter 13: Spatial Models and Modeling 569
Figure 13-5, lop riglll, shows die assign- nation of mathematical and logical fu.oc1ions
ment of discrete ranking ofland productivity may be represented. 10 refloc1 increasing.
based on values in a soil layer. The source decreasing. or complex relationships.
layer in the lop lefl of dte figure is analyzed. We should have some jus1ifica1ioo for
If the expected production for a given soil
polygon is less than 66. then the outp\11 rank-
allop1ing a s~ific curve when cs1ablisbing
relative ranks within a layer. Forexample,
ing is sel to 0. If lhe production is gre.ater we may wish co represent the mercury baz.
than or equal 10 66, lheu tbe ou1pu1 rank is ard base<! on me1nyl mercury concen11'8tions
set 10 I. A range of inp\11 values bas been in water supplies across a state for detemtin·
placed into two discrete classes. illustrmed ing locations to drill wells for drinking
as dte discrete rank layer in die top-right part water. Tilere may be a broad range of low
of Figure 13·5. mercury concentrations for which there are
Discrete :rankings a.re most often used no or few negative health impacts. However,
when there are clear. discrete classes to be as a 1hresbold concentration is reache<I. there
represented in criteria. A disease may be n1ay be a rapid upturn into a very steep
present or absent. forest stands coniferous or curve. where the risk of severe damage is
deciduous. or a block invemoried or not The great (Figure 13-6). The shapes oflhese
v.alues 10 be represented are discrete catego- curves should be established through sers of
nes. epidemiologicaJ studies. in which mercury
I.n contras1. we may apply criteria as concentration in human blood or tissue was
continuous rankings within a cartographic related 10 drinking water. an.d health impacts
model. These continuous rankings provide a were recorded for thousands of people at
range ofvalu,es 10 characterize a suitability various levels of mercury exposure.
or restric.tion.. and they result in a set of
incrtrt!tftlally va;ying ranks. Ranks (ot
scores) typically range o,•er a real or large
integer interval. for example. from Oto I or
0 10 1.000. Highest suitability is usually
assigned 10 the highest rank. and lowest 10
the bouom.
The bonom right of Figure 13-5 shows •'
•
a continuous ranking over a range ofO to 10. '
A high value of 10 is spec.ified for the most •'
productive soils. and a low value of Ofor the '
'
least producth'e. We may use production
' ''
data gathered, over a set of soil types, and a '
map of soil types 10 assign the relative value ' ''
of each soil. We could scale produc1ion from ,,,,''
the lowest to the highest observed over the
range of O10 10. and in so doing create a
layer that represents a soil productivity cri1e-
__... _-~- ......-- -- /
a)
Roods layer Rood suitability layer
(
Distance to roods layer
10
?; ••
I::
--
02
soo 1.000 1 500 zoob
distance to rood
b) Eleva1,on layer
slope suitability layer
( • SUl10blt!Y
''"'''
0
~
=
.5
0
..••
10
)
i ••
••
..
- to 30
slope
Flautt tJ.7: Enntplcs o( two contiu\1oos score laym to rcprcsau criteria i.n a cartopaphic model.
•• ,0
'°
Pand 0, top. sJ1ows the dunl dit1t:1ncc road cri1nui n1,p1icd 11Sint n -.u i1abili1y func11011MTllis re,uh.t in
;i cootluoou,5. nu1gc of 5liitnbility i.corcs between O and I, ln ~ncl b. a. diffcrcnl r,uitabili1y function is
applied to the: $lope la>·cr. ~uhioi in a continuous $ti.itabi.licy score for lhe slope criterion.
572 GIS Fundamentals
Su1tcbllrty toyer
stiff·
Crif er100 8 foyer WA 2· \'19 · 4 = Obility
layer
value
5 l 3
the values for criterion layer A and criterion from most important 10 least. We may !hen
layer B vary over an approximately equal calculate 1he relative weights according 10:
range. Ifone layer had a range from l .000 to
5.000 and the olher had values of I to 5. lhen n - r1 • 1
this would affect -ihe combination and final w, = - - -'----- -
n
(IJ.1)
Weight
Numerator
(n·r, · l)
(n·r
•·y) l:<n
'
Critmon Rank •••
dtstance ro 2 3/10 • 0.3
nearest competitor
•••
Fl.a:'Urt- 13.9 : Al\ c:<:intplt of one method for cMculati11~ rcl:ui\'e weights for e:ieh offo,11 cri1erill
.iiccording t oQ cqootiou (13. J).
least important the lowest value. The suggested readings section 21'1 the end of this
denominator .is calculated by swnmation. chapter.
an<! then the individual weights calculated,
as shown in the right most column. These
weights may tlien be used 10 combine the Cartographic Models,: A Detailed
data from the various criteria. Example
There are many other metl,ods for defm- Here we provide a detailed description
ing the values for each criterion layer and the of the steps involved in specifying and
relative weightings among layers. These applying a cartographic model. We use a
include methods that auempt to ensure con~ refinement of the general criteria for home-
sistency among weights. but lhey are beyond si1e selection desc-ribed in 11Je previous sec-
the scope of this imroductory text You may 1ion. These general criteria are listed on tl1e
find more detailed descriptions in the excel· left side au<I the refined criteria are shown on
lem book by Malczewski (1999) listed in the the right side of Table 13-1. The refinedcri-
Table 13-1 Original aud ~fined criteria for ear1ograpbjc model example.
Far enough from road to pro- 300 meter< distance to road < 2,000 meters
vide privacy. but no! isolated
574 GIS Fundamentals
teria may have been defined afrer further dis· detail in Figure 13-1 l. This and subsequen1
cussion \\~th the decision-makers. local area detailed figures show a thumbn•il of the s~-
expens. aJ.ld a review of available data and tial data layers at each step in the process.
methods. Data layer names are adjarenl to the spatial
data layer. The first two criteria involve ter-
NOie 1ba1we adop1 the simple.11 weigllt-
ing and ranking scheme in applying the cri- min-related constraints. Suitable sites are
teria in Table 13-il. All criteria are equally re.stric!ed to a sel of slopes and aspects.
weighted. and au criteria are binary- land These cri1eria require slope and aspect data
is categorized as unsuitable or suitable based layers. 10 be calculated and 1hen. classified
on each criterion. A location must pass all into areas that do and do no! meet the respec-
criteria 10 be suitable. and the final rating is tive crit·eria. The elevation data layer is
suitable or unsuitable. shown at the 1op of Figure 13- l 1: low eleva-
1ions in black through hjgher elevations in
In our examp le. we will apply the cano- lighter sl1ades. There are 1wo maiin river sys.
grapbic model described by the Oowchart in ,ems in lhe srndy area. one rnnning from
Figure 13-4 to a s..mall watershed in a moun· west to eas1 in the northern ponion of the
tainous smdy area. Application of the study area. and one running from south to
refined criteria requires 1lU'ee base data lay- nonh. Highland areas are found along 11te
ers - elevation. soils. and roads. For this nonh. west. and easi margins of the snidy
example we assume rhe three data layers are area.
available a, the required positional and a11ri-
bu1e accuracy, and clipped to the study area. Slope and aspect are derive(! from the
The need for ne,v data layers often becomes elevation dam Jayer(Figure 13-11). Lower
apparen1 during tbe process of tra11sla1ing the slope values are shown in light shades.
initial, general criteria to specific, refined higher slope values are sbowu in dark
criteria. or during lhe developiilolll ofllie shades, and aspe(ts are shown in a range of
flowcban describing the cartographic model. light 10 dark shades from O10 360 degrees.
Once data availability and quality have been Slope and aspect layers are reclassified
assured. we can c,omplete the final flow. based on the threshold values specified in
chart. the criteria listed in Table 13· l. A reclassifi·
cation table is used 10 assign vatues 10 the
Figure 13·10 contains a flowchart of a slcpe:_.suiT variable based on the slope layer.
canographic model lluu may identify suit- Cells with a stope_"'l less than 30 are
able sites. Spa1ial data layers are shown as assigned a slope_sult of l. while cells with a
rectangles. and a descriptive data layer name slope.."'! of30 or higher are giv,eo a value
is included within the rectangle. Spatial slope_sui1 of 0. Aspect values are also reclas·
operations or flmctions are contained in sified using a table.
ellipses. and arrows define the sequence of
da1a layers and spatial operations. The three Slope and aspect layers are combined in
base data layers (elevation. soils. and roads) an overlay. convened from raste-r to vector.
are shown at the top of the flowchart. and reclassified 10 produce a suitable terrain
layer (Figure 13-12). Raster to vector con-
Tiiere are three main branches in the version is chosen because rwo oi the three
flowcban in Figure 13 -10. The leftmost base data layers are in a vector fom1a1. and
branch addresses the 1em1in-rela1ed criteria. bec-.ause future complex seleclio.ns might be
the cemer branch addresses the soils criteria. bener supponed by the attribute data struc·
and the right branch applies dte road dis- rure used for vector data sets. This convtr-
tance criteria. All lhree branches join in the sion ere.ates polygons that have the attributes
canographic model. producing a final suit- of lite input ras1erda1a layer. Note this con-
ability classification. version takes place after 1be rasrer layers
The spatial processing steps for the left have be-en reclassified into a small number
branch of the cancgmphic model is shown in of classes. and after the data have been com-
Chapter 13: Spatial Models and Modeling 575
E leva-
tion Roods
I
( t xtr<let ")
'
Slope Aspect Majo r
layer layer roods
Suitable
~
'
( te(IOSS ) soils <.._ buffer .J t buffer .)
Slof,e
'
Aspect
'
2 km 300 m
'
rec ass reclass buffer buffer
layer layer layer layer
~
- j_ ....
All All ~
sites criteria ~ - Suitable
access
Suit able
sites
Fta:u.1•t- 13· 10: FloweInn for di.c home site lui1ability canopapbic model. Three basic data layCfS arc
cmcrcd. A a,equcnce of spntilll opcrt11ioru is used 10 :lf)f)ly cri1cri11 and produce a map of suitable:
sire,.
576 GIS Fundamentals
.,,............ slope
rec loss
< 30°. via
Cl>.11 ID
I
-·,.
128'
0""' "111
I
tabI z 1'19 0
CdtO ~-"" .... I
...3 •••
... ...0
I 437 0 5?.COA 18A 3 I
2
3
...
'"
0.3
...
,,,,
I
I
... recloss
...... .·
..
B2.00.i 0 90<0Spect<270
via toble
Aspect recloss
layer
Fla:urt 13·11: Adccaikd dcpfo1ion of Ulc- leftmost branches of the (artopaDhic model sbo"11 in Fi1,un
13-·10. Slope ,iml iuipcct nn: dcri,•cd from :in clevn1i011 d.i.ta l:lvcr for tile miJ>, res:ion. Bo11, 1:1.ycn nn: tbc11
rccl11$$ificd using a table anignmeal. Slope ,·aJue, < J.O 2.re l°eel~is.ified as suitable (gray). all other slope$
as unsuitable (blaek). Aspect values bc-twccn 90 and 270 arc r«lusi.ficdas , uitablc (sra)'), fill ~hers a,
,i.nmi111blc (bfad:).
Chapter 13: Spatial Models and Modeling 577
bined to a single layer in an overlay. Raster- 0, indica1ing they are unsuitable home sites
to-ve<:tor conve~ion proceeds more quickly based on the slope and/or aspect criteria.
after tbe munber of raster classes has been Because we wish to re<h1ce the number
reduced and the data combined in a single of redu11da111 polygons where possible. a dis-
terrain-suitab iLity layer. solve is applied after the reclassification.
The terrain overlay must then be reclas- This substantially reduces lbe size of the out-
sified 10 identify those areas that meet both put data set. and speeds funire processing,
the slope and the aspect criteria (see the ter- Reclassified. dissolved terra.in data are saved
rain suitability coding in Figure 13-12). in a layer labeled Suffoble Terroin (Figure
Those polygons witlt a I for both slope..sun 13-12. center-right).
and osp..,suit are assigned a ,•alue of I for The central branch of the cartographic
terroin_suil. All Olhers are given a value of model in Figure 13-1 0 is shown in Figure
overlay
Terrain overlay
layer
roster to reclass
vector (bytoble) dissolve
,............ i
.............,/
SIOD<LS\Jil oso_su11 terrom_sult
0 0 0 The terrain suitability code 1s
l 0
assigned based on this recloss
0 table. Only thog~/olygons with
l 0 0 suitable slope aspect ore
l l l oss,gned a terroin..suit value of I.
Flgu rf' l ,.l..12: 11ic recoded slope :md :Uf>O:t d11.1.1 faycrs 11rc combined in 11.11<>Ycrlt1y opcm1ion.
ll[ld the result rec:lau ified. Suit!ltblc lemnn is shown io gray. uusui111blc in bl~ck.
578 GIS Fundamentals
13-13. Digital soil surveys are available that 1he soil type is suitable for septic systems, 0
depict bomogeneoos soil units as polygons. iftlte soil type is not (Figure 13-13).
Attribute data are anacbed 10 each polygon. After reclassification. there may be
including soil type.and soil suitability for many adjacent soil polygons with the same
septic systems. Soils data for the study area soiU11it value. TI1ese are grooped using a
may be reclassified based ou these septic dissolve operation (data between reel ass and
suitability attribunes. A reclassification table dissolve are aot shown in tl1e figure: see
assigns a value of I to tbe variable soiLsuit if Chapter 9 for an eXllmple). The dissolve
removes boundaries between like polygons.
Suitable terrain
& suitable soils
thereby subst.antially reducing the numberof MOJOI" roods extracted. This has the effect of
polygons and hence the number of entries in removing all minor roads from consideration
the anribme table. This may be particularly in further analyses. What co:nstinues a major
imponant with complex data se.1s such as road has been defined prior to this step. In
soils da1a. or with converted raster data. as this case. all divide<I and multi-lane roads in
these often have thousands oi emries. many 1he study area were selected. l\vo buffers are
of which will be combined after lhedissolve. applied, one at a 300 m disraoce and one al a
The right branch of the cartographic 2 km dis1ance from major roods. These buf·
model in Figare 13- 10 is presented in Figure fers are 1hen overlaid. Beca11Se 1he buffer
13-14. The Roods da1a layer is ob1ained and regions extend outside the study area. the
Roods
buffer
Prellm
buffers
Rood
buffers
Fi&Ul't' lJ..14: A detail«! reprcuntation of the riJU1t bnu1<h of the canop,ipbic model sbo,m io Figure
13-10. Road& a.re buffttcd at 300 meter, aiKi 2 lo1orueters. and llmc buffers O\'nllliu. Titc buffen arc
clipped 101hc $nidy rcgion, tuld il1iL,ble nl'cai more th2n 300 mclttS and leu duul 21.::ilomclcri fru.n
iwd$ identified.
580 GIS Fundamentals
buffers muSI be clipped 10 the bom1d.1ry of layercontains the required spati:al data to
the smdy area. These data are then reclassi- ide111ify suitable vs. misuitable sites. This
fied into suitable :and unsuitable areas. overlay layer must be reclassified bosed on
resulting in the Ro<1d buffel"S layer (lower the road, soil, and terrain suitabUlity vari·
left, Figure 13-14 ). ables. classifying all potential sites into a
All data layers are combined in a final suitable or unsuitable class. A final dissolve
set of overlays and reclassifications (Figure yields the fos1I digital data layer, Suitoble
13-15). The Suitoble access layer. derived sites.
from the roads data and criteria, is combined This example analysis. while simple and
wi1h the Terrain & soils layer. The All criteria limited in scope. illustrates both the flexibil·
Rood
Terrain & soils buffers
overlay
u u
I u
0 l
I I 0
l 0 0
• unsuttoble
I I 0 Suitable
l u u O suitoble
sites
! ' I I
Flam·t 13. t S: The o,•a fay ru)d tccbuific:uio110(1bc combined <lllL'l la)'m.
Tcinin. &Oils, n.ud r,o:id t).1ffcr-d.11~ :u-c c:ornbfocd iu n.u o,·crl:l)'. These <Ui.., lkl'C rcc:laued bas.cd <n1 1bt suit·
ability criteria. A fuw dissolve is applied to reduce the numlxr ofpolygous. resulting io a final Ulyer of
suitabk sites..
Chapter 13: Spatial Models and Modeling 581
ity an<I complexity of spatial data analysis Graphical Modeler, and the GRASS wxGUJ.
using cartographic models. The cartographic These modeling tools can create a chain of
model is simple because only three input operations that may be saved. re·mo. shared.
spatial data layers were required, and a snmll or applied \\~th differe nl inpul data or
set of spatial data operations were used. parameters. The models may be viewed as a
Most real analyses use many more data recipe (or a set o( spatial tools. applied 10
inputs. but the basics are similar. Reclass. data, with outpul from operations used as
overlay. and other operations were used inpul in subsequent operations.
repeatedly. The modeLing is flexible because The programin_g enviroruneots often
spatial operations may be tailored to the allow complex flows. including looping
problem. Finally. this example illustrates the through varioos iterations of data or para me·
complexity that can be included in cano· ters.. and branc.hing based or tem1ination
graphic models. as over 20 different based on conditions. Sc.ripts are ofl:en quite
instances of a, spatial operation were applied, helpful in both saving a cartographic model
in a defined sequence. resulting in at least 15 so that it may be repeated with new da1a. and
imenuediate data layers as well as 1he final for documenting the steps applied in an anal·
result layer. ysis. Scripts are particularly· helpful when
processing must be applied 1epetitively to
Scripting and Models different inslances of the same type of data,
e.g.. 10 re·project then re.code hundreds of
Many softwares pro,1de scripting or raster data tiles. This would be tedious 10
programing ~nviromnents to specify a complete with a s1andard gr.aphical use.r
sequence of spatial operations (Figure 13· interface. but m.,y take only minutes of user
16). Examples include ArcGIS Model time when incorporated into a script
Builder. the QGIS Batch Modeler and
flg:u1·f' t3..t6: Ane:utnplc VolJ>bic model dc,:c:lopcd wi1b the. wxGUI (ot GRASS. R«tlllljl<s tq>tc·
3c:nt modules or opcr.uions., ellipses. tqn<Sc:l\t cb1.i1, aod 11,1T()\\'$ sltow ,.,.X'lrf,:- flow (c01tt1:csy OSGEO).
582 GIS Fundamentals
infommlion has been developed for the ilar across county-sized areas spanning tens
entire country by the U.S. Natural Resources ofkilome1ers. other factors <1fte11 change on
Conservation. Service. including soil and cli- a field or subfield basis. Applica1ions often
mate factors for the United States., and the differ in the melhods for estimating the man-
impacts or common crop types and manage- ageruem and crop factors. and in panicular a
ment regimes. The USLE and RUSLE have combination of slope lengdt and steepness
been widely applied in other countries. factors.
The RUSLE has l>een widely applied Estimating driving variables across
within a GIS framework for erosion esti- space often presems cboice.s:., as illustrated in
mates on a catchment or larger st.ales. The the calculation of llie RUSLE slope factors,
model is relatively simple. much of the input Land S (Figure 13-17). Simple spatial mod-
data have been developed and are publicly els are often based on small area studies for
available. and the ou1pu1s are or broad inter- which all variables may be easily measured.
est Methods for applying the model have This is often 001 true when applying the
varied. in pan because the model was devel- models 10 larger areas. Slope steepness (S
oped for individual fie.Ids, but spatial data fac,tor) is easily estimaled within a raster
are often 001 :available on a per-field basis. framework. bm slope lengll1 (L) is consid-
While 1be rainfall factor, R. is generally sim- ered uniforu1 at a fixed Ieng.Iii or22.1 m
OEM
Slope
LS
I
One of various meosure-
ment..derived functions is
opplied. lncorporoting
b<lth slope ongte ond
slope length. to
colculote the LS foctor.
f'iaur~ U · l 7: Su.ling from plot, lo l.aodscnpc, req,a.ire, e,tiu..,1io_g all i.npol d.,111 IK'r'O$S space, For tbe
USLE/RUSLE. se,·.:ral methods Jui,·e been dc,·clopcd for «tiruanng tbe LS factor. gro«alJy based ou a
combio:u:ion o:f , lope: nnd dope k11gtJ1 dtriYed from DEM <Mita..
584 GIS Fundamentals
(72.6 ft) in the srandard RUSLE. Applirntion Remaining K. C, and P factors may be
of lhe RUSLE 10 co1weigen1 or divergent derived from standard spatial data sets. for
slopes or to lengtllS or cell sizes different example. NASS or NLCD data for land-
than the standard may result in prediction cover/crop type and treatment, and K factors
errors. Tllis challenge bas been the focus of from SSURGO data (Figure 13-18). Appli·
many smdies, anc! the book chapter by Wil- cation of the model to the spalial data. here
son and Lorang, listed in the references, in a celJ.by..cell multiplication. yields estj.
describes some of the methods used to effec- mates of erosion across a regio«L
tively estimate a combination ofL and S.
Erosion = R K C P L S
I) R IS COtlSfOl'IT for fhe ST!Ay ateO, 'MTIM F,!more Counry, and set (It 15-5. rrom NRCS llt~ tute
J.) IC Is ck-riwd from SSURCO soils doto, comolned., 1he horizon table:. K voful?-s ore cxtrocttd
tor lht surtoct h0r1::ior,.
roblc Join.
~=·
then -
--
4) C \/Oiues ore ossl9ned 'm o n:dos11Reo1lon. boscd on ~CS robles ptr crop tyi,e
NASS
recJoss
- bosedon
~S10b4e -
5) LS VOlues ore ~o:c:d 109efher. occord1ng to Mc;Cool e1 ct. l987. J989
P R -------
-
flJurt 13...18: USLE/RUSLE cimion cstim111cs m11y be cakufatcd froin appropriately developed Nsc
Chapter 13: Spatial Models and Modeling 585
Spatio-Temporal Models
Spatio-te.mporal models bave been . . more spatial locations. The , alucs are often
1
developed a!Xl applied in a number of d1sct- saved in a spatial dat.a fom1at. such as a layer
plines. This is an active area of both research in a GIS.
and application. as tbere arc man_y fields of Spatio-temporal models involve at least
study and manageOlent that requ!re anal~is a three.dimensional representation of one or
and predictions of spatially and tune varymg more key attributes - variation in planar (X·
phenomena. \Ve will briefly discuss some Y) space and through rime. A fourth dimen-
basic cbaracterisrics of spatio-temporal mod- sion may be added if the vertical (Z) direc-
els. \Ve will then describe their differences tioo is also modeled. We typically treat
from other models. discuss some basic anal- spatially variable oerwork aualyses ~pa·
ysis approaches. and describe two examples rately, because networks are constramed lo a
of spatio-temporal models. subset of two-<limensional space.
Spatio-temporal models 11se s~atially Spatio-temporal models may also be
explicit inpurs to calculate orpredict spa- classified by a number of other criteria:
tially explicit outputs (Figure 13-19). Rules. wl1ether they treat continuous fields or dis·
functions. or some other processes are con1inuous objects, if 1hey are process based
applied using spatial and often nonspatial or rely on purely fit models. and if they are
data. Input ,,ariables such a.s elevau_on, vege- stochastic ordetemlinisrk. Combinations of
tation type. human populauon densuy. or these model characterisiics lead 10 a broad
rainfall may be used as inputs 10 one or more array of spatio-temporal model types.
mathematical <'<luations. These equations are
then used to calculate a value for one or
Spatial and
temporally dependent
functions
Spatial output
Fia,urt JJ.J.9; Spalio-t,:ruponil models toml>iuo: s.patiaJ .1llld HpatWdala \\ilh time-,wiant fuurtious to
prodUCC$pUiaJ OUtp:11.
586 GIS Fundamentals
LI
---
-
I
6.3
I.I
L2-
L3-
L4 L4
time zero. t 0 time one. t 1
Fit:1u-t 13..20: T'mlc \'lll)-inJ «11-bascd models use one 10 multiple iuptll cell ntlucs from II s1aniu; time. t0.
and update cc~I ,11luc.t {or some la~r(s) at 1:inlc t 1• T11c proceu is rq>ctuc:d (01 tl specified lcugtli oftime <w-
uu1il some specific outpu• condition is rca~hed.
588 GIS Fundamentals
Example 1: Process-Based
Hydrologlc Models
Water Oows downhill This simple
knowledge was perhaps sufficient until
hurnans began 10 build houses and roads,
and populations grew to dominate most of
1be Earth's land stirface. Land scarciry has
led humans 10 build in low-lying areas. and .,
fanni ng, wetland drainage. and upstream ;',
development have all con1ribu1ed 10 more •• •
frequen1 and se,>t>re flooding. ;.I.
Wa1er models are needed because
demand for wa1er resources exceed the nani-
ral supply u, many pans of lhe world. Popu-
lation pressures h.ave drh1en fanns. citie-s.
and other human <1evelopmen1s u110 flood.
prone areas: these same developments have
increased 1be speed and amounl of rainfall
runoff. thereby increasu1g flood frequency
and severity. These fac1ors are spurring the
developmen1 of spatio-temporal bydrologic
models. The models are often used 10 es1i-
mate stream wate.r levels. such that we may Fl:ul'(' 13-11: An o:runple of a hydn:,;raphic
oetwork. Lakes aud1or n\'cn fomt an uttcroon-
better manage wa.ter resources. and avoid o«ted network. W11.tcr mt1y be routoc<l front
loss of property or life due 10 flooding. upl11nd WC-M to t.od through 1hi, oc-twork.,
Chapter 13: Spatial Models and Modeling 589
(Chapter 12). Slope. aspect, and other fac- used. flow direclion is defin.ed between adja-
tors that affect hydrologic systems may be ce111 or connected vec1or elements.
derived from DEMs. For example. evapora- Many hydrologic models represent
tion ofsurface water and transpiration ofsoil water Dow through raster grid cells (Figure
water depend on the ruuo,wt of solar rndia·
Il -22), Wa1er falls on each cell via pr~ipita·
tion. Site solar radia,ion depends on the lion. Precipitation eilher infiltra1es into the
slope and aspect at each point. and in moun. soil or Oows aeross lhe surface. depending
1ainous terrain it may also depend on Sur· on the surface pcnneability at the cell. For
rounding eleva1ions. due to_ shading. Site- example. tinle water infihrates for most
speciftc varia.bles repr~ntm_g slope and . human-made surfaces, such as parking lots
aspect are used when esumatmg evaporauon or buildings. These sites have low pem1ea-
or plam use of waier. biliry. so most precipitation becomes surface
Slope and aspect rue often used to define flow. Conversely. neruly all precipitation
an imponam spatial data layer in hydrologic infiltrates imo most fores, soils.
modeling - now direction. This layer Downslope water flow is also calcu·
defines the direction of water Oow at import· la1ed in the model, depending on a number
am poi ms on lhe s,,rface. If a raster da1a of factors at each cell. Slope and flow direc-
stmcn1re is used. flow direction is cakulated tion determine the rate at which water flows
for every celL If a vector data structure is downhill. Downslope Oow evennially
\ y
I'\ II
J
I
/
/
I\
\ / / I
I\ \ wo er.shed
I I b undory
\
1.---,~sler
cell
Stream discharge
• g (ter,o,r,, SOtl. prec P..)
590 GIS Fundamentals
reaches the hydro logic network and is routed lyze lhe relative importance of component
via the ne1wotk 10 the outlet Mathematical inputs and processes. and the nwmre and
functions describiJJg cell-specific precipita- variability of system response. ls it stochas-
tion. flows. and discharge may be.combined tic variation in wind. fuel amou1Jt, or fuel
10 predict the Dow quan1i1y and water level type that is most responsible for the variable
at points in die watershed and dirough 1be namre of fire spread? We will discuss one
nelwork. spatial stochastic model - LANDIS
Spa1io-1emporal hydrologic models (LANdscape D!Sturbance) - ibat utCOIJlO-
often requfre substantial data development ratcs techniques used in a wide range of
Elevation. surface and subsurface permea- models (Figure 13-23).
bility. vegetation. and stream neiwork loca- Forest vegetation changes through time.
tion must be developed prior to the Chruige may be caused by the oamral aging
application of many hydrologic models. and death of a group of trees. replacemenl by
DEM data may require substamial extra edit- other species. or due 10 periodic disturbances
ing because terrai.o largely drives water such as ftre. windstomis. logging. insects. or
movement. For example. local sinks occur disease outbreaks. Because trees are long·
much more frequemly in DEMs than in real Jived organisms. the compositio11 and struc-
surfaces. occuring during data collection or ture of foresrs often change on temporal
during processing. Sinks are particuJarly scales exceeding a oom1al human life spa.IL
troublesome wben 1bey occur at the bonom Actions 1oday may subs1a01iaJ1y· aher the tra-
of a larger accum11la1ion area. Modeled jectory of funire change. aod S-O we need to
water may flow into the sink bul may not analyze how past actions have led 10 current
flow out depending on bow water accumu· forest conditions. and bow present actions
lation is modeled. while on the real eanh will aher future coodjtions.
st~face !he waler may flow fr«ly downhill. F0res1 dis!UfbMlte and thM,ge are
Local spikes ill the model may push water imponant s~tial phenomena for many rea-
incorrectly 10 surrounding cells. allhough sons. Humans are intere.sted in producing
they typically cause fewer problems tl1an wood and fiber, preserving rare species, pro-
sinks. Both sinks and spikes must be tectillg clean water suppLies and fish spawn-
removed prior to application of S-Ome hydro- ing areas, protecting lives and property from
logic models. wildfires. and enjoying forest-based recre-
a1ion.
Example 2: LANDIS, a Stochastic forest change is inherently a spatial
Model of Forest Change phenomenon. Fires. diseases. and other dis-
Many human and natural phenomena mrbances travel across space. The distribu-
are analyzed through spatially explicit sto- tion of a.men! forests largely affec-ts 1he
chastic models. Disease spread. the develop- location and species compositio11 of future
ment of past societies, animal mo,.:ement, forests. Seeds disperse through space. aided
fire spread. and a host of other imponant by wind and water or carried by organisms.
spatial phenomena have been modeled. All Physical and biotic characteristics that
these phenomena have a random element largely determine seed and seedling suf\i,•al.
that substantially affects their behavior. and subsequent forest growth, vary in space.
Events too obscu<e or complex 10 predict Some plants are bener adap1ed t<> grow
may canse large changes ut the system under existing forests. while others are aided
action or function. For example, wind speed by disturbances thal open the canopy. Some
or dryness on a given day dramatically species change soil or unders10,y conditions
affects fire spread, ye1 these phenomena are in ways that prevent other species fonn
notoriously difficuh to predict. Spatially growing beneath them. Plant succession. the
explicit. s1ochas1ic models allow us to ana· replacement of one group of pla.nts or spe-
cies by another tlu'ougb 1in1e. is substantially
Chapter 13: Spatial Models and Modeling 591
ESTASllSHMEN'T
SPROUTING mOOA.81LJT'Y
ABILITY (SITE) UNKAGES SHADE Tot.ERANCE
~
+ ~
SPECIES
ESTABLISHMENT
COMPETITIVE
SUCCESSION -
t t
LONGEVITY
SEEO
DISPERSAi.
PROSABIUTY
HARVESTING
FIRE Vl1NOTHROW
PR08ASllllY !lrr.T'<PE PROBASll.lTY
+ + + •
FIRE
+-- FUEL . WINDTHROW
MORTALITY ACCUMULATION MORTALITY
i t
S'TANt>AGE
t
SP£QES Sile Cl.ASS
SPECIES ANO AGE Cl.ASS SUSCEPTIBll.lTY (TOP DOWN)
SUSCEPTIBllJTY (BOTTOM UP)
Fi&urf' 13-U : The uu.jor procc.$k it rcprucu1cd in L.\NDIS, ll sp:uio-1ci:nport1l forest ~utceuion model
(cou11csy O. Mladenofl).
affected by the current forest dislribution been applied 10 forest biomes across the
and sllUcrure. 11 is 001 surprising 1ha1 many globe. LANOIS incorporates namral and
process-based models of forest change hwnan disturbances with models of seed dis-
incorporate spatial data. persal, plant es1ablislunen1 . .and succession
While forests often look like a cohesive through 1ime 10 predict forest composition
whole. they a.re extremely heterogeneous in over broad spatial scales and for its long
space. and th.is complicates our understand· temporal scales. LANDIS is notable for the
ing and predictions of forest change. Tree broad areas ii may treat at relatively high
species, size. age. soils. water availability. resolulion.. and long lemporal scales.
and other factors change subsiantially over LAi'J'OIS has been used 10 model forest
very short dis.taoces. Each forest s1and is dif. dynamics at a 30 m resolution. over tens of
ferenl. and we stmggle to represent these thousands of hectares. and across five centu-
differences. Given the long time scales. ries.
broad spatial scales. aod inherent spatial LANDIS integrates information about
variability of forests, many organizations forest disturbance and succession 10 predict
have developed spatial Ulodels integrated 10 changes in forest colllposi1ion (Figure 13·
some extent with GIS. 23). As noted above. succession is the
LANDIS. is an example or a spatially replacement of species through time. Sue·
explicit. process·based forest dynamics cession is colllnton u, forests. for e.<ample.
model. LANOIS has been developed by Or. when fast·growing. light·demanding tree
David Mlade11otr and colleagues. and bas species colonize a disturbed! site, and are in
tum replaced by more shade-1oleram. slower
592 GIS Fundamentals
growing species. These shade-tolerant spe- LANDIS uses a raster data model 1ha1
cies may be self-replacing in 1ha1 dieir seeds eases 1he entry of classified sa1ellite imag-
germinate and seedlings survive and grow in ery. eleva1ion. and other data sets reflecting
lhe dense shade. Small gaps from canopy short.range.environmental and forest species
damage or uee de<11bs allow small pa,ches of variation. l111erac1ions such as S<!<!d dispersal.
light to reach d,ese shade-tolerant seedlings. competitions. and fire spread are explicitly
enabling them to :reach the upper canopy. modeled for each species occupying each
This self replacemen, can res uh in a siable, grid cell.
same-species stand over long time periods. LANDIS tracks the presence of age
This cycle may be broken due to fire, wi:nd- classes (termed cohorts) for a nu.mber of spe-
throw, logging. or other disturbance evtnrs cies in each cell and through 1ime. The
that open 11p a stand to a broader range of model begins with an initial condition: the
species. LANDIS simulates large. he1eroge- distribution of species by age class across
neous landscapes. incorporates the interac· the Ja,ldscape. Ten-year age classes arc cur-
lions ofdominant ,ree species. and includes remly represen1ed. n ,e longevity. age of ini-
spatially explicit representalions of ecologi- 1.ial seed production. seed dispersal distance.
cal interactions. shade 1olerance. tire tolerance. and abiUty 10
sprout from damaged stumps or roots is
LANDIS Design Elements recorded for each spec.ies. On undisturbed
sites. cohorts age umil they reach 11teir lon-
The design of LANDIS is driven by the gevity. at which poim diese older cohorts
overall objectives for tile model. simulating "die·• and disappear from the cell. Younger
forest disturbance and succession through cohorts may 1hen appear. depending on the
lime. LANDIS also satisfies a number of avai~1bility of seed.
other requiremenus. LANDIS readily inte-
grates s.1tellite data sets and other appropri- The sparially e.xplitir represeurarion of
ate spatial data. and it simulates the basic seed sources and dispersal is an improve-
processes of disroroance. s1and develop- ment of LANDIS over many earlier forest
ment. seed dispersal. and succession in a succession models. Previous models 1ypi·
spatially explicil manner. cally assumed constant or random seed
avaifability. LANDIS is representative of
LANDIS is an object-orienled model. spatially explicit models in that ·tl,e specific
Specific fean1res or processes are encapsu· locations of a process affect 1bat process.
la1ed in objects, and object-iniemal pro- Disnut>ed siies may be occupied by seed-
cesses are isolated as much as possible from lings from a disiurbed cell or nearby cells. or
other portions of the model. As an example, by sprou1ing from 1rees in a cell prior 10 dis-
1here is a SPECIE objec, 1ha1 encapsulates nirbance. Cells cycle through 1be species
most of 1he imJ)0'1aot infonnation and pro- establis.hment, succession. disturbance, and
cesses for each tree species included in the mortality processes (Figure 13-24).
model. E.1eh ins1ance of a SPECIE bas a
name. for e.xample. "Aspen." and 01her cbar- Theeffeclsof such site characteristics as
ac.teristics such as longevity. shade toter. soil and topography on species es,ablish-
ance, or age 10 moturity, as well as melhods mem and interactions are also represented in
for birth. death. aald other ac1ions or charac- LANDIS. For example. establishment coef.
teristics. Because these characteristics and ficients are used 10 represtnt the interaction
processes are encapsula1ed in a SPECIE between site characteristics and species
object. 1hey may be easily changed as new establishment. These coefficients vary by
knowledge become m•ailable. Many models land type. Fire severi1y also varies by land
are incorporating 'Ibis object-oriented design, type. as may seedling survival. Eleva1ion.
because ii simplifies mainienaoce and modi· aspect, soils. and 01her spatial da1a are used
fications. as inpul to 1he spatial model.
Chapter 13: Spatial Models and Modeling 593
output tp«I~
cbssiJicar.on
""'I"'
.i;-
•• 1/ / L7'-""----1/ /
• • ....
FLre and wmd d1srurt>ances are sunu- developed. and many more are currently
lated based on historical records of distur- under development. As spatial dala collec.
bance sizes. freque ncies, and severities. tion teclrnologies improve and GIS systems
Dis11~bances va,y in 1hese properiies across become more powerful. spa·tio-temporal
the landscape. For example. wind distur· models are becoming standard tools in geo-
bances may be more frequeni and severe ou graphic science, planning, and it, resource
exposed ridges. and fires less frequent. less management.
intense. and sm.aller in wetlands. Distur.
bances are s1ocbaS1.ically generated. but a,eir
variability depends on landscape variables. Summary
For example, fires are generated more fre- Spatial analysis often involves the
quently on dry upland sites. development and use of spatial models.
LANDIS has beeu applied 10 a omuber These models can help us understand bow
of forest science and management problems. phenomena or systems change dtrough space
including the effects of climate change on and time. and they may be used to solve
forest composition mid producrio11 1he problems. lo tl1is chapter we described carto-
impacts of c.hanging hanresting regimes on graphic models. simple sparial models. and
landscape )Xlctems. and regional forest spatio-temporal models.
assessments ( Figure 13-25). Cariograpltic models often combine sev-
Rundreds of olher spatially explicit. eral data layers that represeot criteria and
temporally d)'llamic models have been constraints related 10 estabnsrung the suit-
594 GIS Fundamentals
v... o
i" ....
It
1.
,. .: ',
!
j
j
Voar 360 ~¥,}.~w,1.0IJ , •• i'J.YG!'4'J/J "i'?:VNt'JIJO Pro,.~r15C),.._
" ~ • !" , .. ~.. ' ..
-
--
Y!Ntepin&
Oltlcr forests
--25
A 50KM
fia:urr 13-25: Cb.ang" in the spatial clisuibu1ion of while pine. a forest tree s-pc<"ies, through timc u pre-
dicted by LANDIS. This ~plue e:1:ccnplifics the prcdietioo of a (e11nare ofinterest both spatia.lJy aud tem-
porally, and is rq>rekll.laith•c of !UIIU)' tulll1ytical tools fo uie or midcr de\•elopntei:u.
ability oflocations for a given use. Da1a lay- through lime within the model. The.se mod-
ers are combined through the application of els are typically more detailed. and less flex-
a sequence of spatial operations, including ible than canographic models. io part
overlay. reclassification. and buffering. The because spatio-temporal models often
cartographic model may be specified with a include some representation of process. For
flowchart. a diagra,n representing the data example. many spatio-temporal models have
layers and sequence of spatial operations. been developed to model the. flow of water
Canographic models are static in time rela- through a region, and these models incorpo·
tive to the other model types. rate equations regarding the physics of water
Simple spatial models typically apply a traospon movement.
set of equations 10 variables expressed as AS more people use GIS. they gain
simple scalars and spatial layers. These ac.ce-ss to spatial models run with some com-
models are often expressed as a set equa- bination of GIS, programing languages, and
tions. These equa1ions estimated from data other arialytical tools. Many spa·tial models
at a set of obseivations at single points or ot~y use spatial data developed by GIS or
across sub areas, and then usually applied use GIS for \rjsualization, bu! a gro\\rjng
across brooder geographic (or different) number are either integrated wi(h GJS or
areas. being done emirely in GIS as the techuology
Spatio-temporal models explicitly repre- improves.
sent the changes ln imponam phenomena
Chapter 13: Spatial Models and Modeling 595
Suggested Reading
Anselin. L., Syabri, I., Kho, Y. (2006). GeoOA: an introduction 10 spatial dala analy-
sis. GeograpMcal A11alysis. 38:S-22.
Brady, M., Irwin. E.(20 11 ). Accounling for spacial effects in economic models ofland
use: rece01 developments and dlalleages ahead. Em•i1"011meutol and Resource
Eco110111ics. 48:487-509.
Brown. 0 .. Riolog. R.. Robinson. O.G.. North, M .. Rand. w. (2005). Spacial processes
and da1a models: Towards in1egn11ion of agen1-based models and GIS. Journal of
Geogn1phica/ Systems. 7:25-47.
Burrough. P.A.. McDom,ell. R.A. {1998). P,·i11ciples ofGeographical J,ifon11atio11
S),iems. Oxford: Oxford Universicy Press.
Carlson. S . (2000). The ama1eur scien1is1: Boids of a fea1her Hock 1ogether. Scie111ijlc
A111erica11, 283: 112- 114.
Cliff. A.O.. Ord. J.K. (1981). Spatial Processes: ,\fodels a11d Applicario11s. London:
Pion.
Cooke, \V.H .K., Kalarzyna. G.. Wallis, R.C. {2006).Avian GIS models signal human
risk for \Vest Nile vims in Mississippi. Jmer1101io11ol Journal ofHealth Geogm·
pi,y, 5:36. doi: IO.l 186/1476-072X- 5-36.
Dixon. B•• Uddameri. V. (2016). GIS a11d Geoco111p111atio11 for Water Resource Sci-
ence '11td E11giuee1i1tg. Hoboken: \Viley.
Fo1heringham. S.. Wegener, M. (2000). Spatial Models n11d GIS: Ne..- Pou,minl n11d
New Models. London: Taylor and Francis.
Goodchild. M.F., Steyaert, L.T., Parks, B.0 . (1996). GJS nnd Em'iron111e111a/ Model-
ing: Progress a11d Resem·c/1 Issues. Fort Collins: GIS World Books.
Griffith. 0.A.. uyoe, L.J. (1999). A Casebook for Spntinl Statistical Data Analysis.
oxford: oxford Universicy Press.
He. H.S.. Mladenoff. DJ.. Boeder. J. (1999). An object-oriented forest landscape
model and its representation of tree species. £cologicnl Modeling. JI 9: 1- 19.
Je11en. V.. Govers, G., Hessel, R. (2003). Erosion models: qualily of spatial predic-
rions. Hytf,vlogic !'r'OCesses. 17:887-900.
Johnston. C. (1 998). GIS in Ecology. Bos1on: Blackwell Scien1ific.
Kaufmann, A. {1975). burod11c1io1110 the Theory1 ofFu::v Subsets. New York: Aca·
demic Press. ·
Klir, G.J .. Folger. T.A. (1988). F,c:y Sets. Uncertainty, and /11fonnatio11. Eitglewood
Cliffs: Pre.mice Hall.
Krzanowski. R.. Raper. J. (2001). Sparial £vo/111ionary Modelling. Oxford: Oxford
University Press.
Malczewski. J.C. ( 1999). GIS and M11/1ic,·irerin Decisio11 A11nlysls. New York: Wiley.
McCool. D.J<.. Brown, L.C.. Fosier. G.R.. Mu1cbler. C.K.. Meyer. L.D. (1987).
Revised slope steepness factor for the Universal Soil Loss Equation. Tnmsactious
oftire American Society ofAgric1tl111ral Engineers. 30: 1387-1396.
McCool, D.K.. Fos1er. G.R.. Mu1cbler. C.K.. Meyer. L.D. (1989). Revised snope
length equation for the Universal Soil Loss Equation. Trausocaous <>/the Ameri-
ca11 Sociery ofAg1·icu/111ral £11gi11eers, 32: 1571- 1576.
Mlade11off, D.J .. He. H.S. (1999). Design, bebaviorand application of LANOIS. an
objec1-orien1ed model of fores1 landscape disrurbance and succession. In Mlade-
noff. D.J.. Baker. W.L. (Eds.). Admuces 111 Spnria/ Modeling ofForest Lo11dscnpe
Cltauge: Approaches a11d Applirorions. Cambridge: Cambridge Uuiversi1y Press.
Monmonnier. M. ( 1993). How To Lie lf'rrlt Maps. Chicago: Universily of Chicago
Press.
Moore. LD., Gessler. P.E.. Nielsen. G.A .. Pe1ersou, GA. (1993). Soil anribuie predic-
1.ion using retrain analysis. Soil Scie11ce. 57:443-452.
O'Su!Livau. D•. Perry. G.L. (2013). Sparial Si11111/atio11: Exploring Pa11em and Pro-
cess. Hoboken: John Wiley & Sons.
Paren1. 0 .. LeSage, J.P. (201 0). Aspa1ial dynamic panel model \\Olb random etfec1s
appJied to commtlling times. Trauspor1a1fo11 Research Pan B: J\1ethodologica/,
44:633-6<115.
Piuske, J .. Slade. M.E. (2010). The furure of spa1ial econome1rics. Joumal of
Regfo1111/ Scieuce. 50: I03- 117.
Reynolds. C.\V. ( 1987). Flocks. herds. and schools: A distribuled behavioral model.
Compwer Graphics. 2 1:2S-34.
Chapter 13: Spatial Models and Modeling 597
Running, S.W.. Nemani. R.R.• Heinsch. F.A.. Zhao. M., Reeves. M.. Hashimoto. H.
(2004). A continuous sa1elli1~derived measure of globaJ terrestrial ,primary pro~
duction. Bioscience. 54:547-560.
Stillwell, JA.. Clarke. G. (2004). Applied GJS m,d Spatial Annl)'sis. New Yorlc
Wiley.
Tom)in. C.D. (1983). Digitnl Cnrtog,.npl,;c tlfodeli11g Tec/m;ques in Eml'iro11meutaf
P/auuiug. New Haven: Yale University.
Turner. M.G .. Gardener. R.H. (Eds.) ( 1991). Qun111i1n1fre Methods in La11dscnpe Ecol-
ogy. New York: Springer Verlag.
Wagner. D.F. ( 1997). Cellular Automata and Geographic lnfomiation Systems. £111'i-
1v11mem aud Plam,iug. 24:219-234.
Wilson. J.. Gallant, J. (Eds.) (2000). Ten·ain A11alysis: Principles and Applicnrions.
New York: Wiley.
Wilson. J.P.. Lorang. M.S. (2000). Spacial models of soil erosion and GIS. in Spatial
Models a11d GIS, Fotheringham, A.S., Wegener. M. (Eds.). London: Taylor &
Francis.
Wolfram. S. (2002). A New Kind o/Science. Champaign. IL: Wolfram media.
598 GIS Fundamentals
Study Questfons
13. 1 - Provide an exalllple of a cartographic model, including the criteria and a now-
chan oftbe steps used to apply tl1e model.
13.2 . Why must the criteria be refined in many canograpltic modeling proc.esses?
13.3 . What do we mean when we say tliat most canograpbic models are temporally
static?
13.4 - \Vhat is a discrete vs. continuous weighting in an input layer when combining
layers in a cartographic overlay? How do you develop a reasonable continuous rank-
ing f\mction. that is. justify the shape of the cu,ve vs. the level of lhe input variable?
13.S . Match the output layers B, C, and D. lo tl1e appropriate reclassification graphs
Q • W when applied 10 lhe original DEM. A. Note that a billshade surface is superim-
posed to aid in visualizarfon.
Q)i,or T) :o
t j
~05•
8 I•e ,
j·'j
" I
B .if ,f
"' k.p.i• Lq'ff" lf'lo.Jf Loyer
R) 10' U) :o
• •
! i
c ~··
3
~
1··
~
0
~ ,f
l f'()U' LO)'tt
I 4
ltll)IJJ levtr
I
S) 10 W) :o
I;
t
~
t
~
~0)
8 }"'
# #
trw· Lq,'1'"
f # 4
lnout L.oytr
I
Chapter 13: Spatial Models and Modeling 599
13.6 - Match the output layers 8 , C, and D, to the appropriate reclassification graphs
Q - IV wben applied to the original DEM. A. Note that a billsbade surfa~e is superim-
posed to aid in visualization.
0) 10 T) 10'
'
! !
~OS
<> 8
ii•> f
0
'
# $
'
lfl!),lt Loy~
f # #
1 ~1 1.Cyl!t'
f
R) 10 U) 10
!ios !io,
a 0
•,e j
0
I
I~
I tr,ye:r #
l'l)IJ I Loygr
.f
S) 10 W) 10
' '
!aos !
ii••
a s
~ .f I •# $ i
tnp,u t t.Qy~r ' l.Clytl'" '
l ~l
0
600 GIS Fundamentals
13. 7 - The figure below depicts four flowcharts of canogrnphic models to find areas
most suitable for a new park. Sites are preferred that meet all of the followillg crite-
ria:
\Vithin 0.5 miles of Census polygons with a density of more tJ1an 120 persons per
square mile:
Greater 1.han 0~5 miles from an existing park:
Current land 1tse of grass or vac-ant.
Select the flow1:han wlticb best approximates the proper analysis. given the described
data. For each other flowchan. list at least one primary way it is inferior to the chosen
method. Note that some minor intennediate steps are omined/subswned into opera.
tions for all flowd1ans. so do 1101 cite a step omiued in both tlie best and alternate
flowcharts.
13.8 - The figure below depicts four flowcbans of canographic models 10 find areas
,nos1 suitable for new parks. Sites are preferred that meet all of the following criteria:
\\1thin O.S miles of Census polygons with a density of more than 120 persons per
square mile:
Greater than 0.5 miles from au exisiing park:
Within O.S miles of a school.
Select the Oowcban tliat bes1 approxima1es the proper analysis, gi"en the described
data. For each other Oowchan. lisr at least one primary way it is inferior to the chosen
method. Note that some intermediate steps are omitted for all flowcharts.. so do not
die a step omined in both the best and altemate flowcharts.
C)
0)
602 GIS Fundamentals
13.9 - The figure below depicts four flowcharts of canogrnphic models to find areas
most suitable for wild sheep habitat. Sites are preferred that meet all of the following
criteria:
Non-foresl: Slope greater than 200/o: Ele\lalion above 2,500 m: All areas witliin 5 km
of grassland or meadow: Each con1iguous polygon larger lhan IOOO acres.
Select the flowchart that be.sl approximate.s the proper analysis, given the described
data. For each otl,er flowchart. list a, least one primary way it is inferior 10 the chosen
method. Note 1ha1 some intennedia1e steps are omitted for all nowcbans. so do not
cite a step omitted in both the best and alternate flowcharts.
Chapter 13: Spatial Models and Modeling 603
13.10 - The figure below depicts four flowcharts of canographic models to find areas
most suitable for a desalinization plant. Sites are preferred for which au parts meet all
of the following criteria:
Ar an elevation of 5 m or grealer:
\\'ithin 0.25 km of the coas11i11e:
All areas within O.S km of 1muk water pipe:
Witl1in a single parcel:
Areas larger than IOO ha.
Select the flowcha.n that best approximates the proper analysis. given the described
data. For each other flowchan. list at least one priniary way ii is more likely 10 lead 10
au error. Note 1ha1 some small intermediate steps are omined for all flo1Vcbans. so do
not cil'e a step omitted in both the best and alteroale flowcharts.
A)
8)
0)
E3
604 GIS Fundamentals
605
Introduction
A sta11dmd is an establishe<I or sanc- in the United States. a set of resources 10 aid
tioned measure, form. or method. It is an 1he crea1ion and sharing of digita.l geo-
agre,,d-upon way of doing something. Spa- graphic data. Standards are develope<i
tial data and analysis standards are import- through a set of processes. from proposals
am because of the range of organizations through drafts 10 a FGDC adopte<I standard.
producing and using spatial data, and Standards may be modified through an
because these data are often U'3osfcrred update process. Curremly. there are stan-
among organizations. Data sraodards facili- dards on methods (e.g.. wetlands classifica-
tate a common understanding of the compo- tion), content (U1ili1ies Pata Contenl
nents of a spa1ial data set how data were Standard), iueiadata (data about data). and
develope<I. a,nd 1l1e utility and limitations of data transfer. The most recent NSDI plan at
these cL1ta. Ibis writing covers the 2021- 2024 period.
GIS practitioners use several types of Details are a, www.fgdc.gov.
standards. Dmn sumdm-ds are used 10 for- There are parallel initia1ives in many
mat. assess, docwneo1, and deliver spatial countries. infomtation on which can be
dara. lnteroperobility stoudordJ· identify found through the International Spatial Data
how spatial data are served between hetero- Standards Commissiort The Commission
geneous networksof software and hardware currently serves as a clearinghouse and gate-
systems. for example. betWl'tn wireless way 10 national standards across the world.
mobile devices and shared databases. Analy- The lntemational Standards Organiza-
sis standards ensure tliat 1he mos1 appropri- tion (ISO) organizes international standards.
ate methods are used and 1ha11he spatial and sponsors the ISOfTC2 ll standards
analyses provide the best infonu,ition possi- (hnp:l/w"w.isotc211.org). These specify
ble. Professional or cerlificariou sroudmds ways 10 st0re ru.td represent spatial and
establish the educatioo. knowledge. or expe- related infom1arion. services and data man·
rience of the GIS analyst, thereby improving agement. processing. transferring. and pre·
the likelihood tl1a1 the technology will be senting information. The standards are
used appropriately. oiganized as various project's. for example.
The Federal Geographic Data Commit- standards for represenling coordinates, test·
tee (FGDC) is !he leading government orga- ing standards. or for measuring data quality.
nization in 1l>e United States in defining data Many standards are in acti\lC development.
standards. The FGDC focuses on the and inasmuch as these standards become
National Spa-iial Data lnfrastrucnire (NSDO stable. it will ease da1a and infonuation
606 GIS Fundamentals
transfer among di.ffere nt GIS software. developed 30 fom1ats, bul data ·sharing is
organizations. and lhrougb tune. inhibited without standards. The OGC bas
The Open Geospatial Consortium developed such a data standard. with the par-
(OGC) is an ad hoc. self-selected group of ticipation of software, research, business.
and government representatives.
over 500 companies. research insti111tioos.
government bodies, and individuals dedi- It has proven more difficult to develop
cated to developi,ig interoperabili1y stan· professional and analysis standards that are
dards. Interoperation problems are inclusive across all disciplines. Standard
identified. sue.It as general difficulties in methods for one discipline may be inappro-
accessing lime-varying spatial location dam priate for another. For example. acceptable
through a distributed wireless network. and data collection methods for cadamal survey-
standards for access proposed. These are ors may be different than those for foresters.
reviewed, discussed, amended. and adopted. due to differences in accuracy and attribute
Web mapping setvices (WMS) stan- requirements.
dards are an example ofOGC initiatives. The GIS Certification Institute (GISCO
Web mapping services aUow GIS software has developed a GIS Professional (GISP)
to access data acr,oss the imemet as if they cenificateihat is gaining popularity. It seeks
were stored 011 lhe local hard disk. A GIS to certify qualifying levels for education.
program or utility "maps .. the WMS to the professional experience. knowledge. and
local computer, meanin.g it may access the ethics. and help guide i11divid101! profes-
data with the same protocols as if ii were sional development and continuing educa-
stored locally. wi1hout downloading a per- tion. It offers exani-based qualification.
manent copy to store on the local hard disk. There are parallel efforts to develop a
Web setvices such as WMS are in1port· set of standards in U.S. GIS professional
ant for 1he fon1re ,of cloud-based computing. practice. Known as competency 111ode/.s. they
where cL11a, programs. and processing are define a set of skills considered essential for
seamlessly distributed on computers con· effec1ive work in a field. and have been
nected across the web. Cloud-based geospa· developed for a growing onmber of indus-
tial compuiing is inherently dependent on tries. All have a common fou ndation of basic
robust, well.defined interoperability s.tan· personal and workplace competencies., with
dards such as those being developed by the industry· and then occtipancy-specific ski Us
OGC. Standards identify data fomiats and built on top.
content parts and naming, metadata, how
connections are made and data are passed
between program$ across distributed net- The Geospatlal Technology Com-
works. and error checking in transfer. Stan- petency Model
dards allow data robe combined across The Geospatial Technology Compe-
different organizations. with local storage tency Model (Figure 14-1) identifies a set of
and access fonn and protocols. and a stan. core and industry sector geosparial abilities.
dard way of setvi.ng up data to others The Competency Model identifies examples
through a service. of over 40 "Critical Work Func6ons•· tl1at
The /11door GML is an example of a geographic teclmology professionals are
newer OGC standard. under development 10 commooJy expected to master and use in
define daia forma.1s for interior building spa· their careers, and tl1e backgrow1d knowledge
tial data. Three-dimensional data for build· on whicb these Critical Work Functions are
ing interiors are useful to real estate. law based. This competency model ,s based in
enforcemenl. design. and constmction appli- pan on 1he Geographic lnfornmtion Science
cations, among others. Various software ven· and Technology Body of Knowledge. first
dors and research organizations have created by the University Consonitun for
Chapter 14: Data Standards and Data Quality 607
Geographic tnfomiation Science (UCGIS) two are substantially more complex than tJ1e
and published by the Association ofAmeri- first !WO,
can Geograpbers in 2006. Critical work Media S1a11dmds refer to the physical
f1u1ctions include operations in basic geod- fomi in which data are iransferred. They
esy. data cotll!<:tion systems. data structures. define specific fomiats for CD·ROM. mag·
GIS operarion and programing. anali1ical netic lape, optical or solid state storage, or
methods. cartography. the place of geo- some proprietary drive or otber media type.
grnphic infoanation science and technology Standardized fom1ats arc specified by the
in socicly, and organization and instinuions. lntemational Standards Organization OSO).
A set of higher-level requirements are noted
for specific occupa1ions. Fonnat sta11dmrfs speci.fy daia file com-
ponents and structures. A fom1at standard
establishes lite number of files used to store
Spatial Data Standards a spotial da1a set. as well as 'the basic compo-
Spa,;af dara staudmrls can be defined nents contained in each file. The order, size.
as metltods for sttttcruring. describing. and and range of values for the data element con·
delivering spatially referenced data Spatial 1ained in each file are defined. Jnfonnation
data standards may be categorized i1110 four such as spacing. variable type. and file
areas: media standards. forma1 standards. encoding may be included.
accuracy standards, and doc.umentation stan- Format standards aid in the practical
dards. All are important. although the lasi task of transferring data between computer
systems. either within or between organ.iza-
Manogemenl o«upo11on-Sp,e:c1t1c
Requirements
lnckis-fry·Se:ctor T«hnlcol Competenciu
PoSfliOrW'lg On.d Anolys1s Softwore and
Ooto Ac.qu..s1tion and Applicotio,i
MOCl-tlil"lej Ot\'Clopmeru
lnduslry·\Vide Te:ehr.col Compettl'l(.itS
Croucutllf'l9 CtospOtiC' A~ hU one! K.~<19c
Ceod~sy ClS Atlelytlcol Methods
0010 Oticltty Proorormw~ Ccrtogropt,y, V!S!Jalotion
GNSS/Mcmu!"fflltl'II P'f'of~ssionohrn GIS&t end Sc>eitfy
Re,not1;.S~ 0010 M()delr!g/TolX>&Ogy ~OflOn Ond lnihhltlOf'l!i
Cor1CJ9f"Whv ~(1)tvn1 ~/Oe3ign Others
Workptoce:
l(on,wo.'k CrtGfM: PIOf"U'I~ 5 Problem fool& g Chcc..1111"19. ~
Thril,ng OrQon Zll'l9 Solv,n; T«.IYIOIC>gy Recorthng /:'undarnet1IOOI
AGOdtmtc
Rt:odrlg. \Vrmn9
SGiettGel
Moth Cc01TOohv E,v.iint'.ennq
Comporer
a.,,..
Ptt'sot\01Effeetweness
JMliQ11','f:
figu~ 1..._1: TI1e Oeo5po.li~l Technology Competency Model adopted by tbe U.S. Ocpm"?ent of ~bor.
Please'": bnp:J/www.car«rou«1op,org/Compe1cncyM.odcVcom.pe1cncy-rnodclalgcospanal-t«bnol-
()j)'.aspx for II complete description of die model and compoucnt.t.
608 GIS Fundamentals
lions. Producers and users may not use the resources, and the documentation of dala
same hardware or GIS software. The inter- accuracy uicomple1e. Adherence 10 spatial
change between diff.,enr software systems data accuracy standards ensures we assess
is aided by general. standard fom1S in which and communicate spatial data quality in a
data may be delivered. well-defined. established mamier.
Many govenunenl or vendor fom1ats Doc1011e11rmio11 standanls define how
have become widely supported because data we describe spatial data. Data ace derived
are commonly delivered using these fonnats. from a se1 oforigiluil measurements 1ake11 by
For example, the U.S. government supports specific individuals or organizat ions at a
the Spatial Data Transfer Standard (SOTS). specified time. Data documentation stan.
This fon11a1 specifies the logic . formal. and dards are an agreed-upon way of describing
encoding for raster, vector. and topological the source. development, and fonn of spatial
data transfer of spatial data. ESRI shapefiles data. When documentation standards are
(a cluster of files including .shp, .sh.,. and used. they ensure a complete descrip1iou of
.dbf) are a conunonly supported vector for- the data origin. methods of development.
mat. and many organizations transfer data accuracy. and delivery fonnats. Standards
using them. These proprietary form31s are allow any po1emial user 10 assess the appro·
001 truly standard$ because the fom1a1s may pri:neness of these data for an in1eoded task.
be changed by the vendors that created tl1em. Data quality standards add value 10 our
Until data fonnats are agreed 10 by a stan- data. There are many ways to describe data
dardizing body. ambiguity in form will hin- positional and attribute error. An incomplete
der interpretation and hence use. description of spatial data quality may not
Spatial dota a ccuracy srnuda,ds docu· allow a user 10 judge if the data .are accept-
ment the quality of the positional and attri- able for an intended applicalion. A data
bute values. Kn6wledge of data 411ality i~ 4uality s1andard becomes faJlliliar tlii'ougli
cmcial 10 the effective use of GIS. but we use. We may know what levels of average
are often remiss in documenting spatial data error are likely to resuh in w1accep1able.
quality. This is dt1e in pan 10 the cost of ade- data. The standard allows us 10 compare two
quately estimating the errors in our spatial data selS in light of this past exp.,ience.
data sets. Field sa.mpling is expensive. Data
production is ofte.n pushed to available
Chapter 14: Data Standards and Data Quality 609
Data Accuracy
An accurate observation reflects the true Spatial data accuracy Olay be degraded
shape. localion. or characteristics of the phe- during laboratory processing or data reduc-
nomena represen1ed in a GIS. When tl1e con- tion. Mis-copies during the transcription of
cept of accuracy is applied 10 spatial field no,es. errors during keyboard entry. or
variables. it is a measure of how often or by mistakes during data manipulation may alter
how much our data values are in error. Accu- coordinate values used to represent a spatial
racy may be reponcd as a frequency. for data feature. Improper representation in the
example. wh.en we repon 1ha1 20% of tbe compu1er may cause problems. such as
land cover class labeled as cropland is actu- rounding errors when mulliplying large
ally pereuuial grasses. Al<enrntively. accu- numbers.
racy may be expressed as an average error Data may aJso be in error due to changes
magni<ude: for example. light poles may be through tin1e (Figure 14-2). The wotld is
displaced on :average by 12.4 m from their dynamic. while our representation in a spa-
tme locations. tial data set caplures a snapsl101 at the time of
lnadequa.cies in our spa1ial daia model data collection. Vegerntion boundaries may
may cause spatial data error. W11en we use a be al<ered by fire. logging. coos<ruction.
raster data sel with a fixed cell size. ,ve have conversjon to agricullure, or a host of other
set a limit on our positional accuracy. The bwnan or natural disturbances. Even in
raster model ass,uues a homogeneous pLxel. instances where positions are static. anri-
If more than one category or value for a vari- bu1es may change through time. A two-lane
able is found in the pixel. then the attribute gravel road may be paved o~ widened, caus..
value may be in error. This generalization ing attributes to be in error. :Layers should
error may also occur iJl vec.tor da1.a sets. Any have a recommended update interval that
feature smaller than the minimum mapping may vary by type. Elevation. geology. and
unit may not be represented. vec,or data sets soils may be updated rarely.. and still main-
may poorly represent gradual changes. so tain their accuracy. VegetatiQn. population.
there can be i.ncreased attribute error near land use, or other factors cltange at faster
vector boundaries. Digital soils data are rates. and should be updated more frequently
often provide·d in a vector data model. yet if they are to remain accurate.
the boundaries between soil types are often
not discre1e. but change over a zone of a few
to several me;ers. Documenting Spatial! Data Accu-
racy
Errors are often introduced during spa-
tial data collection. Many positional data are We must unambiguously identify tme
currently coll.ected using GNSS technolo- conditions if we are to document spatial data
gies. The spatial uncertainty in GNSS posi- accuracy. For example. a road segment may
tions described in Chapter S is incorporated be completely paved. or not. The data record
into !he positiional data. Fe.ature localions for that road segment is accurate if it
derived from digili?ed maps or aerial pbOlO· describes the surface correc1ly. and inaccu-
graphs also contain positional errors due to rate if it does 1101. However. in many cases,
optical, mechanical. and human deficiencies. the tmtb is not completely known. TI1e loca~
Lenses, cameras. or scanners may distort tions for the above roads may be precisely
images. positional errors may be introduced surveyed using the latest car:rier phase GNSS
during registraiion. or errors may be pan of methods. Road centerlines and inte.rseelions
the digitization process. Blunders, fatigue. or may be known 10 the neares.t 0.5 cm. While
differences among operators in abilities or this is a very sn~,ll error. this represents
at1in1des may result in posi1iooal uncertainty. some.ambiguity in what we deem to be the
truth. Establishing the accuracy of a data set
610 GIS Fundamentals
Flaurt I~-2: Sp:ui:111d.t111 m11.y be i.11 c:ITOr b«:ltilc of tbc p:w.,igc of1ii11e. R<>11.d nUtps b.ucd on :2003 phote>
graplis (left) from !he town of Hugo. Minncs~~. 11.tc in c:m,r in ?OISI (rigi.) due to new nnd re·o.ligncd roads
atld' new building~ asso,;ia1cd with wlmrban de\·clopmct1.1. Changes with the: pauagc: of time: uuy add errors
to datll (coun«y ~·lNGEO).
requires we know·the accuracy of our mea~ assessed using different methods than anri-
sure oftnub. bute data. Nominal anribute data (e.g,. d!e
In most cases. the truth is defined based type of land cover). will be assessed differ-
OD some independent. higher order measure-
ently than a measurement recorded on a con-
ments. In our roads example, we may desire tinuous range (e.g.. purchase price of a
that our data layer be accurate to 15 m or parcel).
bener. Gaged OD chis scale. the 0.5 cm accu- There are four primary ways we
racy from our carrier phase GNSS measure- describe spatial data accuracy: posftiounl
ment may be considered 1n,e. occ,n·ac,~amibute accumc)~logiro/ co11sfs·
te,1cy. and completeness (Figure 14-3).
Accuracy is 11.1os1 reliably derennined by
a comparison of tn1e vah1es to the values These fourcomponents may be comple-
represented in a SJ"'tial data set. Tllis mented with infomullion on the lineage of a
requires wecoUect data a1 an adequate set of data set 10 dcfrnc the accuracy a11d quality of
sample Jocatfon.s. True values are collec1ed a data set These components are described
at these sample locaiioas. Corresponding in turn below.
values are collect<d for tbe digital spatial Posfrfounl occumcydescribes bow close
data. The tme and data values are compared. the loca1ions of objects represented in a digi·
errors calculated. and summ:uy statistics tal data set correspond 10 the true locations
generated. forthe real-world emi1ies. In practice. 1rutl1
The source for 011r truth. the sampling is detennined from some higher.order posi-
method. 011r method for calculating error. tioning 1ech11ology.
and the sununary statistics we choose ,,~II Amibme occuracv swnmar~zes how dif-
depend on tl1e type of spatial data that are to ferent the anributes are from ~le 1n,e values.
be evaluated. Positional data will be Anribute accuracies are us1•1lly reported as a
Chapter 14: Data Standards and Data Quality 611
o) Positional occurocy
-b) Attribute accuracy
Fi,&UJ~ 14-J: E:xomelcs of errors of ,-ariom types. This fig\1tc sl~ws: digitlzed fca~, (lines or_potllts) over·
Jamon a soUNc amal pboto~gh. Errors arc lab¢lcd \,i tb uumbd's.. lu o tbc bouscs labdcd J. 2. aud 3 suffer
&o,u pofitiot~I ~lllccumcy, wiu1c b dcmo1d1n11c.s :mrlbtuc :l.CC'l.lr.W:Y in iiuit pra;e.s ai I :\1'C' fabdcd aa bowcs.
3.ud 2 t111d 3 sbow 11~1111,ciits and h011R$ mi, ..l.;ibclcd. Panel Cshows tbn.1dn.111 m :1y not be logie11U>· co!lsis1c111,
,\;th Ligl11_poles at locallons 1. 2. acd 3 in a street. and d shows d~t data may lack co1nplc~cncu. "ith bou§es
I. 2. ,ind 3 not dia,itiz.cd. as shown by the missius; oudll'lc.
mean error or quantile above a threshold data layer may omit cer1ain stmctures, and
error for attributes measured on interval/ the frequency of these omissions reflects an
ratio S<:ales. and as percentages or propor- incomplete data set.
tions accurate forordinal or categoric.al attri- Data sets may be incomplete because of
butes. generalizations during map production or
Loglcal cousisreucy reflects the pres- digitizi.ng. For example. a minimwn map-
ence, absence. or frequency of inconsistenl ping unit may be sel at 2 ha when compiling
data. Tests for logical consistency often a "egetation map. Isolated s.maUpastures
require comp:arisous among 1hemes. for scattered through the forest may not be rep-
example. all roads occur on dry land. This is resented because they are only slightly larger
different than positional accuracy in that than this minimum mapping unit. and erro-
both the road and the lake locations may neously tbey are not represented in the data
contain positional error. Howeve.r. these layer.
errors do not cause impossible or illogical li11ell$e deS<:ribes the sources. methods.
juxtapositions. Logical consistency may also timing, and persons respons..ible for the
be applied to attributes. for example. wet- development of a data set. Lineage helps
land soils erroneously listed as suitable for establish bounds on the other measures of
constniction. or lakes with zero depth. accuracy described abo\'e, because knowl-
Comple1e11ess describes bow weJJ the edge about cenain primary data sources
data set capoires all the features. A buildings helps defme the accuracy of a data set.
612 GIS Fundamentals
Positional Accu racy are precise. but they may no! be 11e.ar the true
value. meaning they are inaccuN!te. Abia_s
Posirioual aocuracy measures how close may exist. defined as a syste~at,c offset m
a databose representation of an object is 10 coordinate values. A Jess precise process
the trne value. Accurate positions have small will result in a set of points that are more
errors. Small is defined subjecti\'ely. bm widely spread. However. their average error
may at least be quamified. may be substantially less, therefore., lhe set 1s
Precision refers to the consistency of a more accurate.
measurement method. Pn..~ision is ust1ally Figure 14-4 illustrates the difference .
defuted in 1enns of how dispersed a set of between accuracy and precision. Four dig, ..
repeat measuremems are from the average 1izing sessions are shown. Tl>e goal is to
measurement. A precise measurement sys· place several poinrs at rbe center of rhe c.lo-
tern provides rightly packed results. Precise verleaf intersection in Figure I4-4. The
digitizing means we may repeatedly place a
upper left panel shows a digit_izlng p_rocess
point in the same location. that is bo1b accurate and precise. Points.
Accuracy and precision are often con· shown as light-colored circles, are clustered
fused. bm they are 1wo differem cbaracteris· 1igb1ly and accurately over the i11tended
tics. both desirable. that may change location.
independently. A :set of measurements may The upper right panel of Figure 14-4
be precise but ina.ccurate. Repeat measure- shows points that are precisely placed
mems may be well clustered. meaning they (tightly clustered), but not accurately
Fta:urt 14-1:AC('Un'IC)' and precision. Poinl$ (circles) arc digitized in rcpci:al attc111p1s t4!.r)qn«eot.lhcbenter
of tbc clon:r1caf' i.ntcl'!l«tion. Diiitizcd data ma)' be ac-cur.uc ti.11 001pr«1sc {lower !en , or prccu:c. l!t 001
i1ccun1tc (upper rigbt), or wort:t of 211. i.iuc--cumtc. al'!d iruprcc-isc (l<>\\'cr lcfi). We prefer ll proccu lh:u y:iclds
bod1 accuracy ;md prccislo11 (upper left).
Chapter 14: Data Standards and Data Quality 613
located. This might be due to an equipmenl nates for each well. This leaves us with a list
failure or some problem in registr.uion: the of errors. one associated with each wen loca-
operator may· have made some bhmder in tion (Figure 14-5). Distance is measured
photo registration and introduced a bias. using the Pythagorean fornrula \\~th 1he true
The lower left panel of Figure 14-4 aod database coordinates. Distances are
shows points that are. accurately but i_mpre- always positive because of the form ofdte
cisely digiti.led. The average location for fommla.
these poinis is quite oear the desired posi- We may compute the in.can error by
tion. the center of the cloverleaf intersec.tion. summing 1he errors and dividing the slun by
even though individual points are widely I.be number of observations. This gives us
scattered. These poims are not very close to our average error. a useful statistic some-
the mean value and so precision is Jow. even where near the midpoint of our errors. We
though accuracy is high. are often interested in the distribution of our
The panel at the lower right of Figure errors. and so we also commonly use a fre-
l4-4 shows points with positions thal are quency hisl'og.mm 10 stmun.irize our spa1ial
both imprecise and inaccurate. TI1e mean error. The histogram is a graph of the nwu·
value is not near the true location. nor are 1be ber of error observations by a range of error
values tightly ch1stered. values. for example. the uumber of error val-
ues between Oand I , between I and 2.
The thresholds tbat constinue high accu- between 2 and 3. and so on for au our obser-
racy or precision are often subjectively vations. The graph will indkate the largest
defined. A du !fer may consider as accurate and smalle.st errors. and also give some indi·
any golf shot that lands on the green. This cation of the mean and mosl commonerrors.
definition of acc.uracy may be based on thou-
sands of previous attempts. For a profes-
sional golfer. anything farther than 2 m from
the hole may be an inaccurate shOI. In a sim-
ilar fashion. tl1e spatial accuracy sought by a
land surveyor may be different than those of
a federal land maoager. Cadastral surveys
require the utmost in accuracy because peo.
pie teod 10 get upset wheu there is material
permanent trespass. as when a neighbor
builds a garage on their land. LO\ver accu·
racy is accepcable in other applications: for
example•., statewide map definiug vege1a-
1ion type may be acceptable even though
bowtdaries are off by tens of meters. true point
The mean error and an error frequency
coOfdlnote <x,. Y,)
locotlon dotobose po,nt
threshold are the st3listics most often used 10 coord,note
document positional data accuracy. Consider locohon
a set of wells represented as point features in
a spatial data layer. Suppose that after we
have digitized our well locations. we gain
access to a G:NSS system that effectively
gives us 1he lrue coordinate locations for error distance • V(>, - >a! •(Y, - Y~l
each well. We may then compare tliese well
locations to 1J1e coordinate locations in our f'ia;un U -5: PositioiW fflOl"5 arc meMured by the
Pvthai,O!C3ll diitau« bctw«o a 'IJUC and database
database. We begin by calculating the dis· cOO'rdmatc for a location.
tance between our 1n1e and database coordi-
614 GIS Fundamentals
Examples of ,error frequency distribu. Because the mean s1atistic alone does
1ions for ~vo differe111 dalll sets are shown in nOI provide infonnation ou I.he distribution
Figure 14-6. Eacl, plo1 shows 1he frequency of positional errors. an error frequency
of errors across a range of error dislances. threshold can be reported. An error fre-
For example. 1he 1op graph shows tl1a1 quency threshold is a value above or below
approxima1ely I percem of e11ors have a which a proponion of the error observa1ions
value of near 4.S :m. and lhe mean error is occurs. Figure 14-6 shows 1he 9:.5% fre-
near 13 m. quency 1hreshold for 1wo error dis1ribu1ions.
The mean error value does 001 indicate The 1breshold is placed such ma-i 95% of die
the distribution, or spread of the errors. 1\vo errors are smaller than the threshold and 5%
data se1s may have the same mean error but are larger than the threshold. Tb-e lop graph
one may be inferior: the data se1 may have shows a 95% frequency 1bresbold of approx-
more large errors. The bonom graph in Fig- imately 21.S m. This indicates tbat approxi-
ure 14-6 has 1be same mean error. 13 m. as mately 95% of 1he positions 1es1:ed from a
the top graph. Note that tile errors bave a sample of a spatial dalllbase are less 11\an or
narrower distribution. meaning the errors are
equal lo 21.S m from lhe 1me loca1ions. The
chu11ped closer 10 11\e mean than in 1he 1op bottom panel in Figure 14-6 has a 95% fre-
graph. and 1here are fewer large errors. quency threshold a1 17.6 m. This means 5%
Although the mean error is the same. many of1he errors in 1he second iested database
would consider the data represented in 1he are larger than 17.6 m from their 1rne loca-
bonom graph of Figure 14-610 be more tion. Jfwe are concerned wi1h llte frequency
accurate.
oflarge errors. this may be a bener sununary
statistic than the mean error.
Accuracy Calculations
' The calculation of point accuracies and
3 summary s1a1is1ics are 1be next steps in accu-
racy assessment. First. 1he coordinates of
5% both the true and data layer posi-iions for a
feamre are recorded. These coordinates are
used to calculate a positiooal difference.
known as a positional error. based on the dis-
• 12 1 16
"'frequency
I" tance between 1he m,e coordina1es and the
mean data layer coordinates (Figure 14-5). The
threshold
true coordinates faH in a differeul location
1ban 1he coordinates derived from lhe da1a
I', layer. Eacb test poim yields an error distance
e. shown in Figure l4-5 and defined by the
equation:
5%
(14J)
•• 8 111 16 ' 20 1,
meQn frequer\Cy where x,. Yr are true coordinates and ~ · y4
rhreshold are the da1a layer coordinates fo:r a point.
positional error The squan.'<i e11or differences are then
fii:u~ 14-6: Mean ,error aOO freciuency thre5bo1ds
calc.ulated, and the sum.. average. and root
are often used to rcp,or1 positiouarerror,The ligbd)' mean square error (RMSE) sta1i~1ics deter-
s.baded gray trea rcprcseu1s 9S%o( the dn.1:1.
Chapter 14: Data Standards and Data Quality 615
mined for the data set. As previously defined get from our sample to detennine whal is tJie
in this book. the RMSE is: likelihood of a large or small error. A large
RMSE means 1he errors are widely spread,
e.j' + e.i' + ... e '~ and a small RMSE means lhe errors are
(142)
RMSE= packed tightly around the mean value.
n
Figure 14-7 shows an example set of test
where e: is defined as in equation 14. l. and n points for road data layer. aud au image
is the number of test points used. backdrop. Prior knowledge leads us 10
The RMSE is no11he same as the aver- expect average errors in excess of 6 m. Jn
age distance error, nora "typical'' distance this example, we have selected high resolu-
error. The Rl"ISE is a su11is1ic that is 11seful tion GNSS as our tnie data source. We kJ1ow
in detennini11.g probability thresholds for these photographs have a positional error of
error. The RMSE is reJated to the siatistical less than J5 cm. on average. from metadata.
variance of the positionaJ error. lf we assume These images were selected because they
the x and y e-.rrors follow a bell-shaped 111eet our accuracy requirem.ents and are
Gaussian curve conunonly observed when available for the enlire work area.
san1plu1g. then the RMSE tells us something The display of road locations on top of
about the distribution of distance errors. We the images shows there are substanlial dif-
can use knowledge about the RMSE thal we ferences in tnie positions of feanires and
FlJU1't 14-7: A ~ch data fa)'Cf displayed OYt-r a g:corcfco:nccd i.uui3c. Tcs-t point locations art- 4bown
l\$filled citt lu. T11c tci 1 p0i111 1ruc c001din:1.lc ,,,iluct sJ1ould be from n. lii3!111.cctm1,cy ""'1J'('c , e .g.., c cn•
time Ier lc,-cl ONSS ~utYcv;. TI1c dall\ coordimtc \·n.lucs l\~ ~lmc:tcd &om the road$ l.1ycr, OiffCl'C'occ$
in dle&c locations would be used to cstllootc the posi1iooal a.:cur.1cy ofthe toadj dat11 Layer.
616 GIS Fundamentals
their representations in the roads data layer. Positional Accuracy Sta·n dards
Any right-angle iJuersection is a pros~ctive
1es1 point. There are three commonly used Sllln·
<lards for map and rela1ed digital data posi-
The inset in the lower right of Figure 14· tional accuracy in North Americ:a, and
7 shows tile m1e point location.1 relative to adopted in much of the World. The first is
the road intersections. Road centerlines were known as the National Map Accuracy Stan-
digitized. These true loca1ions would be dard (NMAS), and was develop<d by the
idemified on the images. perhaps by point- U.S. Geological Survey 10 specify positional
ing a cursor at a georeferenced image dis- accuracy on hardcopy maps (ASPRS 1990).
played on a computer monitor. The data 11 specified a I/30th of an inch or larger error
coordinates would then be extracted for the on no more than IO%of1es1 poims for maps
corresponding rood intersection. and these a1 a scale of 1:20.000 or larger. wltlch trans-
two coordinate pairs. the true X. Yand data lates 10 a 4.2 111 ( 13.9 fl) threshold on a
x. Y. would be one 1es1 point used in acct1- I :5.000 scale map. Given current dominance
racy calculations. of digital cartography. the standard might be
Test points must be clearly identifiable best interpreted as applying 10 spa1,,1 errors
in both the test data set and in the 1ruth data at the scale of di2.ital aerial imaaes used in a
set Points that are precisely, unambiguously data collection effort. b111 even ~,en. the
defined are best. For example. we may wish errors are quite large relative to ,current tech-
to document 1he accuracy of roads dam com- nological c,,pabilities, so this standard bas
piled from medium-scale sources and repre- linle relevance except when prooucing hard·
sented by a single line in a digital layer. copy materials.
Right-angle rood intersections are preferred The American Society of Pbo1ogran11ne-
over other features bec.ause the positions 1ry and Remote Sensing bas deV'eloped a
represented in the database may be precisely standard for digital orthounagery and eleva-
detennined. The coordinates for the precise tions derived therefrom (ASPRS. 2015). The
center of the road intersections may also be standard specifies methods for reporting
detennined from a higher-accuracy data set. accuracy lhresbolds, all based on the mea-
for example. from digital onhopho1ograpbs sured RMSE. An x or y error threshold may
or field surveys. Other road features are le.S,S be established. e.g.. the RMSE ni11s1 be less
appropriate for lest p-0in1s, including road than 5 cm for agive11 orthopboto mosaic,
intersections a1 obluse aogles or acute and assuming aonnal errordistrjbution. this
curves. because there may be subs1an1ial threshold multiplied by 1.414 as a radial
uncenainty when matching the data layer 10 error(e. g., 7.1 cm. or 1.414'5). or multiplied
true coordinates. by 2.448 for specificalioa at a 95% hori2on~
The source of the true coordinate posi· ta! acc,,racy confidence interval (e.g. 12.2
1ion should match our minimum accuracy cm. from 2.448'5). It also establishes report-
specification. or at least an order of magni· ing requirements for vertical accuracy. with
tude more accurate than the errors. GNSS differem standards for vegetated and noo-
are a common source oftmth. as lhe accu· vegemted areas. reflecting the greater diffi-
racy may be set by collection equipment and culty in venical measurement for images or
me.thods, but any source of tnllh that lidar where vegetation obseures tl1e ground.
matches our requiremems is acceptable. The standard specifies accuracy testing
and reporting guidelines, most ootably on
the number and distribution of c beck (test)
points to acceptably quantify errors. Check-
point numbers increase with project area.
with 20 check points required for projects
encompassing less than 500 km2. growing to
Chapter 14: Data Standards and Data Quality 617
x x x (X y y y (y sum
ID (IIUO) (date) ditlor- d11fe,. ~,ue) (date) ddlor- (Wfo,. X diff2 •
'
2
,2
18
,o
22
ooce
2
...
IJJKe'f
16
4 TaiF ~
234 228
..
ence
6
eoce)~
10
36
y diffl
20
52
3 7 12 .5 25 265 266 .1 1 26
4 34 34 0 0 243 240 3 9 9
5 15 19 ... 16 291 287 4 16 32
6 33 24 9 81 211 215 ... 16 97
7 28 29 -1 1 267 271 ... 16 17
8 7 12 .5 25 273 266 5 25 50
9 45 44 1 1 245 244 1 1 2
10 110 99 11 121 221 225 ... 16 137
11 54 65 . 11 121 212 206 4 16 137
12 87 93 -6 36 284 276 6 36 72
13 23 22 1 I 261 259 2 4 5
14 19 24 -5 25 230 235 .s 25 50
15
16
76
97
80
108
••
. 11
16
121
256
201
260
20.1
.s
-3
25
9
41
130
17 38 43 .s 25 290 288 2 4 29
18 65 n .7 49 2n 282 .s 25 74
19 85 78 7 49 205 201 4 16 65
20 39 44 .5 25 282 278 4 16 41
21 94 90 4 16 246 251 .s 25 41
22 64 56 8 64 233 227 6 36 100
Sum 1227
Average 55.8
RMSE 7:,
NSSDA 12.9
Errors In Linear or Area Features two errors at the endpoints {Figure l4-9).
The NSSDA as described above treats These observaiions are not crue if a straight
line segment is tJsed to approximate a sub-
only the accuracies of point loca1ions. lt is stantially curved line. However. if the line
based on a probabilistic view of point loca- segments are suflicie111ly shon (e.g., me
liOns. We are 1101 sure where each poim is: in1erval along 1he line is small rela1ive 10 the
however. we can specify anerror distance r radius of a curve in the line). and I.he posi-
for a set of foatures. A circle of radius r cen- tional errors ru:e distribu1ed .evenly on both
1ered on a poin1 feanue in our spatial daia sides of1he Line segmems. lllen the NSSDA
layer wil.l inc lude the m,e poin1 location methods described above wm provide an
95%of the time. Unfortunately. there are no approximate upper limit on the 1.inear error.
es1ablished standards for describing the
accuracy or e1TOr of linear or an..v.i fearures.
la some ins1ances, we may assmue the
Attribute Accuracy
well-defined poim fearures described in our Unlike posi1ionaJ accuracy. there is no
accuracy 1es1 above may also represent cbe national standard for measuring and repon-
accuracy for ·vertices of lines in a data layer. ing attribute accuracy. Accuracy for continu-
Venices may be used as test points, provided ous variables may be calculated in an
1bey are well defined and 1be true coordi- analogous maru1er to positional accuracy.
nates are known. However. the errors al Accuracy foreach observation is defined as
imcrveniug locations are not known. for the difference between the true and da1abase
example. midway along a line segment values. A se1 of tes1 da1a points may be iden·
between two vertices. The error along a tified, the true attribule valu e detem1ined for
straight Line segmen1 may be at most equal 10 each oflhose tes1 daia poi11ts. the difference
1he largest error observed a1 the ends of 1he calculated for each tes1 point. and 1he accu·
line segments (Figure i4-9). If 1he data line racy summarized.
segmeu1is parallel to the true line segmen1, The accuracy of ca1egorical anrib111e
1heu 1he errors are unifonn along the full data may be summarized using an e,TO,·
lenglh of the :segme-nt. Vertices that result in
converging or crossing lines will lead 10 mble and asso<:iated accuracy slatistics.
midpoint errors less 1han 1he larger of1he Points can be classified as correc1. tha1 is. the
categorical variable matches the true cate-
gory for a feature. or they may be incorrect.
lncorrec1 observa1ions occur when 1be true
true values are entered via the appropriate Per category accuracy may be e.xtrncted
column and the data layer values are emered from tl>e error table. l\"o types of accuracy
via the appropriate row. The 1able is square. may be calculated. a use,-'s nca 1racy and a
because there is t1te same number of catego- producer 's ncc1o·ncy. The user r•elies on the
ries in both tlte rows and cohuuns. Correctly da1a layer to detenniue 1he categoiy for a
classified (eanires are tallied on the diagonal feature. The user is most often interested in
- the tme value and data layer value are how often a feature is mislabeled for each
identical. so they are noted at the intersec- categoiy. In effect, die user wan•s to know
tion oftl,e categories. Incorrectly assigned bow many feamres that are classified as a
categoiy values fall off the diagonal. categoiy (the row tOlal) are tmly from that
Error tables Sllllllllarize the main charac- categoiy (1be diagonal element for that row).
teristics of confusion among categories. The n,us. the user's acct1tacy is def,ned as the
diagonal elements contain the test fean1res number of correctly assigned features (the
that are correctly categorized. The diagonal diagonal element) divided by the row total
stun is the total num1ber correct The propor- for the category. The producer. on the other
tion correct is the total number correct band. koows the true identi1y of each feanrre
and is often interested in how often these
di,•ided by the toral munber tested. The per·
cent correct can be obtained by mulliplying feamres are assigned to the correct categoiy.
the proportion correct by I00. The producer's accuracy is defined as the
diagonal element divided by the column
total.
true value
wheat 14 4 4 22
corn 2 12 I 3 18
soy I 18 2 21
olfo1f< 3 2 16 I 22
gross 3 1 1 12 17
follow 20 20
20 20 20 20 20 20 ! =92
Figul"f' J.-1,.IO! An aror tAbJc n1ccinctl>· s:uminn.rizcs: the attribute 11ecttr11ey (or t\11tcaori~ I
\'nri.,bk ti.
Chapter 14: Data Standards and Data Quality 621
Error Propagation In Spatial Anal· might be counted in each census a.rea and tJie
ysls density calculated. We have replicated val-
ues from each census area. so we may calcu-
While we have discussed melbods for late a mean and a variance, and the
assessing pos.itional and anribute accuracy. difference between modeled and observed
we have not described how we de1em1ine tl1e values might be compared relative 10 1he ,,.,_
effecis of input errors ou 1he accuracy of ural variation we observe among differe.nl
sP31ial operations. Cle.irly. input erTor ceosus areas. We could also survey an area
affects oulpul values in most calculations. A through time. for example. on successive
large eleva1io,n error in DEM cells will likely days. monlhs. or years, and compare the dif-
cause errors ill slope values. If slope is then ference between tlie model and obsen>ed
combined ,,~th other features from other values for each sample 1ime.
data layers. these errors may in tum proJl3·
gate 1hrough_1be anal}~is. How d? we assess 11 may not be possible or desirable 10
the propagation of errors and their 1mpac1s wait 10 assess accuracy until after complet-
on spatial an•l~1is? ing a spatial analysis. Input da1a for a spe-
cific spatial analysis may be expensive to
There are currently no \\~dely applied, collect. We may not wish 10 develop !be data
general methods for assessing the effect of and a spatial model if errors. in the input pre-
positional errors on spatial models. Research clude a useful output After model applica-
is currently directed at several promising 1ioIL we may wish 10 identify the source of
avenues: however. the range of variables and errors in our final predic1ion.s. Improvements
conditions involved has confom1ded !be in one or two data layers may substantially
developmem of general methods for assess- improve the quality of our predictions: for
ing the.impacts of purely positional errors on example. bener data on forest cover may
spatial models. increase t.he accuracy of our deer density
Several approaches have been deveJ. predic1ioos.
oped 10 es1ima1e 1be impacts of a11ribu1e Error propagation in spatial models is
errors on spatial models. One approach often investigated with repeated model runs.
involves assessing errors in the final resul.t We may employ some son of repeat simula-
irrespective of errors ui the original data. For tion model 1hat adds error 10 data layers and
example. we may develop a canographic records the impacts on model accuracy.
model to estimate deer density in a suburban These simulation models onen employ a
environment. The model may depend on the staodard fom1 known as a lvfo11re Carlo sim-
density of housing. forest loca1ioo.1ype. and ula,iou. The Monie Carlo method assumes
extent. the loc.ation of wetlands. and road each inpm spo1ial value is derh·ed from a
location and traffic vo!W11es. Each of these population of values. For example. ~,nd-
data sources may contain positional and cover nmy range over a set ofvalues foreac.h
attribute errors. cell. Funher. model coefficients may also be
Ques1ioos may arise regarding how altered over a range. In a cartograplllc
these errors in our input data affect the model. the weights are allowed to range over
model predictions for deer density. Rather a specified in1en•al when layers are com-
1hau 1rying 10 identify how errors in 1be input bined.
propoga1e through to affect the final model A Moote Carlo simulation c.ontrols bow
re.suits, we m:ay opt to perfonn an error these input data or model p<trame1ers are
assessmem or our final ou1pu1. We would allowed to vaty. Typically. a random nom1al
perform a field survey of deer deosiry and distribution is assumed for continuous input
compare the ,,alues predicted by the model values. If au,1ariables sa\1e one are beld con-
with the values observed in 1be field. For stant. and several model runs performed on
example. we might subdivide 1he sn>dy area different. random selections of the variable,
into mutually exclusive census areas. Deer we may ge1 an indication of bow a variable
622 GIS Fundamentals
affects the model output. We may find that Inaccuracies may be reported us:ing many
the spatial model is insensitive 10 large methods. including a mean valu,e. a fre.
changes in most of our input da1a values, bu1 quency distribution. or a threshold value. An
sensi1ive 10 small changes in a few. For accuracy assessment or me.asurement applies
example. predicted deer density may not only 10 a specific data set and time.
change much even when landc.over varies Accurac.y should be recogn~zed as dis·
over a wide range of values. but may depend 1inc1 from precision. Precision is a measure
heavily on housing density. However, we of the repealllbility of a process. Imprecise
may also find a set of input data. or a range data collection often leads to poor accuracy.
of input data or coefficients, that substan-
tially control model output. Standards ba,-e been developed for
assessing position..11 accuracy. A-ccuracy
A Monte Carlo or similar simulation is a assessment and reponing depend on sam·
computationally intensive technique. Thou. piing. A set of fearnres is visited in the field.
sands of model nuis are often required over and the 1:rue values collected. These i:rue val·
each of the component units of the spatial ues are then compared to corresponding val·
domain. Thecompulational burden increases ues stored in a data layer. and the differences
as the models beoome more involved. and as between trne and database values quantified.
the number of spatial units increases. How- An adequate number of well-di,;tributed
ever. it is often the onJy practic.al way with samples should be collected. Standard wOlk·
which to assess the impacts of uncenainties sheets and statistics have been developed.
on spatial analyses. uncenainties both in the
input data and the parameters and methods Data documentation standards have
in combining theut been developed in the United States. These
standards. developed by the Fed:eral Geo-
graphic Data Conuniuee: are known as ,he
Summary Content Standard for Digital Goospatial
Data standards. data accuracy assess- Meladata. Tilis standard identifies specific
mem. and data documentation are among the iofomrntioo that is required to fuUy describe
most important ac,tivities in GIS. We cannot a spatial data set.
effectively use spatial data if we do not
know its quality. and the efficiem distribu·
lion of spatial dala depends on a common
understanding of <lalll content.
Data may be inaccurate due to several
causes. Data may be out ofdate. collected
using improper ntetbods or equipment, or
collected by unskalled or inattentive persons.
Accuracy is a measure of cITor. a differ·
ence between a tm e aod represented value.
Chapter 14: Data Standards and Data Quality 623
Suggested Reading
Arbia, 0 .• Griffith, D., Haining, R. ( t999). Error propagation and modeling in raster
GIS: ,overlay operations. buem01io11al Jounwl o/GeograpMcal Jnformatiou Sci·
e11ce. 12:145- 167.
ASPRS. ll 990). ASPRS Accuracy Stru1dards for Large-Scale Maps. Photogm111111e1-
ric Engineering & Remote Seu.sing, 56:1 068-1010.
ASPRS. (2015). ASPRS Positional Accuracy Standards for Digital Geospatial Data.
PhorogrammehJ' E11giueeri11g Remote Se11si11g. 81: 1- 26.
Balazin.ska. M.. Deshpande, A. franklin, M.J., Gibbons, P.B.. Gray, J.. Hansen. M..
Liebhold. M.. Nath. S.. Szalay. A.. Tao. V. (2007). Data mauagemem in tl1e
Worldwide Sensor Web. Permsive, 6:30-40.
Blakemore. M. ( J984). Generalization and error in spatial data bases. Cnrrogmpltico.
2 1:131- 139.
Bolstad, P., Gesster, P., Liltesm1d, T. ( 1990). Positional uncenainty in manually digi-
tized map dara. /111er11atioual Journal of Geographical bifomwrion Systems.
4:399-4 12.
Cl1ong. A.K. (1999). A technique for spatial sampling and error reponing for image
map bases. Phorogrammerric E11gi11eeri11g & Remore Se11st'ng. 65 :1195-1198.
Comber. A.J., Fisher, P.F., Harvey, F., Gahegan. M.• Wadsworu1. R. (2006). Using
meiadaia 10 link uncertaimy and data quality assessmems. Progress i11 Spatial
Dntn Hn11dli11g, 6:279- 292.
Congalton. R.G .. Green. K. (2002). Assessing tlte Acc11mcy ofRemotely Sensed Datn.
'Boe.a Raton: CRC Press.
DiBiase. D.. DeMers. M.. Johnson. A.. Kemp. K.• Luck. A.T.. Plewe. B.. We111z. W.
(2006). Geographic !11for111atio11 Science 011d Tec/1110/ogy Body ofK,,owledge.
.A.ssodaiion of American Geographers.
DiBiase. D.. Corbin. T.. Fox. T.• Francica. J .. Green. K.. Jackson. J.. Jeffress. G..
Jones:, B., Jones. B., Mennis, J .• Schuckman, K.. Smilh, C.. Van Sickle. J. (20 10).
TI1e new GeospatiaJ Technology Competency Model: Bringing workforce needs
into focus. UR/SA Jounml. 22:55-72.
Dunn. R.• Harrison. A.R.. White. J.C. ( 1990). Positional accuracy and mcasuremem
error .in dig.i1al databases on land use: an empirical study. lmernatioi,al Jo11n,al of
Geogn1pl,;cal !11fon11atio11 Systems. 4:385-398.
Fisher. P. (1991). Modelling soil map unit inclusions by Monte Carlo simulation.
!111en.,n1fo11al Joumnl ofGeogrnpl,icnl J,,for111ntio11 Systems. 5: 193-208.
Goodchild. M.f.. Gopal. S. (1 989). The Accuracy ofSpntinl Dntnbases. Loudon: Tay-
lor and Francis.
624 GIS Fundamentals
Guptill. S.C.. Morrison. J.L.(Eds.) (1995). Elemems o/Sparial Dota Quality. New
York Elsevier.
Hannel. R.D.. Smitb. D.R.. King. K.W.. Slade. R.M. (2009). Estimating stonn dis-
charge and water 9uality data uncertainty: A software tool for momtoring and
modeling applications. Em•i,-or,meutal ,Wodeling and Software. 24:832-542.
Heuveliuk. G. (1999). Error Propagation iu E1ll'iro11memal J\1odeUng w;,t, GIS. Lon-
don: Taylor and Francis.
Heuvelink. G.• Brown. J.O.. van Loon. E.E. (2007). A probabiListic framework for
representing and simulating uncertain environmental variables. lmeruarioual
Journal of Geographical lufon11atiou Science. 21: 497-513.
Hunsacker. C.T.. Goodchild. M.F., Friedl. MA.. Case. T.J. (2001). Spatial U11cer-
roi11ty ;,, Ecology: lmpllcations for Remote Sensing and GIS App!ic(llio,rs. New
York Springer- Verlag.
Jones. R.R .. McCaffrey. K.J.W.. Wilson. R. W.• Holdswonh. R.E. (2004). Digiral field
data acquisition: towards increased quantification ofuucenainty during ,geologi-
cal mapping. Geological Society ofl.cudou Special Publications. 239:43- 56.
Kasseube~. O.• De Jong. K. (2005). Dynamic euviro,unental modeling in GHS: 2.
Modelmg error propagatio11. /memotfo11a/ Jouma/ o/Geogrnpl,;cnf /11/ormorion
Science. 19:623-637.
LMIC (1999). Posilioual Accumcy Handbook: Using J/,e Natio11al Sta11dartl for Spa-
tio! Dnra Accumcy to measure oud repon geographic darn quallly. Miru1esota
Plam,ing. St. Paul.
Lodwick. W.A.. Monson. W.. Svoboda. L. (1990). Attribute error and sensitivity anal-
ysis of map operations in geographical information systems. J11remario11al Jo111··
110/ ofGeogrnpMcfll i11formMio11 Systems. 4:413-427.
Lowell. K.. Jaton. A. (1999). Spatial Accuracy Assessment: Umd b,fom,atiou Uuce,.-
rnimy ;,, Nawra/ Resources. Chelsea: Arm Arbor Press.
Tbapa. K.• Bossler. J. (1992). Accuracy of spatial data used in geographic infonnation
systems. Pltorog,.m11me1n·c Engineer'iug and Remote Sensing, 58:84 l-858.
Walsh, S.J., Lightfoot. D.R.. Butler, D.R. ( 1987). Recognition and assessment oferror
in geographic infonnation systems. Photog,·nmmetl'ic Engineerirrg and Remote
Se11si11g. 5.3:1423- 1430.
Chapter 14: Data Standards and Data Quality 625
Study Questions
14.2 - Can you describe processes or activities that are greatly helped by ct,e exis-
tence of standards?
14.3 - \Vhal are the differences between accuracy and precision?
14.4 - How do mean and frequency thresholds differ in the way they re!)on positional
error?
14.S - What arc some of the primary causes of positional error in spa1ia I data?
14.6 - Describe each of the following concepts with reference 10 documenting spatial
data accuracy: positional accuracy. auribure accuracy. logical consistency. and com-
ple,cness.
14.7 - Whal is the NSSDA. and how does ii help us measure positional accuracy?
14.9 - \Vbat are the constraints 011 lhe distribution of sample points under the
NSSDA. mtd why are these constraillts specified?
14.10 - \Vhat are good candidate sources for test points in asse.ssing the accuracy of a
spatial da"la layer?
14.11 - How are errors in nomillal anrib111e data often reported?
15 New Developments in G IS
Introduction
As every economis1. weather forecaster. connections. If our data are always avail-
or politician knows. predicting the future is able, we wilJ internet with tbem ditferently.
fraughl with peril Near.term predictions We can more easily see bow things should
may be safe: if times are goO<I now. they will be in the field. and compare them 10 bow
probably be good nexi momb. However. 1.he 1hey are, forexample, a wiring diagram for a
fanher one reaches into the future, the more roadside telephone interchange panel, or a
likely t~y will be wrong. This chapter building site plan vs. s1akeorn. Records may
describes technologies tb.11 may become be available at any rime. everywhere, offer-
11idespread. 11discusses future trends. with ing ins1am access 10 an agricuhural field's
the expectation that many of these specula· feniliza1io11 history. a water main's flange
tions will not wholly come tnie. size. or bridge's inspection records. improv-
Many changes in GIS are based on ing decisions in the field and streamlining
advances in compu1ers and other electronic data mana.gernen1.
hardware. Computers are becoming smaller Change is also due to increased sophisti-
and Jess expe1isive. This is true for boll, gen· cation in GIS software and users. and
end-purpose machines and for specialized increased familiarity and standardization.
computers.. such as mggedized. portable tab- Change will be driven by new algorithms or
let computers.. The wizards of semiconduc· methods. for example. improved data com·
tors continue to dream up and then produce pressiou techniques 1b.11 spe,ed the re1rie\'al
impossibly clever devices. Given currenl and improve 1he quality of digital images.
trends, we should not be suiprised in the Specialized software packages may be
furure if a pea-sized device holds all the pub- crafted that mm a multi-day. technically
lished works of humankind. Computers may complicated operation into a few mouse
gain personalities, recognize us as individu- clicks. These new tools will be introduced as
als. respond entirely to voice commands. GIS technologies continue 10 evolve and
aod rou1inely conjure 1hree-dimensional will change the way we gather and an.,Jyze
images that floal in space before our eyes. spatial data.
These and other developmen1s will alter bow
we manipulate spatial data.
Changes in GIS will also be due 10 tlte
growing ubiquity of hig)l speed. wireless
628 GIS Fundamentals
GNSS
Three irends wiU dominate GNSS im,o-
vation over the next decade: muhi-coostella-
lion and mulli-signal GNSS receivers,
miniamrization. and system integration.
Multi-GNSS recf'ivers wiU continue to t.ake.
advantage of distinct satellite constellations.
GNSS has been unable 10 pro,;de 10 cm
(subfoot) position accuracies in th.ick forests.
deep valleys, or city centers. Multi-GNSS
tracking systems exist now. supporting GPS.
GLONASS. Galileo. and the Compass sys-
tem have been developed. further increasing
accuracy, availability. and reliability. Future.
low-cost GNSS receh ers will conunonly
1
have hundreds of channels. and track tens of Flgurf' IS.2:A miuiarurizcd GNSS unit tbt11
satellites even when under heavy forest can· n1.i.v be embedded in 11 rnngc of d«tronic:
opies. in canyons. and among iall buildings. dc,:ices.
bringing real-time precise positioning to cen1ime1er range. New systems !integrate
everyone. boih in1proved hardware with suitable pro-
Dual channel GNSS chips at costs less cessing. Systems still require substantial
than S250 now pro,~de cemirue1er accura· modiflc~tion 10 become turn-key. easy,to-
cies. and prices will continue 10 drop (Figure use. and robust. but within the 11.ext decade.
15-1). Earlier inexpensive GNSS receivers cemimeter-level accuracies will be available
tracked a single frequency. e.g .. LI in the in real time., for tens of dollars in hardware
U.S. GPS system. and provided acctiracies cost.
in the few 10 JOs of meters range. Measure- GNSS receivers wilJ cost less, shrink in
me111s in a second freqmmcy allows reduc· size and weight and increase aocuracy for
tion of ionospheric and atmospheric iransi1 some rime to come-. and these improvements
errors. leaving smaller system and clock will spur even more widespread adoption of
errors 1hat are in cbe lens of centimeters to this technology (Figure 15-2). Microelec-
trouic miniamri:wtion is helping_ shape the
GNSS nmrket. As GNSS use grows and
manufacturing methods improve, single chip
GNSS systems have emerged. and these
chips are. decreasing in size. GN'SS chips
smaller tl1an a postage stamp are available.
including some that may be inte.grated into
common electronic devices. Many vendors
are well on a path to system integra1ion. and
it will become more common to embed I.he
antenna. receiver. supporting electronics.
power supply. and di!foren1ial corre1;1ion
radio receivers in a single chip or small cir-
cuit board. Some of these integrated sy>tems
figul"t' I S.. I ; lncpen,in· dm1 frequency chips pro· are smaller than most GNSS ant:ennas of a
,·idc cm-lc,·cl ae1:urac:i« to a broad array of decade ago. and systems wiU continue 10
devices (cout1cS)' Sp11rkfoo). shrink. A bunoo-sized GNSS is !lOt far off.
Chapter 15: New Developments in GIS 629
As receivers shrink in size and cost, it ers may be pasted. decal-lik~. on w.ind· .
becomes proe1ical 10 collect positional infor- shields by 1be lbousands, 10 1ransnu1 !heir
mation on smaller individual objec1s. While data bac-k 10 a traffic manag-emeat center.
GNSS is unlikely 10 help you find your keys. Ubiquitous. inexpensive. or Ire~ differ-
small GNSS receivers will collec1spatial
data (or smaller objects. For example. a (ew
ential correttion signals are substanually
improving the accuracy commonly ac~1eved
years ago i1 was uneconomical.to track
objects smaller than a cargo ship. Now
""'b GNSS. Many U.S. states and nauonal
governments abroad will eS1ablish more
tn1cks or con1ainers are rou1inely followed. complete coverage. Virtual Reference Sta-
In the near future. it may be common to tion (VRS) networks promise 10 allow sub-
track individual packages. metcr and even near-ccmime1er level
GNSS miniaturization means we will positioning in real time. Conunercfal solu-
directly collect much more daia in the field tions will be funher developed and made
than in times pas1. City engineers may smdy less expensive.
traffic panems by placing special-pwpose GNSS systems will add functions.
GNSS receivers into autos. How long does including the abiLity 10 take photos or videos
1he aver-age c-0nunute take? How much of and anach them 10 geograpllic feamres in a
the time is spent sining a, stop signs or database (Figure 15-3). The old adage "a
lights, and where is the congestion most picture is wonh a thousand words" may be
prevalent? How is 1roflic affected by modified 10 "a picmre saves a thousand
weather condj1ions? Analyses of 1rnffic ne1- hours.·, T11ese systems will greatly aid plan-
works will become substantially e.asier with ning. managemen1. and analysis by more
small-unit GNSS. Disposable GNSS receiv-
ftgurr t s..3: GNSS r«ci\'cn with buih~iu cnmcns c.:u,1
l,.c used to cccord di.e loc'-tion (A), im,gcs (B).11.nd 111tn-
butcs (C) of objects (COW1«)' TOPCON).
--
>
·-
·-
"-
•
·:---
··-
..-..-
.....
~
.,..
·--
·-·-
~·
•
-"'-
__
.,.,_.
'
·- --
·1----
-~·..
.·--
··-
•
~ -
630 GIS Fundamentals
easily providing images in GIS. For exam- of fea1ures (figure 15-4). This scanning is
ple. the type. reloiive location. and condition necessary because many fea1ures are modi-
of public utilities such as fire bydran1s may fied over time. for example. roads are
be described witlt both photos and alphanu- changed, buildings are extendecll, exlra sup-
meric data collecced in a daiabase. Ifa work pons may be added 10 1owers. or oil refiner-
order is required ,o re~ir a hydran1. a photo- ies may be re-plumbed. lnven1ories mus1 be
graph may be 1aken in the field and tagged to updated to record the features as built. ralher
the work order. This photograph may be 1ban as designed or observed dusing lhe pre-
inspec1ed to verify lhe l)'J)e ofhydram.10 vious inventory. A three-dimens:ional scan-
perhaps identify the tools needed for repair. ning laser may be combined with a precise
or to recognize which specific pans are GNSS receiver to measure the X. Y, and Z
required for mallnenance. coordinates of importan1 fearur~. The
GNSS is used to detem1iue the l,ocation of
1he scanning laser. The horizomal and verti·
Fi xed and Mobile Three-Dimen- cal offsets from the scan poim are measured
sional Mappi ng by IJ1e laser. These measurements are com-
GNSS is also being combined with new bined with coordinate geome.try to c.alculate
advances in ground-based laser scarming to 1be precise posi1ions for all features scanned
increase lhe scope. accuracy. and efficiency in the field.
of spatial data collection. Three-dimensional The 1tend of nmlti-tecbnology integra-
scam1ing devices have been developed that 1ion wiH accelerate for rapid. cemime1er-
measure the borizomal and vertical location level mobile mapping. Multichannel GNSS
Fi&n"' I S-1: Mllppin_,g $)"lieus coinbiocd \\ilb ON$$, 1hrce•di111c1~sio11, 11l im11gjnJ ~sen, op1ic:)I i.m:.ging
thr«-dimcmiowal wc.uUfflneots in real
')'Stems. aod other ntC3.M'Cllll¢1ll $)"Siems to pro\·idc iutcgntcd
tuue (courtesy Apple).
Chapter 15: New Developments in GIS 631
combined with other positioning systems travel from the device to the database with as
will provide higl~y accurate locations. and liuJe human inter\lention as possible.
three-dimensional laser scanners and 360 Automobile systems would rely on muJ.
degree image-data collection will allow tipie technologies. A GNSS would locate o,e
faster collection of X. Y. and Z coordinates. vehicle to within a few tens of centimeters in
Combined with GNSS systems and real time. Three.dimensional dala on road
mounted on mobile platfonns, three-dimen· centerline, edges. curbs. adj.aceot poles. and
sional laser mapping systems will collect other imponam features would be identified
highly accurate data accessible by anyooe for the trajectory ahead. When combined
with a traffic-enabled GNSS. Approaching \\~th an on-vehicle laser sca:nuer, the system
drivers may be forewarned. 1ravel limes cal· may identify moving automobiles and distin-
culated and new suggested routes identified. guish them from unexpected s1ation.1ry
One can imagine self.driving automobiles objec1s within the roadway, or other changes
that na,igatc via a combined GNSS/LiDAR/ in conditions. A combinatioll of LiDAR.
GlS. using systems co avoid collisions via RADAR. and cameras may help identify
real·time distance measurements and wire· objects. and compare their location 10
less conuim1llcations with ·'nearby" automo- e.xpected features in on-board spatial data.
biles. Automobiles may be reliably identified.
Such mobile systems will help improve Aids. such as virtual illumination on the
the currency and accuracy of digitized trans- windshield. may be used to highlight other
portation net\·vorks. and anything visible vehicles or road edge wben visibility is poor
from them. Every road can be digitized (Figure 15-5), flag upcoming hazards or
while driving;. as well as every building. rums. or warn of unexpected conditions. The
light pole, sign, bench. tree, or any other autos could use mapped information on
11iree-dimeus;0o,I sli\lcrure visible ftom shoulder widtl~ ne~rby off-ramps, the road
them. Effons will move from a focus on the ahead, or other strucrurnl infonnation to exe-
development of integrated, tum.key daia cute 1he appropriate driving maneuvers and
collection systems to sofavare and methods avoid accidents.
that automate workflow. so that data may
Fi;u." ' JS.5: C:iDAR Ill~ oth~ cL,t.11my be ,;em;cd and displ~ycd in nul0$. impro\iog dri,-c:r &11.fe1y ruid trip
eflic1cucy 31 lllgh.1 and under u1dcm(UI we11th-et (counesy NVIDIA),
632 GIS Fundamentals
Ground Based Positioning tions., and are similar 10 GNSS iu using the
same basic principle of range measurements
Ground-based positioning sysrems may and 1riangulation (Figure IS-6). Each
find increased adoption. Although GNSS ground-based station transmits a coded sig-
systems provide remarkable positionjng nal. wbkb is decoded in a receiver 10 calcu-
information. they :ire limited. GNSS signals late a range. and then precise location.
can't pass directly through most solid Centimeter level positioning is possible. and
objects. Buildings, mountains. and dense these gro1u1d-based systems may be used
forest canopies entirely or panially block iudepeudemly or in conjunction with GNSS
GNSS signals. yielding a reth,ced set of positioning. Ground-based an1ennas transmit
obseniations and :reduced spatial accuracy. signals !hat are orders of magnirude stronger
or at wors1, loss of position. GNSS doesn't than GNSS (Figure 15·7). l11is greaily
work in many indoor loca1ions. enhances reception in sub-canopy enviroo-
Signal strength is one limitation of men1s. Since transm.iuers may b.e small,
GNSS systems. Satellite launch coos1raints dettse tleploymems across high buildings
force the use of relatively low-power trans- may effectively remove the urban canyon
mitters. and transmission distances are quite limitations.
large. funher diss ipating energy. Signals at Separate systems have been proposed
the receiver are often weak. and difficult to for indoor positioning. These fuse a number
distinguish from 111ulti-path transmis-sions. of technologies. including Bluet-001h and
Gro1u1d-based positioning services are WiFi beacon measurements. and '·dead reck-
under development that solve many of these oning,., using a physical orelectronic gyro-
problems. These rely on a set of distributed scope to measure distance and direction
transminers and precisely surveyed loca- traveled from the last know11 loC11tion. and
Fi&utt t S.6: A ctiagr,1m 11lwtr.1ting g,o1.1ncl-.b:uC'd posi1ioning 5\'$le11u., J11C:re uM:d i.n conjuuction wilh ONSS
satellite&, Si.mult.anc:ous ntC-11$\lttmcnta to tcrT«trial traosruittef'S aUow precise positiooing. with strongtt sig-
nals. pol«i.tially wore robus1r«cptiou. ai,d th(' ability ,o reduce siinal obstmctiou iu utban ot mouutaiuous
C'll\•irOIU1t('Clti.
Chapter 15: New Developments in GIS 633
Datum Modernization
New darum 1'1'aiizatious will be released bOlb the time and location of the respecti\'e
in the ne:\.1 few years for U.S. territories, measurements. Further. lhe calculations
based on improved measuremen1s and a require we establisha s1a1ionary reference
change in the basic model and methods for frame against which 10 measure points.
danm,s. Dan11ns i;n the U.S. have been based Because of differences in our starting point,
on a no11-Eanl1 ceutered ellipsoid to maintain and in how we account for c:ms.t..al move.
compatibility with previous systems and mem. diere are large differences between the
measurements. Tbe disadvantages of this NAD83 family of dan,,ns used primarily in
system now outweigh the advantages. and so North America. and the ITRF datums used
a new official darum system. originally pro- by most oftbe rest of the World .
posed for Nonh America in 2022. bu1 now The [TRF is an Eanh-centered system
likely for 2024 or later. based on measurement< of the X. Y. and Z
As described in Chapter 3. the conti- locations and velocities of points. It places
nents are moving about the Earth on plates, tl1e origin of !he adopted ellipsoid al the best
sometimes at rotes exccedh1g 2.5 cm (one estimate of the cemer of mass of the Earth at
inch) a year. Ovea several decades. ~,is drift the time of each adjustment. The post-1986
leads to changes i.n the rel.alive positions NADS3 datums are similar in that they are
among points on different plates (Figure 15- centered on an Earth model. In contrast 10
8). In addition. geodesists must factor the the ITRF system. 1he NAD83 danims have
tot.al amount of movement into their devel- nOI adopted the best measureme.ats of the
opment of datums. because measuremenls Earth's center. but rather a position compali·
have been made over severa I decades. so the ble with older NAD83 danmts. This center
relative position of any 11:x:atioo depends on assigned a value relative to aver.age. cmstal
Fig11rt' tS.8: \'doci1y ,-cciors of 1Jte Enrib's surface, me21ourcd :u ITRF sm1101a 11crou tltc globe. Nole tbe
rclati,·ely high \'cl,oc1ties of the c:Mlcrn P~ci.fic: Pl~te Md \'ariousdi.ffercnl directions ofO\'eraH plole tra,-cl.
These: mow:rucn1s most be factored into future datum ~Jiz.atious (with pcnuissiou Ji. Dr<wcs.
w,\'\\s$irgas.orgl61eamtiufdoc:sll3olcciucst.?Ol 7_Orcwu_AC KIM_based_e1,_ l'1Rf20l 4_lAO_K.obe.pdf).
Chapter 15: New Developments in GIS 635
velocit'ies on the North American tectonic mation between lhese two systems will be
plate. raiber than a globol network. There produced. and time dependent. It will likely
were many good reasons for maintaining 1he be in the fonn of an inirial X, Y, and Z traJls-
old origin, primarily because it maintained formation. and then a time-dependent piece
coordinate compatibility witl, older NADS3 in which the time difference between input
danuus. could be deployed rapidly. and was and output damms is accounted ior. 10 incor-
integrated with a relalh-ely dense network of porate relative co111inental ct:rift.
continuously opera.ting stations ·.vithin Nonh The tcn·year plan also describes the
America. process for improving the ven-ical datum for
With the development of the ITRF, tlte Nonh America. lmprovemeuts will be based
active participmion of the NGS. and tlte inte- on gravi1y measurements accross the hemi-
gration of CO RS stations into the ITRF nel- sphere. accouming for changes in gravity
work, there is strong impe.tus to luim1onize fields over time. and also for the rises in
datums in Nonb Ainerk:a with imcmational mean sea level. Much as with the horizontal
efforts. Improved global measurements will datum. it wiUbe based at least in pan on an
suppon more accurate horizontal and \'erti· integrated, global set of satellite-based mea-
cal danun development. and help suppon suremems. and tied 10 ITRF measurements.
precise. rapid, centimeier·level positioning The new venical danim wil.l allow the calcu·
worldwide. lation of heights tied to a measurement
As noted in Chapter 3. there is a plan to epoch.
update 1he cL-itwns used in Nonh America.
with the in1rOOuc1ion of the Noni, Americau
Te,reshinl Refere11ce Frame of2022
(NATRF2022). This will initally remove
much of the positional differtnce between
ITRFiWGS84 and U.S. datums. It will entail
a shift. often 11p to two melers (six feet) in
NAD83(20 1L) coordinates to NATRF2022
coordinates.
Current plans fix the NATRF2022 to
the Noni\ American and related tectonic
plates. This will make position shifts small
due to continental drift, and so generally
result in stable locations across time over
most ofNortb America. This is different
than the ITRF system, which generally holds
the average shift across all tectonic plates to
be zero. tltere1ly distributing shifts across the
globe. The ITRF and NATRF2022 positions
wiU diverge over time. The darum trn.nsfor-
636 GIS Fundamentals
Flautt IS..9: An iJnaac from the Wortd,icw-.3 s:udlitc. Ulkco over Washi.iia.1on, O.C. This s:a1.dli1c pro-
duces up 10 25 e.1• rcs:Olutioll inugcs. tu.d it soon to be joi,~cd by otbus of tbc umc bi$:b ttto11.11ion. We
mayaoo11 ban: daily. global. higb·tt$oluiion im~gcs (court")' Oig.italG!obc),
Chapter 15: New Developments in GIS 637
image. up from the common 1wo meter reso· 4 Three-Oime.nsional GIS ,,~II be more
lmion a rew years pas1. The USGS and other widely developed and practiced due 10 data
organizations are providing subfoot resolu- provided by three-dimensional. ground or
1ion images, perhaps nationwide. J.ndividual near-ground level liOAR (Figure 15-10).
light poles. cwtbs. and even porkii\g lot LiDAR systems carried on foot. ou autos or
cracks may b,e observed in these images. drones. or in high-flying aircraft will pro,,ide
rendering them a rich source of spatial data. feature x. Y. and Z coordinates from various
LiDAR data are another example of venical. oblique. and horizontal perspee-
improved remote sensing. l iOAR systems 1ives. When combined these allow truer
are increasing in accuracy and resolution and lhree-dimensiona1 characterizations of
declining in cost with coordiu.ate data com 4
space. Data development~111anagemem.
monly paired with digital aerial images. analysis. and \!isualizatiou is curremly tak 4
Commercial LiDAR S)''Stems in the recent ing place across architecture. CAD. survey 4
past collected a data point every few square ing. and GIS sof~vares and disciplines, and
meters. while current systems routinely col· substantial development and fusing across
Jee.ts several samples per square meter. Soon. these discipliues is in the of.ling.
tens or buodr«ls orsamples per square meter Advances in fuU-sized and mi,riamre
will be comn,on. allowing unprecedented aircraft (Figure 15-11) are leading to
spatial definition. inc.reased availability of a broader range of
LiDAR data are dropping in cosL and aerial imagery. Pioneered primarily by
county to statewide LiDAR mapping will NASA decades ago. this teC:hoology is mak-
likely become commoa Fusion of LiDAR ing the Je.ap to broad conunercial applica.
data with other image and spatial data will tion. Some experimental UAVs may Oy
conl'inue, and surely create new opportuni- raster and turn tighter than many planes car-
li~s ai!d appliea1ious. rying human pilols. Specialized payloads
may be carried cheaply on 1l1ese crafts, for
Fla:.-tt t S..10: A pbotogrnpb (upper Id\) and tbttc-dimcmio1ial LiDAR &ta sci. rccordutg pr-cci.s.c cootdi 4
n:l.lc& n.12 \'ct)~bigb ~sotu1ion. Titcic a>ordi1utc d:tL<t ,ire uicful for i1up<c1ion.. cugi.1~cain.t;, aud 11Ui.it1c•
llllnl:C, 11ncl wc:rc u,ually 1,103vail11blc. or ql!itc c:<pctUh·c, prior lo LiOAR. (co~irl«)' Land~inl).
638 GIS Fundamentals
.· ..
,•
fi&1U't lS..11: UAV's lilted with camcm, ru.d adYan«d ONSS may collect s1.1b,.cen1itueter ai.cunq· spatW
data froru LiDAR 3:00 aerial i.Jn113ery (courteS) ' Gcodctic.t and Vclodyite).
long periods of time. and in more dangerous ditioos. UAVs ha,•e comple1ed transoceanic
conditions than in hum1n-piloted
.. aircraft flights with guidance only at takeoff and
Small craft may be deployed quickly. and landing. and UAVs are routinely collecting
have delllonstrated ultra-precise three· weather. water quality. and olher environ-
dimensional measurements at very high res- mental data.
olu1ions and for site or small, area-specific There have been parallel effons to
applications. reduce the size. weight. and improve the
There is currently intense development ac.curacy ofLiDAR systems. as most current
of smaU helicopters or airplanes outfiued sensor systems are too large and heavy for
with cameras. Lif>AR. and GNSS position. most small tiAVs. LiOAR systeon weight
control, and telemetry electronics. This increases UAV size. Limits collection time,
allows preprogrammed Oight paths and on· and increases 1be hazards of collection. but
board "intelligence" to lllodify steering h1 there bas been great progress in sys1em min·
response to wind. rain. or other in flight con- iaturization, which should continue.
Chapter 15: New Developments in GIS 639
Fl&m't' 15· 12: An c:x.amplc of c.,;tre:mcly hiab-rc'°huion LiDAR and iiutt,c coordinate dtt1a. This: is a combi·
rl3tion of multiple iinll~ , icwi 10 Cr<tne 11 l1U'C"c-dii1muio11al d.'IUI sc-1 of Mi lll'e'.ll on 1J1c U.C. Bc-d:clC)' cam·
pus. Thc1,c arc- rc11l d,ua., ;md not gc1.1cr.1.1cd nninution,. cn:ntcd &om subceutimcter,rcsolutio11 X. Y, Z <Utn..
bna~c data pro\ide color iufomiatiou aod Li.DAR data localiou au tC:\:t\tre for CY«)' poin1. allowing lifelike
rcoditioos fun any location. Rcs~b seeks to o,·crcoruc da1a coUC'Ccion. stotajc, and ~ cuiui; li.millll·
tions, .1odllt:t such JUI d1csc 11rc routi:nc:lr used iu the l'IOMOO·dist:mt future (courtesy A,·1ddi Zakhor, U .C.
8crkdcy Video "nd lmngc Ptoccn iog lnb).
640 GIS Fundamentals
representations, and combined with exteriors ing model. These provide more than virtual
to create a fulJ coordinate and color record of walk-lhroughs. as usefol as lhese are, 10
building interiors and exteriors. allow distance. area, and volume measure·
Software vendors are developing da1a ment, fixed and movable asset management.
models and wOlkOows 10 use 1bci;c da1a. For and engineering, en\rironme-mal conditfon·
example. ESRI has inlroduced an indoor ing. and remodeling calcnlalions.
mapping produc1 lhal supports development.
analysis. planning. and ou1p1n in a 30 build-
642 GIS Fundamentals
Cloud-Based GIS
Cloud-bast<J GIS provides data storage. ously. always accessible. Large server arrays
analysis. and display capabilities over the may be outfitted with proper backup and
internet wi1h services usually provided from protec1ion. including da1a mirroring at dis-
one or multiple remote locations via the tinct locations. Mirroring pro"ides data
internet. Cloud-based computing includes rechmdancy. because lhe same data are
broadly accessible internet mapping applica- stored concurrently at different physical
tions. but also includes darn storage and a facilities. often miles or even countries apan.
full suite of software-supponed analytical !fa fire. flood. orotherdisaster'befalls one
capabilities. something that has generally data server. the mirror image is likely to
followed a local or private network architec- remain intact.
nire. During the recent past. GIS software Internet mapping is perhaps the simplest
primarily re.sided on a local orclosely net· and most common fonn of cloud-bast<J com-
worked bard disk. and ran on the cemral pro· puting. Many internet applications allow
cessing unit of a local compmer. You users to compose maps on a Web page. Tl1e
downloaded data to a local hard drive. and individual user has some control over the
purchased software to physically install ou data layers shown. tile extent of the mapped
the local computer. although the software area. and the symbols used 10 re,nder the
may have referenced licenses or other map. The internet is different from other
resources on a (us ually) proprietary server technologies because it allows a wide range
on another compmer. Cloud-based comput- of people to cus1om·produce. maps. Each
ing envisions maoy of these resource.s pro- user may choose her own data and cano,.
vided from distant sources. with the local graphic elements 10 display. The user is
compuiers pertiaps serving only as adisplay largely free from any dara de\lelopment
and command entry ponal - da1.a, software. chores and thus needs to know \-~el)' liule
and processing may all be elsewhere. abom d.ita entiy. editing. or the paniculars of
Cloud computing has many potential map projec1ions. coordinates. or other details
advantages. There may be a lower to1al cost required for the produc1ion of accurate spa-
of ownership. because you may only need 10 tial data. TypicaUy. the map itselfis tlte end
use a set of software occasionaUy. and cru.1 product. and may be used for illustration. or
pay as you need it, rather than a fixed price to suppon analyses that will be performed
irrespective of total use. Economies of scale emirely within the user·s bead.
in data storage and maintenance or in com- These internet mapping applications are
pu1ing power may be favorable, as well as panicularly appropriate when a large num·
the centralization of specialized technical ber of users need 10 access a limited number
suppon. Additional capacity lllay be added of data layers to compose maps. The iutemel
as needed, as marl.el share grows, orspec.ific users may select tl,e themes. variables, and
project demands increase. Resources may be symbolization. in contrast 10 a static map
scaled up or back as needed. New capabili· graphic. in which a website canogmpher
ties may be rented or 1ested OJore easily. and defines the propenies of each map.
soflware functions may bener adopt a rental
model and pricing stntcnire. Because most internet mapping applica-
tions are built for users who have little
Cloud rnmpuling may also provide knowledge of spatial data. maps.. and analy-
faster. broader. safer. and more continuous sis. ,be suite of spatial operation1s allowed is
data access. Internet connections are increas- usually very sparse. Most intemet mapping
ing in speed. and solid-state memory in large is currently limited 10 creating simple dis-
installations prov;des faster access yet. plays. This is changing. as query. distance
LaJge se"1er faciLiries may be on continu-
Chapter 15: New Developments in GIS 643
flU1ctions. and basic tools are provided. While this may be overcome to some extent
albei1 in veiy simple fonns. wilb local caching. a111icipa1oiy download·
As noted in Chapter 7. web mapping ing. for exan1ple. pulling data ui a wider area
than lhat immediately viewed, and other
services are another step toward a cloud·
software techniques, but within limi1s. In
ba.1ed GIS model. Data arc sto~ wme·
where "on the cloud:· and a specific link to mru.,y instances there is oo s.ubsti11ne for
1hem provided. Oa1a are accessible after extremely fast in1emet speeds. Widespread.
forging a connection. as if1hey were from fast internet should be forthcoming. but
any other disk source, at least as far as the access may depend on intemet demand as
accessing software view. The GIS program well as supply.
doesn•, dis1inguish betwttn local and cloud To date. analytical tools delivered o,•er
data once the connection is made. lhe internet are still quite mdimentary. and
This brings up one major linu,aiion of may likely remain so for some time. Robust.
1he cloud model. i1s dependeoce on a fast correct opera1ion of an analytical 1001 is dif·
connection to the cloud of resources. Slow fic ult to provide in many instances. and
internet spttds become a hindrance. panicu· requires a sizeable investment Systems fo r
larly widi large data or image se1s 1ha1char· delivery. payment. and protection for a
acterize many spatial analyses. Each zoom. broad. interacting suite of geospatial tools
pan. or layer addition may require a scene to will take development. both technically and
be repain1ed. involving the movement ofbil· culrurally.
lions of pixel:s through the web connection.
Open GIS
Open Standards for GIS larger user com1mu1ity ben"fits when there
are no technical barriers tha·t inhibit the free
Open standards in computing seek to exchange of data and methods. Open stan·
reduce barriers to sharing programs. data. dards seek to establish acoDlJlloo framework
and information. Spatial data structures may for representing. manipulating. and sharing
be veiy complex. perhaps more than many data. Open standards also seek 10 pro,ide
other kinds of data. Data may be raster or methods for \'endors and users 10 certify
veclor. real or binary. or represent point, line. compliance with the stan<lacd. Standards
or area features. In addition, different soli· have been developed in a number of endeav.
ware vendors. may elect to store their raster ors: for example. tlte ISO 9600 specifica·
imagery using differeJll fonnats. and data tions for physical storage fonnats allow any
may be delivered oo different physical manufacturer. data developer, or user 10
media. or fonnaned differem ways. If a per· build. read. write. or share data on hard
son orders an image in one fom1at, bu1 ber drives. optical disks. tapes. or other storage
computing sys1em does not suppon the devices.
physical media on which the data are writ·
Businesses and many ocher orgaaiza-
ten. or does n.,ot tu)derstand the file structures
1ions by their nature have a proprietary inter-
used 10 s1ore the image, then she may not be
able io use ibese data. Incompaiible systems
\'St in the spatial data euuy. ·storage, and
methods they produce. Many vendors star·
are generally described as non-interoperable.
and open standards seek to remove this non. vive by the revenue their GIS products gen-
interoperability. erate. and so have a strong tnterest in
protecting tlteir iuvesunent~ and intellecrual
The development of open srandards in property. However. the developers also may
computing is driven by the notion diat the
644 GIS Fundamentals
spur adoption of their GlS packages and There are open source soflware of many
speed up the development ofcomplementary types. from operating systems to word pro-
software by making the internal workings of cessors. and induduig GlS. Open source
some portions oftheir GIS packages public GIS software projects are directed al a range
knowledge. for example. by publishing the of applications. and notable examples
data stmctures and rom,ats used to store or
include the development general-purpose
their spalial data. Thus. these vendors also GIS (e.g .. GRASS, FMaps) to specific utili-
have a strong interest in making pans of ties (e.g.. MapServer for Web-based spatial
their system open. to tile public. data display. query. and analysi~l or toolkits
Open <tandnr<ls for spatial data are the to support GIS software development (e.g..
responsibility of tll<, Open Geospatial Con· GOAL, sbapelib).
sortium. T11e Open Geospatial Consortium Open source use is a large and growing
has developed a framework to ensure phenomenon for many reasons. High soft.
interoperability. They do tliis by defining a ware costs are driving many org.auiza1ions
general. conuuon set of base data models. toward open source software. Ucenses for
types. domains. and structures, a set of ser- some commercial products are tens to hun 4
vices needed 10 sbnre spatial data. and speci- dreds of thousands of dollars ancually for
fications to ease t.ra11sla1ion among different some large organizations. If these organiza-
represenlations that are compatible with the tions employ staff programmers.. open
Open Geospatial standards. Data developed sour<:e GlS may meet geoprocessing needs
by a civil engineer and stored in a raster for- at a reduced cost.
mat on a Unix version of Arc/Info should be Combiued GIS and analysi$ has grown
readily accessible 10 a soil sciemist using partic.ularly in the open source S'tatistic.al
GRASS on an OS-X Apple system. package R. Abroad range of spatial data for-
O~n standar~s ill GIS art relatively illalS m supported. iileludlng slia~filts.
new. Whlle most ,of the large software ven- geopackages. geodatabases. common raster
dors. data develo~rs. and government and fonnats. government and ISO standard inter·
educational organizations are members of change formats. and standard iniage types.
the Open Geospatial Consonium. some so that there are low barriers 10 data transfer.
components of the standard are still under Packages for direct access to general and
development. In the future, there will be specific spatial databases are freely avail-
increased emphasis on c-0mpliance to the able. Given the tl1ousands of R dc,•elopers
Open Geospatial standards. and open code fonnat. abnost e\'ery analysis
method available in proprietary packages are
also replicated in R packages. pl us thou-
Open Source GIS sands more. WIJile R exhibits many disad-
Open source :software is differe.nt from vantages relative to commercial GIS
mos1 other software in that it is distributed software in production or lnrge-org,,nization
free. along with tile source code. TI1e open senings. due 10 its lack of integtation. steep
source organization (www.opeosource.org) le.aming curve. often relatively slow execu.
requires that the software is not by design tion times. and complexity. it of1en provides
restricted 10 a specific operating system or many advantages in a broad range of specific
other technology, that there can be no royal- analyses.
ties, and that there be no explicit discrimina4
be more efficient and less expensive in ru1 on a standard sel of map projections, and we
open source environment. will combine the spatial data of various
Open somc.e use is expanding in many classes of entities to sol\•e spatial problems.
countries bec-.ause of specific governmental This book is an attempl 10 provide a founda.
iniriatives. China. India, and Brazil ltave all tio11 to effectively use spatial aualysis tools. I
supponed open source software in general, hope it has provided enougb infonnation to
and operating systems in particular. to main· get you started, and has sparG,.ed your interesl
tain independence from foreign fimis. in learning more.
reduce costs to government and local busi.
ness. and develop local infonnatiou technol·
ogy expenise. Because tltese nations are
home. to more than a 1.hird of die world's
population. their ac1ions alone. are substan·
tiaUy increasing the use of open source G!S.
A Hybrid Model
Proprieta,ry software ven<lors may adopt
a hybrid software approach, where they
interact with open software and systems.
This has take11 many guises. Some may sitn·
ply suppon s1andards. and ensure their sys-
tems may access aud generate industry
standard data forms. But a fuller approach
provides the code in a mi~ of open and pro-
prietary pans. Base code may be provided
free. with a charge for extensions or some
set of additional capabilities.
Alleroately. there may be charges for tl>e
base code. but enough source code or adher-
ence to open :standards that open source
extensions can be easily added later. This
aJJows for the development of an "ecosys-
tem'· of extension around a base application.
both proprietary and open source.
Summary
GIS are a dynamic collection of concep-
nial models. iools- aod methods that use spa-
1ial dala. As such, they \\~II continue to
evolve. What becomes srandard practice in
the funire may be quite differem from the
melhods we apply 1oday. However, the ftm.
damental set ,of knowledge will remain
unchanged. We will still gather spatial aod
amibute data,. adopt a spatial data model to
conceptualize re11;Jworld entities. and use
map coordinates to define positions io space.
The coordina·1es are likely m remain based
646 GIS Fundamentals
Suggested Reading
The \Vorld Wide Web is the best source for information about new developments and
trends in spaiial da1a acquisilion. analysis. aud ou1pt11. !11 oomras1 10 previous chap·
1ers. nearly all 1he suggested readings are websites. We apologize 1bat many links may
be short-Lived. but ~ie reader is directed to search for similar and additional sites for
the most current infonnation.
www. gislounge.coru
www.nasa.gov. general NASA entry point
www.gis.com. an ESRI . .sponsored website, general informalion
""~v.u-blox.com/en/blogs/insights/live-key-trends-gps
Chapter 15: New Developments in GIS 647
Study Questions
Wltich of the described new lechnologies is likely to have the largest impact in
1s..1 -
GIS over the next five years? Why?
15.2 - Wltat are areas of spatial data emry. analysis. output. or storage lb.at are in dire
need of innovation or new and better methods? What is a major bottleneck to for-
ther advancement of spatial infonnation science and technology?
15.3 - What is Open Source GIS? How will this change spatial computing?
648 GIS Fundamentals
649
Appendi x A : Glossary
Terms used In GIS and Spatial Data Development and Ana lysis
A<'cm'flcy: The 1leatness of an obsetv;uion or eni.mtne to the ,rue vah1e.
Active t'1'mote, Sfflsingsystem: A system that both e mits energy and records the energy returned by
t.1J1e1 objec1s.
Adapth·e samp ling : A method to increase sampling efficiency by increasing the spatial sample fre.
quency iin areas with higher spati.'ll variability.
Adjac,tncy: Two are.a objects that sha.re a bounding lint are topologically adjacent.
Affln<' roo1•dffila1t- rnu1srormAdo11: A se1of linear equations used 10 ttMS{otm froa~ one Cattesian
coordinate system to another. The tr.ausform.1.tion applies a scaling. translation. and rotation.
Alm1rnac: lmpcmant $}'$fem infom1atio,1 serit by each GPS s1uellite. :i.od recorded by a GPS
receiver to ob1ain current satellite health. constdlation status. aud other iufonnation helpfuJ
for GPS positioniog.
ApplkAtion se.,·, •er : a middle tier iu a common databas:e architecture. that p..1SSes requests fordnta
from higher 1icr. user interfaces, to die database storage and se.rve.r tier at lo,ver levels.
Arc: A li.ne. usually defmed by a sequetlce of coordinate points.
.ArcGIS: A GIS software package produced by Environmental Re:se~ch Sys1ems. nnc., of Red·
lands. Califorui;,,
Area fe::i1111'f': A polygon. collection of coniiguous ras1er cells, orothcr representation of a bounded
lltCtl, The fe:m1te is charac1erizt<I by a set of :mribu1es Md b..,s an in.side Md an outSide.
ASCU: American Standard Code for Jnfomtation interchange. A set of numbtrs associated with a
symbol 11.1sed i.n infonn:uion storage and processing. Nombers are bet\veen 0, and 255 and
way be Iq>rcsentcd by a single byte of data.
Aspl'ct: Tiie direction of sceepest descent on a terrain smface.
Almo5Pberic d elay: A change in the speed of light. and more specifically GNSS signal sp«-d.
,vben p:1ssing through the atmosphere. ·
Alruospberic distortion: lm;,gc displacemcul due lo the bending of ligh1 as it passes through the
atmosphere.
A11ribu1e: Non-spatial data associated with a spa1iaJ feature. Crop 1-ypc, value. add.ri.'Ss, or other
infonnuion describing the characteristics of a spatial feature are recorded by the anributtt.
Autornd G-e-o.o.--pi.tild: A suite of GIS software. systems produced by Autodesk. (uc .• of Sau R.3.foel.
Califom.ia.
AVBRR: Ad1-•Mct'd Very High Resoluton Radionieter. A disieontinued sateUitc system nui by the
Nationa l Oceanographic and Almosphrric Administration to collecl visible, tJ1ennal. and
i.nfmrM satellite im,1.ge-s oftbe globe eaeb day. The systt111 had up to a I. I Ion resolution.
and was: the earliest S3lcllite with daily global coverage, useful for lauduse mapping.
8::indwldtb: A parameter used in kernel mapping 10 affect the it10uence. orspre3d, of each poini
observation.
8:i.se- stallon: GNSS recording sm1ion over a prec-iscly smveyed location. used in differeotiaJ cor·
rectiou.
650 GIS Fundamentals
B{'AtOn rtctlvt1·: A GNSS receiverc3p.'1ble of decoding beacon base s t~tio11 signals 1~nsmitte-d by
the U.S. Coast Guard btacon stations.
BNuing: A. direction. usunl spe,cified as a geographic angle measured from sorne b:ue I inc. e.g.. true
north.
B•ID011, AChin•s GNSS sys1•ms
Bench mark: A m-onumcnted. precisely surveyed location for which coo.rdinates m known 10 a
h.ig.b degree of accuracy.
Bi.linen.. iuterpollltiou: A method for cakuh'llin_g values for a grid location based on a linear combi·
oation of nearby grid values.
Bi.oar)' dassiDcatlou: A classification of sp.itial objects into 1wo cl.Asses.. typically denoted by a O
class and a l class.
Bina.I")' numbe1·: a oumber presented in numberiug sysiem with two possible digits. 0 ~ud I. with
succcssivccolunms in po,,.crs of two , e.g.• I represents I, JO represents 2. I l represents 3.,
100 represents 4. etc.
Bi.n~ry 01,erafiou: A spatial op1:ration with two inputs.
BLOB: a Bitlary Large Object. generally a non-SJ>:ttial digi1t1l obje<:t or filed stoted in or linke<l 10 in
a spatial database. e.g., and image or video.
Bir: A binary digittt A bit has one of'rwo values. on or off. zero or one. Thi$ is cbe smt\ll esr mti1 of
digital info1;mation s1orage and the basic building block from which all other comput~r data
and prog,n.ms are represented.
BooleRu :dgebra: Conditions used to selee1 features wjth set algebraiccoudit:ious. iuch,dingaud. or.
n.nd not conditions.
Bouudnry gener111lizM.lou: Simplific:uion of the ..tr'Ue" boundary lines that defwe foatures d·ue 10
the inability lo record every point along a boundary. Some sampling must occur in any non·
mathelll31ic-_.,11y defwed boundary. and a strffi.ght line segmem often does no1 fully represem
cun•es betwttn the endpoints.
or
Buffer: A bt1ffer atta is a polygon orcollec,ion cells th31111e wilhin specified proxJmities of a set
of foatures. A buffer operation is one that creates buffer areas.
Bundle adjustmeur: The simultaneous removal of geometric distor1io11 and produc1io1:i of onhopho·
tographs from a nmnber of nerial images.
8~1P: A unjt of compu1ers1omge consisting ofS binary digits. Each binary digit may hold a zero or
a one. A by~e may store up 10 256 diffe.rtnt Vlllues.
CIA c:ode CPS: Coarse acquisi1ion code, n GPS signal used for rapid. relatively low~accuracy posi,
1io1uil estiin:ttts. Accuracies \vithom fo.nher corrections are 1ypically from a fe,.v 10 tens of
meters.
CAD/CAM: Compmet Aided Design/Computer Aided Mapping. Software used ptimilrily by
design engineers and utilities managers to produce two and three dimensional drawings.
Related to G LS in that coordina1e infom1anon is input. manipuL,ted. and output. These sys·
tems often do uo1 store ma~pro)t"etcd ooordinates. aud do no1 have sophisticated attribute
entry and manipulation capabil.iti.es.
C•d•srr•I: Wilh refereuce 10 property lines or ownership. for example. a cadaslllll laytr un,nUy
contains property Jines, and a e-adastml survey is the stuwy of property lines.
CAndidRte kty: A cohuun orcolunuis in a ~ lalioual t.'lblc 1ha1 meets We requiteuiems for., key. pri-
marily tbat it uniquely identifies every row in the table.
CA...-ltr--phAst GPS: Relatively slow bu1 accur,ue signal used to esti.ruine position. Position mny be
determined to within a few ccutimetcrs or better.
Appendix A.: Glossary 651
CArrographle modt'lfng: The comblnntion of spa1ial da1:, layers dU'ough the npplic.uion of' spnthll
operati-ons.
C•rrog...phle obj..1! A digi tal ,epreseut,iion of a real.wortd entity.
Carlometric map: A map produced such that the ft'lativc posj1ions of objects dep:icted me spatially
sccumte, within the l imits of the technology and 1be map projection used.
Cell dimension: The edge length of squaie cells used io rasttr data sets.
Ct>nfroid: A «ntral point loauioo for an area feature, often defined ns the poin1 with the lowest
average distance to a ll points th.ii define the area boundary.
C h::ira('ttl'istk hull: A polygon boundiuy thnt attempts to inchlde the densest concentration or a set
of observed points. Ofieu used iu home range a.ualysis. i1 sta.r1s w ith a Tri.nugulined Irregular
Network. or TIN. or points, a nd winnows large triangles to some s pecified reduction thresh·
old.
C boroplelb map: A map of polygons with colors assigned in a gradient thal depictsdassified levels
of a vuiable. These are oommooly used for PQJ>ulatiou density. avemge income. be~ltb risk.
or othe.r var-iables mapped by administr.11ive boundaries in which high to low caiegories arc
of inten?sL
Cl:lssil1c:ulo11: A eatcgorization of spatial objec1s b.'lsed on their properties.
Cloud r.ompu1tiug: Utilizing computer processing 1.ha1 is remotely located, typically on a ne tworked
array of disuuu compute I'$. accessed 1brougb the World Wide Web.
C lust e-r sampling: A technique of grouping samples. to rech1c.e tra,•d time. among samplc-s while
nu1im,.;Jling sample l1umber.
Code-phase C PS: see CIA code.
COGO: Coordinate Geon.1euy. the entry of spa1ial da1a via coordina1e pairs. US\1atly ob1ail'led from
fie ld Slirvc.ying instruments.
C onnectivity: A record or represa,.tation of d)e coru)tttedness of li.n.ear features. Two linear fea,
nu-es ot necworks ate conuec1ed if they Ul3Y be tmvcrsed w i1.hout leaving the nctwotk.
Continuous s•n•raee: A vnriable or phenomenon that changes gradually throt1gh two-dimensional
sp.1cc. .e.g .. e levruiou or 1empen,;1u.re.
Contour line: A line of constant value for a mapped variable.
652 GIS Fundamentals
Control polnts: Polin locations for which map ptojec1ion tll\d dat3.b3:Se coordinate pairs 3re ktto,vn
to a high d~gree of accuracy. Conlfol points are most ofte-n used to convert digi•ized coordi·
oatt$ to standard map projection coordinntes.
Com·erge11t cfrde: A circle usod in defining a facet for a triauguLatC'd irregular ne1wotk. that passc-s
1hrougb thrtt points, and does not contain any other poin1s.
Com·u hull: The !l)Olygon duu completely cont3ius a set of p0in1.$ and lhat bas no acu,e (< IS<>°)
exterior angles.
Coordiul\tt$: A p~r or triplec of numbers used to define 3 pOSitiou in space.
Dangle: An unimc:nded o,·crshoot in a line segm ent w hen crossing another Linc scgmcm .
Ds\t11 lodt J>tndtnct: The ability 10 make changes in da1a suucmre in a d.1t.1~.se m.1a..1.gen1tut sys.
lcJ.n that arc transparent to users or applications that use dala.
DAISl mod('l: A method of representing sp.,tial and asp31in.l components of real.,vo,kl emities on a
oomputer.
DA111bQst nrnnnge-mtnl s~trn (DBl\tS): A collection of $l0t'twa.re tools fo, the entry. <>rganization.
SIOrage. .ind o utput of d;ua.
Data Pane: Area of a map that contains graphic depictions of an area, nsu:illy t he largest porti<,n of
a map.
Datum: A se1 of coordioate locations specifying horizontal positions (for a horizontal datum) or ver~
1ical positions (for .1 venical datum) on the Earth S\U'face.
Datmu adjus tme1,t: A re~calculation of a danun based on additional measurements.
Daturu re1'.ll?a11tioo: The outcome of a datum re.adjustment. :a specific. defined datum ·surfuc.e and
set of datum points.
Datum $W.ft: Tile change in horizon1al or venieal point loca1ion th.it tcsnlls &om a datwu adjus1.
ment.
DAtuw tr:rn..stornuulon: A method or set of equations that allows the calculation ofa pollu loca1io11
in a one dattun based on coordinates expressed in a different datum.
DtdimHion: The ru1.g.le betwee111he bearing towatds True North and the bearing towards Magnetic
North.
Dtbluotiy 11·ltingle-s: The-set of 1riru.1gles fom1ed in a triangul.ued irregular net,vork. co1lll.eaing
points to Lhl! ntartst points to c..rtale triauglc-s. while ensuring that the triangle edges don·1
cross. and are fonned by convergent circles.
DEM: Digital Elevation Model. a raster set of elevations. usually spaced in a unifonn horit.onta l
grid.
Appendix A.: Glossary 653
De,•eloJ>flblt s m1lltt: A geonu:rric sbnpe omo wh.idi die E;mb. sphere is east duriug a map projec-
tion. Tl!ie devdopablc surface is typicalJy a cone, cylinder. plane. or other surface th.at may be
mathematically Oatte.oed.
Dig.it11l tern:1.iu model; A digital represeniation of elCVillion, including DEMs. TINs. and otbt'r dig.i-
tal represe.nta1ions.
Dll\1>h.rngm: A c:uuera component I.bat functions like the iris oftbe human eye. 10, cou1rol the
amount of light available 10 fall on tJJc film or CCD recording surface, and ,o improve focus.
Difttrentl:'d GNSS: GNSS positioning bast<! on two rcctivers. one at a know lo(.niou 3od one at a
roving,. unknown location. Data from roving receivers arc corrected by the. difference error
compu.e<t at the known location.
Oig.it:d map: an clectroujc, graphic depiction of an area.
Dlg.lclz.e. dlgtc:Wug.: To eouver, p:iptror oilier brudcopy maps to comp11(er.compatible 3.Ud SfOred
data.
Digitizing trible: A device with .'I O,u surft\ce and inp,n pointer used to digitize budcopy maps.
Dilution of pncisio n (DOP): See position dilution of precision ( POOP).
Dbsolw: An c:iperation th:11 reinoves li1les separo.ti.og adj;)cen1 polygons tl\,'lt are c<>nsidered equal,
baSt.-d o u some characteristic or meas,1re. A dissolve operation is typically applied based on
equill Vl'llues of vttrittbles that are contained i.o B tllble a:ssocia1ed with 1be d:ita layer.
Dom:.i11; The range of "alues a \fariable way take.
DORJS: Dopplerorl>itog13pby by mdiopositioning in1egrated 01) satelUte. precise ,neasurffllent of a
set of poi.ius on the Earth via Doppler shifts in sa1ellite !iguaJs.
Ool dC'nd ty maps: Maps with dots pl:,ced lo.side of polygons i.n proportion to numeric value of a
\'3tiabl>e. These are ustd 10 represent population for coun1ies or siates. or other nu.llltric data
corre-sponding to defmed areas, with more dots for higher levels of the variable, for example,
one do• per t .000 inb:iibit.<uns.
ORC: Digital Raster Graphics, a digital version of USGS fine- to medimn-s.c.afe maps.
DusiJ.rrtqutn.ty GPS 1·ttth•t1·: A teceivet c3pable of mea.suri.ng the Ll 3nd L2 broadcaSI sign.,ls.
and using these 10 tsiimate bighJy accurate and pr« isc positions. typically to centimeter lev·
els.
Dyn:.mk heig ht: A point"s gtop~cnlial number. di\'ided by a nommlizing conslant. It is used pri~
m.<trily \Vhen measuring wttter levels or hydraulic beads.
Easting: The .ixis approxima1ely par.tileI 10 lines of equ3J latitude in UTM and a n umber ofother
standard map projee-tions.
Elec1rowag11• lk Sp C'c trum: A range of energy \V.1'l.\'Cleugtbs. from X·rays through ~dar wave.
lengths. ihe electromagnetic spectrum is iypically observed at wavelengths emitted by the
Sun or by objects on Earth. covering w,weleugths from the visible to the 1he1mal i.nfrnted
rtgion.
Ellit>sold: A n131hema1ic-al model oft be shape of d1e Earth tb.nt is approximntely the sbl'tpe of 3 tl:u-
tened sphere. formed by rotating an ellipse.
EDl1>soldal bdgbt: Height m,asurcd fromao ellipsoidal surface 10, point oo the .s11rfuce of1be
Eanb.
Endl~p: Tbe eod,to,end overlap io. aerial photographs taken in the same Oigln li.oe.
Eutity: A rt.a.I \vorld item or phenomenon tha1 is rcprese.nt<.-d in a GIS systfm or database.
654 GIS Fundamentals
Ephtm('rls: lnfornn.·uioo on GNSS s..1.tellite orbitS, required by GNSS receivers to computer sateUi1e
position. range distan~. and r«.ei,·er position.
Epoch: A specific date to which GNSS d:ita are nom~lized. 3ccoundng for drift ln coordin:.nes
through tUne due to tectonic movement.
Epsilon b•nd! Abond surrounduig o lioeor fenn,re 1ba1 describes lbe positiooal error relative to 1be
foaturc loca tion.
Equal-llrtll c.las~tncation: A da$$itication me thod that assigos c lasses such that each (:)a,s corre·
spouds to an equal art3.
Equal-inte1'Y:ll dasslftradon: A class:ific-ation method that assigns an equally spaced sec o f classes
across 1he ronge of-' variable.
Equipoteutfal surface: As surface along whfoh the gravitational pote ntial is equa l. perhnps best
t1uders1ood as where the pull of grnvity is equal :,cross the S\ltfuce.
Eri:1se, function: A vtttor spatial operation. typically lhal ··dips ouf· and discards the. area ill one
!eyer corresponding to lbe polygon botindaries in another d:na layer.
EROAS: A G(S and r~molc st•nsing image processing software package owned and d c"elopcd by
Leica Geosys1ems. St. G.1Uca. Swi1zerland.
ETM+: Enhanc ed Thematic Mappl"!", a scanner carried on board L..t udsat 7, providing image data
with resolutions of 30 meters for visible through m id infrared. I.S O)eter paochn:unn(ic, nod
60 me1er for !berm.al wavdengd1s.
F~t"ft: A trinngular' face in a TIN.
F11.lse northiug: A number added 10 coonHn.ues in a map proj~ion. usua lly to avoid ueg:tti"e C()()r-
diruuc locations w ithin 1.he area of a map projection.
Feature; An obje,c::t or phenomenon in the landscape. Adig;i1al rtpreseu1a1ion ofthe fe:iture is often
called a canogrnpbic fea ture.
Featurt gtut nlllutlon; The incomple1e represeutalioJJ of shape defining coordinates for entities
represented in a G IS.
Flduch1l ffi :\l'ks: Also known as Ciducinls. precisely scribed m..,rks d)31 .ve reoorded near 1he edges
of aerial images. and used 10 remove systematic camera distortion and to registe r imagts.
FIPS: Federal l.nfonnntion Processing StBud.-irds: code. a se1 or-numbets for defined pclitical or
physical entities in the United States. There arc FIPS codes for caeh siate. coun'ty. and olh{'r
feature$.
Finl Noruud Foru1: A se1 of requirements for a rel.uion..,J databas.c cable. prllutuily tha1 there be no
repeal column, defined as columns that reprei;t'nt the same kind ofinformation. For e.~ample,
a dam base table intended 10 represent fumilies.. repeat columns forc h.ildrtn wo\lld violate the
requirements for first nomial form.
Fl'i(tlon sm·fA«'! A rnS11et surface used in ca.lcufating \'.lri.lble tmvel costs through an acea. Tiie fric-
tion surface represents the cosl per unit di~ance to travel through a cell
Fb1bed srAn11ei·: An electronic dcvioe us.eel 10 record a digital image of a hardcopy map or image.
flow direction: Tine dfrcction water w ill flow from a point. usua lly an azimuth or bearing angle
assigned to• r.,stcr '"II,
Fort-igu key: A coSwnn in a relational database table tb..,i is a candidate key. and used to join lbe
table 10 a different relational table .
Friction surf;1ce: See cost surface.
FTP: File Tnrnsfer. P rotocol. a stao.dard method to tl'Msf'e.r fi les across a computer aerwork.
Appendix A.: Glossary 655
Fun(do11nl d cpl'nd('n()': Propeny of-, set of items in a d-llt3bttse uible. If one item is fi.ulc1i<iually
depcod cnl ou another, I.hat means knowing the value of we item guaro.nlees we know the
corresponding value of the s.econd item.
Ga lileo: A Europe:w-based GNSS system.
G(lne1•tttlia1ton! The simpliJienion of shape or position d1a.1 inevimbly occurs when feanm.~·s nre
mapped.
GtocC'nl1'k: A measurement system that us,e,s tbe center of the Earth a.s the origU.1.
Geocodiug: The prooess ofassignjug a geographic or projection coordi.oa1e to a data item th11t is
based on a strttt 3Cldress, town, and state or country.
Geodedc dl\tmu: A reference system aga.insl which borizou1aJ andlor vertical positions are defwed.
It typically consists of a sphere or ellipsoid a nd a set of point locatious precisely defmed with
referenice to lhat surf3ce.
Geod e-sy: The science of measuring the shape of the E.arth and locations on or i.n the Earth.
Gtold: A 1neMtu'elllent·b.lsed model of 1he sh..-,pe of1he Earth. The geoid is a gtavi1a1ioi1al equipo.
tmtilll surface, meaning a standard surface ofequaJ gravitational puU. The gcoid is used pri-.
matily as :i basis for specifying teCTain or other heights.
Geoid:d height The dislauce measured normal to the ellipsoid surface from and ellipsoid to a
geoid.
Geognipbk N orth: The northern axis of rolation of the Earth. Also kno,m as Tru.c North. and by
definition ljes at 9'1" nonb larinide.
GeoMedlll: A GIS softwact package produced by Intergraph, luc .. of Huntsville. Al.tlbama.
Gfopot('ntinl: 11ie gm,~bttlOnal potential geoemt~ by ~1e- mass of the E:irtb. defined M tbe work fo
move a \1.u.it mass objtct from infut.ity to lhllt point. It is by convention a negative number th.it
decrease-s (becomes more negative) as on grows c loser to the mass center of the Earth.
GIS: A geogr.1phic infonnation system. A GIS is a cornpu1er.b.,sed system to aid i.o the collec1jo1i.
maintmance. storage. a na lysis, output,. and distribution of spatial data and rinfomiation.
GL!\I: Geogrnphie ~fatkup Lauguage. a standard method for docuineming and 1:mnsferring spatial
data.
GLONASS: G lob.II Navigation SateJljte Sys1ern. A Russiau developed and m.,imained sys1em for
coordinate mcasurcrmnl and positioning.
G lob1d op<'nitlon: A spatial operation where the om1>u1 location. a.nea, or exrem comes from opera.
tions on the entire i.npul area or exteu.1.
GNSS: Global Navigation Sa1eUi1e System. A coo$te1latio1l of satellites ph.1$ a gro.und control s:eg·
ment lh..11 a.I.low$ preeisc location ou or above lhe Earth. This inchldcs GPS~ GLONASS. and
other satellite navigation sys1em.
Gnoruouk proJ<'<'liou: A m:tp projection w ith die projection center plnced a1 1be center of 1he
spheroid.
GR.ASS: An open.source GIS software system.
GrntkulP: Lines of latirude and longirude draw n on o. b.ardcopy map or rcprescnt«l in a digital dam..
base.
Gra,imett"r : An instrument for measuring the strength of tl1c gravitational field.
Grt:it drtlt: A cirtle on the st1tface of the gJobe tMt splits the globe into 1wo equ.al parts, Md is on
a plane that passes through the center of the globe.
656 GIS Fundamentals
G1'('M dl'dt dhtaoC'C': Tbe sborces1 dis1:1nce bet·ween two points on dle surf3ce of1he Earth. Tiiis
distance follows a great c ircle rou1e. defined as the route on I.be surface defined, by a plane
rhat intersects t.be starcing nod eoding J)Oi nt and tbe center of the Earth.
G rid, lllflJ>: Anet,~ork of constant X- and Y-coordi.nates on a map.
Grid No11Ji: Tite direc,ion parrulel 10 the northing a,is inn projecrnd. Cnnesian c-00rdi11a1e sys1em.
G rt>enwkh ruerid:hm: The liue ofequal longitude passing through the Royal Observatory io Green-
wich. England. This li1Je was tllken as zero, by convention. for the sy,tem of longitude mea-
sure01eutS for the wotld.
GRS80: Geodetic :Reference Surface of 1980, an ellipwid used for map proj«tions in nmch of
North AJmric:i.
HllrdC'opy nrn.1>: A map printed on physical media, usuaUy paper.
Height abo"f' el.llpsold. DAE: See ellipsoidal beigb1.
HE>lmE>rl 11·:msfol'nt:ttion: A method to mmsfonu among horizontal danuus.
Hien11rcbk111I dnu, model: A method of org.auizing anribute dat:a th.it stn1cntres values in :a 1.rte. lyp·
ic.ally from general to more specific.
HlghActu1·ftC')' R~f('t·('nte N('lwcn·k (IIARt'"): A set of stlte-specifie horizontal d:mum realized in
the early ye-ars of GPS measurements.
Hlgh-p~ss nlt('r: A raster operation that identifies large or high-frtq\leney differences betwe-en
cells.
Hyd1·og.rnphy: Geogrnphic tepreseniation ofw:ner features.
Bypsogl'Rpby: Geographic rrprcsc,ntation of hc,ight fc,11tures.
Idrlsl: A GIS fron:i Clruk University widely adopted wi1h panic\llat strenglhs in analysis for clevel·
opment. land use assessment and planujug. :wd sustainability.
lkonos: A high resolution imaging satellite system. llconos provides l·meter panchro11331ic and 3.
meter multi spcetml image d.ala.
Ion('I' Join: A combination of r.\'O data r.,bles in a database mMUtgement $)1$1em based on a key c:ol·
wnn. The o'ulpul lable combines rows by matching v;dues in the key column. and saves only
row'S that have matching key \'Slues in both tables.
Interua1ion:d Tei·1..stl'ial Retea·ence Frame (ITRF): A geoce111ric cootdiu.,1e rcferen<:.e frame that
foUows an international standard for SJ)C'cifylllg Earth coordinates. Defines an origin. ellip·
soid3.l shape. and X. Y. and Z coordinate directions.
lnte,rm1tional Ten?stl·ial Re£E>l'enc·E' Sen'ire (l'TR.S): A system of ptople, institutions. protocols.
bardw~re. n.nd software for orgru.tiz.ing and taking the measuremems ueeded for c.oJculating
the ITRF rc-alizations.
Iouospherk d('l::1y: The c.haoge in tn.wel tllne of an el~tromognetic $1gnal when passing lhrougb
the ionosphere. Mos-t applicable in GIS to uncer1n.iuty in GNSS positioning due to unctttain
signal travel times.
Ius 1an1aueour Oeld of ,·iew (IFOV): The area or augle sensed by 3Jl imaging system. or sensing
component such as the. pixd or Jeus., of the systcnt.
Interpoll\tioo: The escim.,1ion of vrui:tbJes ,u ullStullpled IOC4tiollS from measuremen1s at sampled
loc.ations. interpolation methods arc usually understood to use a fommla with all parrunefers
that are pre·wdetennined. meaning thnt parameter vahies used in the formuL1. do oot depend on
the data val.ues.
Appendix A.: Glossary 657
lntt1'S('<tlo11 (o,•e.rlny): The venjc-3( combin:u ion of nvo d.at.1 layers. typicaUy restricced to the
extent of one data layer but prestrving the data contained in both data layers for that extt nt.
lot('rw1.lf1·:uto sc-alt: A iuea$uremein scale thnt records bo1h order and absolu1e di:ff'erieoce in v.'thie
for a set of variables.
Isopll"lh mop: Amop depictiog lines or consmm volue for• vafi•ble. olso lo10M1•s • oontour map.
Items: Variables or a nributcs in a data iable. typic.aUy vie wed as the columns o f the table. These are
tbe types of essentialc.bnracteristics used to described each feature in die geographic dat.a set.
e.g .. ane:,, depth. m d water quality for a lakes daf.'I se1.
ldri.d: A GIS system dl"Veloped by the G raduate School of Geography of Clark University, Worces·
1er. M;).iss.--cbusens.
IDW: [nverse Distance Weighted interpolation. a method of estimating values a t unsampled loca·
tioos b;.1sed Ol'l 1be ,·31\le and disuu:ice to sampled Joc.:u iollS.
Infrared imai;l': An image that re-rords rcflcctaJlcc in the nror infrared wavelengths. typically
ll'lcludi.ng0.7 to I. I mjcron,eters.
JP£C: An image comp~sion format.
Ktrntl: Ail amngemem of cells and values used as :1 muhipl.iea1ion teinpkue ir1 roscer analysis.
Kerm•I mnpp:ing: A mclhod of ide ntifying core areas, concentrations, or density of occueation
based on "sl'acking" kernels that repl'esellt occturenc,e frequency at obser\'ed loeat1011S.
Key : An itcm or variable in a relational table used to uniquely identify cac.h row in the table.
Krlglo.g: An i11.terpolation meth<>d b.1.sed on geostatis1ics. the meast1rement of s~1tfal AtnoeorreL,.
l ion.
Lnmbtrt conlormnl conk: A common. cone·based m,,p projection.
L11.nd.sa1: A NASA project s panning more than lh.rcc dtcades and seven satellites chal proved the
capabilities of space,based remote sensing of land resources.
Latiludt>< Spb:e-rieal coordinates of Earth loca1ioo that vary in a uorth-.soul1l dirtel tion.
Law of slnei: A ll'igoc:iometric relruionship that a Hows: the calculation of unknown 1riangle e-dge
lengths fr<>m known angles and edge lengths.
L('V('llng su1'\~eyi : Surveys used to measure the relative heiglu difference benveen sets of poiots.
Lidnr: Las.er detecting and ranging, 1bc use of pulse las.er mcasuremenls to iden1ify t bc heigh1.
depth.. or 0 1her properties o f features.
Line smootbi11g : adding vertices and c h.auging the sb..,pc of a line. usually d uring d igitizing. 10
remove discrete a ng les a nd crea1e a bending a rc.
Line snstpJ>ing: Automatic movement of digitized poin1s to a ne.arby line when the point locaiioo is
within a specified snap distance.
Lint' 1blnn.iog: Removing line vertices 10 reduce file size "1i..ile ma.iutnin.ing most feature shape
i.ufommlion.
Llnt>nr l'tftre:ndng: See geocoding,
LIS: A Land l:nfonuation System, a nam e. origina lly applied for GJS systems spcc~fically devclopc-d
for property ownership and bou1ldary records 01a113gemeiu.
Local operntfon: A s pa tial opcralion whe re the output locati on. area, or e x.lent come s from opc-ra·
tfons on that sai.ne extem.
Logical modt"I: A coooeptua l view of the. objects we portray io a GlS.
658 GIS Fundamentals
i\1Ap nlgtbrA: The combination of sp:nfal dita l:\yers using si.zuple to complex spatial <>peratious..
Map generalization: The, ap-proxim.,tiou of real foa1t1.res on a m..1p or in a spalial database, wheiT.
the physic~ shape or location does not completely represem all 1be details ofthe Ce::tmres.
i\'laJ> projK'lion: A systematic rendering of features from a spheroid or (']lipsoid rcpre:scoting lhe 3-
dimensiona l EaiiJi 10 ri map sutface.
l\'laJ> , cale: The ratio of a dis'lance on a lU..Bp 10 the. corresponding distance ou the Earth.
Maplnfo: GIS sof1ware package pro<h1ced by Maplnfo. hlc .• of Troy. New York.
Mean center: A measure of the.C<'nlraJ location of a S<'I of obj t'cts or obst'rvations.. based on lhe
meao x and y coordinates for all observations.
Mean cll'cle: An estimate of a core area via a oirc.lc c.cntercd on the mean cenlcr-. with .a radius
de.rived front the observed points, for example. tbe standard devintioll-.
MeridUl.11; A line of eonsian1 longitude.
Magnf'tit' North: The point where the oorthe.rn lin e$ of 1:uagoetic a1traction en1er the Earth. r..iag·
netic Nonh does no1 o«:ur a1 th e same p0iu1 as '"Tnie" orGcog.r3phi<: Nor1b.. ln the absence
of local iutcrforence a compass needle points towards magnetic north. l he magnetic north
p0le is <:usr.ently located in northern C3nada.
Melodoto: Doll! oboul darn, Iha! describes 1hc properties of• spatio! doll!,., including 1he roordi-
si.ate system. extem. m1ribu1e types and \'alues. origin. luleage. accuracy. nnd 01l1er character.
i5'ics oeede-d for effective evaJuation and use.of data.
MNt.s and bounds Slll"\'f'y: A $urvey method based on distaoce and son:ietl111es angle rneasuremenl$
from kuowm or monumented poi11ts.
Minimmn db:tancf' digitizing: digi1izing in s,ream mode in which points less than a minimwn dis·
1ance apttrt are not auo,,..'ed.
M.inimnm mapping uni.I (Mi\lU): The smallest area resolved wben interpreting an aerial or satel·
lite image. c>r when mapping area features from -a souree data se1.
Moving ,dndow: A usually red.angular arraJlgcment of cell$ that shifts in position across a rasttr
d:nn set. At ench posit.ion an operation is 3pplied using the cell values currently encountered
by the moving \\•indo,v.
MSS: Multi.spectrril Scanner. an early S3tellite .irnagin.g scanner carried b)' Landsat satellites.
Modifiable areal unit pl'oblem: The dependence of aggregate area statistic-s on the si2c and shape
ortbe aggreg:uion units.
i\'I ODIS: Moderate Resolution [maging Sensor. A later generation imaging sc.a nutr that is part of
NASA'$ M L$sion to Planet Earth. Provide$ high spectml re$0lutior1.. frequent global coverage.
and moderate spatial resolution of from 250 to 1000 meters.
Molod enski transformation: A me.1bod to trn.o.st'orm among geodetic datum.s.
Monte CA.do ihuulJ1Liou: A me1.bod of estilll31iog the variability in a process or model by adding
small unce.nainties to the input drua or parameters over thousands of model mns, and obsen··
ing bow tbe-s:e slll311 diffe~nees change the ot11corue.
Appendix A.: Glossary 659
Mul1lp~r1 (Nthu·(': A vector (eanire. us,1:illy of polygons. where- 1nultiple. sep.ltat-e geographic enti·
ties arc- grouped and t:rcaled as one. aud associated with one row iu the associated attribute
1.nble.
Mul1i.spe-c1111.I: Au image. film. or system that records d:i1a coUt<:t((i from multiple w;webands.
Multi-tier i:ireblteenu~! Adatabase 1uanagemem system design where lhere nrn multiple levels of
clients above a strver.
N lldir point: The point direct.l.y below the aircraft. usually near the centtr of an aerial image.
NA027: North American Datum of 1927, thc adjustment of Jong-baseline surveys to esrablish a nee·
work of siandardized horizontal positions. in the early 20th centmy.
NAD83: No1tl1American Datum of 1983. The s,1ccessor to NAD27. using approximately ml order
of mag.nitude more measurements and improvements in anal}1icaJ models and computer
power. The curl'fflt network of suw.d:'lrd borizonrn.l posi1iol).S (or North America..
!NASS-COL: Nationa l Agricultural Statistical St-rvice Crop 0-ata Layer, annually produced raster
data sets of crop cntegorits for United St.\1es f31'lll1.,nd.
NAVD29: North American Vertical Datum of 1929, an adjustment of vertical mtasure111cn1S to
establish 3 nerwodc: ofbeigbts in the early 20th ceonuy.
NA\1088: North American Vertical Datum of 1988, the succt'ssor vertical datum lo NAVD29.
NATRF2022: Not1h A.lntrican Terrestrial Re(ereL1ce Franle of 2022: The sticcessor dn.nun to
NADSJ(2011).
NLCD: National Land Cover Data set, a Landsat Thematic Mapper (Tl\.1) based classification of
hludcover for the Uni1ed S1att:$.
!NOAA: National Oce:auic and Atmospheric Administration. the U.S. government agency tha1 over·
sees 1b,e developmeo1 of nation3.I d.'lrums.
!Nod t': An important point along a line fcamre. where two lines meet or inlersect.
Node, s u::1pph11g: au1011.unic tllOvetnent ofdigitized points. to a 11e.a..rby uode when the point loca1ion is
,,..itbin a specifi~ snap distance.
Norulm\l s111r l bu1~: .'I v:u-iable that provides descriptive. often non.quau1iiative i.iiform.,tioii abom 4
s patial object.
N ominal S<'lll(': A measurement scale th:u indicates the difference belweeo v3Jues. but does. not
rcRect rank orabsolu1e differences.
N orrJ1 ~ r row: Graphic, usuiUy in :uTOW, that show the direction to geogr.,.phic. magneiic. or grid
north o.u a map.
NorrbJng: The axis U) the approx.ima1ely oorth·south dirtttion in UTM and 01ber .sto.odard coordi·
narc sy-s-l('mS..
Normal fom1:s: A siandard method o( structuring relational dntab.ases to aid io updates and remove
red undmtcy.
N-tuplf': A group of attribute values in a database management system.
660 GIS Fundamentals
NWI: Nt1tionttl Wetlands ln\'en1ory cf.3.13 com.piled by the U.S. Fish and Wiktlife Service over inosi
of the United Slates. These data provide firsl·pass indic.itious of wetland type and t-xteut.
Obje(t: See c:mogrnJ>hic object.
Objecl-orie uted d n la model: A data model that incorporates encapsulation. inheritance. and other
objee1-orieuted progrsmming principles.
Open sou11:e sofc,.,.·:ue: Computer programs that provide the source c<>de to any user, typically ~as·
ily acoe$$ible through a web portal.
Operflliou, SJ>Rlial: The manipulatioo of coordioatc or attribute data.
Optk;:il n:ds: A ray appro,dmately perpendicuJarto 1be film or imn_ge plane in a camera and pa.mllet
10 lhe ceute.r ofthc Jens barrel. th.al ni:ay be thought of as the priU13.ry di!fftiou of incoming
light.
01·dlmd Slltdbutf: A variable l.b.:n contains n t.inking.
0 1·din:d SC'Ulfl: A S.C-Ble that represents the relntive order of \talues but does not record 1be magnitude
of differences between vaJues.
0 1·thogonat Intcr:secting at a 90 deg.:ru ang le.
01·thogn1.1,W(' ,1rw: Horizonrn.l placement as wouJd be seen from. a veni~l viewpoilll :i1 infinity.
There is no term.in or tilt-perspecli ve distortion in an ot1hographic view.
Or1hogrnphk proje,cdon: A map projection with the projec1io11cen1er an infwiLe distM<:e Crom 1he
m ap surf.ace.
Or1holnrnge: s« <irthopbotogrnph.
Orthomell'ic heigh t: Height measured from the Geoid surface to a point on the surface of the Earth.
Or1hopbotogr11.pll : A ,·ertieal pbo1ogr39h with :in orlhograpltie view, Or1bopbotogt3pbs :i.re ere,ued
by using projection geometry and measu~menlS ton-move tilt. 1errain. and pers-p«tive dis-
toni.on froiu aerial photogrnpbs.
Ou1er join: A combination of two dat'l lables in a da1aba~ mauagcmeu1 system based on a key ool-
mnn. The 01.npu1 table appe.nds those rows in a second table that ma1ch values in the key col·
umn. Null values are pltle«I in joined-table oohuru,s from the second table whe.re there is no
match to the first table.
Owrlay: The ·,·enicaf" combination of two or more sp,nial da1a 1:tyers.
Oversh oot. A digitized line that extends pas! a eoru1ccriug line.
Pauchrow11tk; Ao image. film. or sys1em 1b..11 rtoords i.n only oue waveleng1h baud. a.ud resulting
in gray scale (black and white) images.
P111'11.lhu::: The relative shjft in position of fea1ures due to a shift in viewing loe.'ltion.
Passh 'e- remote se-.nsiug !.yste-m: A system that does not emit the radiation it records from 1ar:gc1
objects.
POOP: Positional .Dilution of Precision., a figure of merit uscd to represent the qualily of the satellite
geometry wben tikin.g GPS mdings. PDOPs berweeo l aoc:f 6 are preferred [or most applka~
lions. and lower is betttr.
P'-'r.sp-N~tl\' (' con,•<-rgeuce-: T11e app.u-ent decreBse io ioter-object distaoce as the object$ a.re farther
away. for example, the appatent cOo\•crgencc of two railroad rails as Ibey recede iuto the dis-
tance.
Appendix A.: Glossary 661
P trspt(th't dtw; A view oo a loc,uio111ba1 includes some relief or p,erspec1ive di s1ortion. meaning
the location of objects may be distorted if theirttla1ive distance to the c-mn.e ra varies consid -
embly.
Pis.els: Picture elements that make up an image, these are the indlviduaJ grid cells that record or dis-
play a brightness or color in an i.roage.
Plan cu1·,·Ature: Terr.tin curvature along a oontour.
Plllnar topology: The enforcerueot of in1e.rstt1ioo for line and area features in a digital data layer.
Each line crossing requires au explicit node Md intersection.
Pbme J Ul"\'f')ing : Loc.ation surveying methods suitable under the assumption that dte surveyed
l.luds fonn a plan.at surface. i.e .. lb.at disronious d\1e to the Earth's curvature tUay be ignored.
Platten: The f:1at back ponion of a fi lm camNa against which the fi lm rests while an image is col·
lected.
Plumb bob: A weight on a string held freely to determine the local vertical dir«tiou.
Pobu ruodt d:lgldzl11g: Mtu:1ual digi1iziug in which the opcrotor must ptess 4 mouse button or 01ber-
wisc indicate when a point should be sampled.
Poblltr: An :tddress stored i11 a dnta Slruen1.re pointing 10 tbe l'lexc or rehlte<I data ele111en1s. Poimers
are used to organize data and speed access.
Polygon: A cl<>sed. connected se1 or lines dun define an :\tC'3,
Polygon indusio11: An area differen t in some characteristic from the rtcorded a1~jbu1es of the poly-
gon. but 1101 resolved.
Po~lion dilution or precis"ion (POOP): An index of the geometric distribution of a set of satellites
for the purposes of estimating and controlling position accuracy. POOPS typically moge
between l and 20. and lower POOPs on average result in higher po.sitioual accuracies.
Precl~ion: The repeatability of a measure or process.
Pri.ml\1-y key: A row or rows in a relational data.base 1able that is selec1ed as the key. and I.hat
uniqudy identifies all the rows in the table.
Pri.me rueridlaan: See Gt'eenwieh meridian.
Profilf' cun•a•UJ'f": Terrain curvan1re in the direction of steepesa descen1.
Projection. see Map projection.
Proximity functio n: See buffer,
Public L:rnd S w·yey $)'$'.lem (PLSS): A land measuttmcnt sys1cin us.eel in the western United
Stales of America 10 unambiguously define. parcel loca1ion.
Pynuuiding, IRAslel' pyr:uniding : Building rnsters. often images. of successively co.'U'Str SJ)3tial
resolution ,,ithin an image, primarily to allow fasier redrnws when panning and zooming.
QGIS: All open-source GIS.
Q uad- h·e.-s: A raster d.nta stmc.rure bas.ed on succ.essive, adaptive rtductions in cell size within a
data layer to reduce storage requirements for 1hem.11ic area data.
Q uE'ry: Requests or searches for spatial dn1a. typic.ally applied via a database man.agcmcnt sysiem.
R,-d"'l lrus dlstonion: The disphlce:rnen1 of object.sin ttu image due to san.i.U lens iruperfec1ious.
usuaUy radially inward or outward.
Random samplt pantru: A s.'lmpling ~nem where sample location is de1emiined by 3 r.\Udom
process.
662 GIS Fundamentals
R1rng(' dls-r~nct: A measurtment between locations when positionUlg. \1sually referring to s,uellitel
rcc,civer distances in GNSS positioning.
Rnng(' pole: Apol,e used in S1.1rveyi1\g to raise a GNSS anteona. survey prism. or other survey
instru~nt abo\•e the ground. Rauge poles are oftcu used in GNSS data oollcction to raise an
antenna and thereby obtain better POOPs. and improved accuracy.
Rn,ster dAIR wode-1: A regular ··grid cell" approach to defining space. UsuaUy $(tuare cells a.re
arranged in rows and columns.
Real lime dltfe1·eo1iru (on·~(lion: GNSS posi1iou.ing which relies on a radio link Md ~xtemal posi·
tion.in_g me:tsurements to oorrcct major GNSS caors in real time, a ud provide. instant
improvemeaus in accm'3C}',
Real lime kinematic positioning (RTK): A form of real time differential correction of GNSS posi·
1io11s.
ltecord: A colleclioon of a Ltribules slored for a sp«ific i.nslance of an 1.'Ulily.
Rtf<'rtnct {rstmt: A well-defined. us\1:tlly ellipsoidal. surfuce. 1ha1 serves as :, OOsis for our dan1ms.
Regislrntion: The conversion or assignment of coordinatt'S from a non-projected coordinate syslem
10 n coordinate system. TYJ)ical ly via M .tffine n's\nsrorm.ation.
Relsttions: See relational table.
Rtl11tlonlll Algebr11: A ffl ofoperations on dntabase cables spe,cifted by E. F. Codd for lhe consistent
mrullpula1io u of cfa1a in a database.
Rtfatlon:ril lAblt: A dat:t cable in a relational datab.1.se management system.
Relief displacemen t: Apparent horizontal distor1ion of features due 10 height differences relative to
the nadir pai.nl inn vertical aerial image.
Remote Seus-ing: Measuring or recording information about objec1 or phenomena wi1hout cootacc-
ing dtem.
ReSA.mpling: The rt('.akulation and assigruntut of cell values when changing cell size and/or orieu-
talion of a raster grid.
RMSE: Root M«111 Squllre Error. a sia1istic th.at me.-sutts 1be difference between true ruid predicted
data values for coordinate locations.
Rubbersbttciu.g: The use of polynomial or Olhet nonlinear tn1nsform:uions to nt31ch feature geom-
etry.
Run-l('11g1b coding: A compression ine1hod used to reduce sto1-age requirementS for taS1erdarn sets.
The value a.ud nwnbcrof sequentiaJ occurrences are stored.
Stb('mn: A co1\1p.'l,C;t graphicol represenrationof a datab:,se conceprual models. entities.. and the rela-
tionships among them.
Scope: The spaiial e;c:1ent of input for a sp:.nial operntio11.
Seoul lines: Lines of intersection between a dcvelopablc surfac~ and a spheroid in a map projcc-
1ioo.•
Sele-c1ion op N·atio11: The identification of a set of objects based on their propt1tics.
S<'ml·mnjor· n~s: The larger of the two radi.nl axe-s 1hat def't»e an ellipsoid.
Semi-minor axis: The smaller of the two radial axes thnt define an dlipsoid.
Semlnuiitntt>: Th,e vari.111ce between values sampled a1 a given lag distance aprut.
Appendix A.: Glossary 663
Str,•('r; A co1ttp\lter or a progt:tm co1nponent th:it stores data. and provides S\ibse, s of d3Ul in
response to requesis.
SN nlgOOrn: A me1hod for specifying selection criteria based on comparison oper:uors less than.
equal lo. greater than. aud ~haps ol.berS.
Sb•d•d ~U•r m•p: Adepictio11 of 1he briglnn~ of cem,in ,,Oec1ioo with ngiven sun loc,uioo.
Sidel11p: Edge over-lap of pholographs la.ken iu Oightlinc-s.
Si11gltp11r1 f('ohJ1'('>: A vector d.1ta l3Yer where e,·ery feature is individualJy represented by a point.
line, or poJygon. :wd there is a corresponding row in 1be attribute table for -each featutt~
SbnhtJ': A s.ystem (or controlling the time or amount of light reaching a detecting surface.
Skeletouhing: Reducing the width of linear fearures reprc.sented in rasttrd.tla layers to a sing.le cdJ.
Slh·tr: Sm,111. spurious polygons at the 01arg.ias or boundaries of feature polygons that are au ru1e-
focc of impiecise d igitizing or O\•crlay.
Slop": Tbeeh:ange in elevation o,·era change Ul location. usually measured over $0me fixed i.oter-
val. e.g., the change in height betwten two p0in1s 30 meters .'lp.trt. Slope is ust1ally rep011ed
as a percent slope, or as a degree angle measured from horizontal.
SLR. Sinell.tlQ L::.s<'r RJl.ngiug.: System for 01easurin.g posi1jon. particuJarly tiplifi and subsidence.
from Earth to Satellite laser range find ing measurements.
S1u11> dis1:u1cf': A distance lhreshold defwe<I in dig.itizing or other S'p.'lti:d .t.u..,lysis. Poim features.
nodes. or vertices within the snap are moved lo be coincidenl. to occupy th.c same local ion.
Snn1> 1olel'1,rnct: See snap disrn.nee.
Sanppin~: Automatic line joins durin~ vector digitizing or layer overlay. Nodes or vertices are
joined iftbey ;ue within a specified snap distance.
Spngb"lti d;ita mode-I: Vtcior data model in which lines may cross withoul in1ers,ec1ing.
Spnd1d dt1rn modtl: A way of organizing sp:uial dMa i.n a d:m1base, including cbe concepni:d
description and rela1jonships among spatial objcers.
Spnr:fal obje<"r: A digi1al represema1ion o( a physical objec, or phenomeoon.
Spatial OJ>eration: A logic.al. mathematical. selection. or olhcr spalial proc,ess th..-,,l tnms-fonns spa-
1i.al dat:a.
Spa1fal resffl.11tion: The smn.llcs-t featurt tha1 cau be iden1ified in a da1a set.
Sp<'Ct1•11m : see electromagnetic spectrum.
Spbeairal coordinates: Aeoordiuate syslnn based on a spbcrc. l ocatioo ou the spben, surface is
defined by two angles of rotation Ul orthogonal plane$. Tbe geographic eo<>rdinate system of
latin1de and longitude is the most common exrunple of n sphericttl coordinate system.
Spheroid: A nutbematicBI model of the shape of 1bc Eanh, ba$ed on the equation of a $pbere.
Spirh le"eling: An early leveling survey 1edtniquc in which bori20utal lines werei established
between survey stations, and relative height differences de1em1lned by measund marks on
IC\'Cling rods.
Sp tin": A smC>Othcd line or stuface created by joining multiple constrained polynomial funclions.
SPOT : Syste.oo.e Pour l'Observruion de La Terre. ti sateJlite iotaging system providing JOto 20 meter
resoluti on images.
SQL: Strucrnced Query Lru\guage. a widely .\Clopte<I set ofcoUllllands used 10 maoipul.'tte ret.,1ion3J
data.
664 GIS Fundamentals
SSURGO: Fine rei:sohltion digit31$Oil dma corresponding to coun1)• level soiJ sur\'eys 5n lhe United
States. Produced by the Natural Rcsouree Cons<'tvatjon Service.
St~nds'IJ'd 1>art1ll<'t:s: Lines ofimersection between n developable surf3ce and a spheroid i.n a map
projection.
STATSGO: co.,rs.e resolution digital soil dala distributed on a statewide basis for the United States
Most ofleu derived from :iggreg_.i1iou and generalization ofSSURGO data.
Stlltf" Plan(' Coordinnte$: A sumdardi-zed coordinate system for the United States ofAmerica 1hat LS
b.lsed on th,e Lambetc conformal conic .'Ind tt.\.0$vttSe Mercator projections. St:1te pJaue
zones are defined such that projection distortions aro maintaine-d lo be less th.an. l p..lrt in
10.000.
S tereo pairs: Ovesrlappin~ photos taken from different positions but ofsubstantially the s.1.me arro,
,vith the goal of usUlg parallax t0 interpre1 heigl11 differences witbi.111he overlap area.
Stereographic projtttion: A map projection with the projection ecnler is placed al the ai1tipocle, the
point on the opposite side of the spheroid from the projectjon intersection point with the
spheroid.
Stel'f'OP hotog1•11pbs : A pair or more of overlapping photographs that allow the perception of three.
di.mtnsions..due to a pel'$peeti\'e 5hift.
St1'f'am modf' digitizing: Point data collection via manual digitizing where the distance or time
i.nterval between s.wpled locations is 6xed. This removes the need for a buuo1t press by the
manual operator during digitizing .
Sh·uctu1'td Qutry.• L1'ng11ngt (SQL): A st-a.ud.vd syn1ax for specifying queries 10 d1ua'b.,.ses.
Sm·vey station: A position occupied. and from which measurements are made. during a land survey.
Sys.tttlll1tk saruplit: A $3.mpli.ng pauern wl1b a regul.v s-,1upling framework.
Tenestrild re,fe-t•e n ce frame: The set o f measured points and their calculated coordinates that are
used to deiane a specifte. o r realized. geodetic datum.
Theuu1:1k ht)'er: T11ematicaUy distinct spatial daln organ ized in a single layer. e.g... all roads in a
smdy area pJaced in one them:n ic layer. all rivers in a different them:.uic layer.
TIFF: Tagged lmruge Fi.le formal, a widely-supported image d istribution form.at. The Geo-TIFF
variant comes with image registration i11forma1·io1J e111bedded.
TIGER: Topologically lntcgrnled Geographic Endcoding and Referencing files, a set o f structures
used tO deli;ver digital vector data and anributes as$0Ciated wi1h 1he U.S. Cens:us.
Time.geogn1.pbk df'ns-ity e$limn1iou: A method ofestimating the location of n moving objcet from
a time seqlL'l!'nce ofob$ervation. The method uses time intervals between observations along.
wi1h estima.tes of M·er-,ge. ma.'timum. 3.11dfot min.Unum speed to estimate a prob-,ble occup-,.
tiou region..
TIN: Triangufated Irregular Ne1wotk. a data model 111os1 c-01wnonly used to reptesen1 terr.tit\. Eleva.
tion points are c.onnected to form trianglts in a network.
TM: Thematic Ma.pp.er. a bigh-resolU1ion scanner cnrried on board later L.'\Jlds:i.t s:nellites. Provides
inform.u1oo in the visible, near infrared. mid infrared. Md thermal portious of the eled.fo-
magnetie sp«ui.nn.
1uH\UPS: An im.ag~ proet--ssing and GIS soflware package prod'uced by M.icroimages. lnc.. of Lin-
coln. Nebrn.ska.
To1,ology: Sb.apc-i.nvari.ant sptitial proptrt.ies of line or area features such as adjacency. con1iguity.
and coru1«1«tncss, o ften recorded in a st"t of related tables.
Tr1uis,·e1·S(> Mut1J1tor proje-cdou: A conunon. Dl3p projec1ion b.ued on a trans, <trsc cyliuder.
Appendix A.: Glossary 665
UTM: Univer:sal Transverse Mercator coordinate $)'Stem. a St.'lndard sec of mnp projtttions devcl·
oped by I.be U.S. 1'.filitary aud widely adopted for eoottlinate specification over regional
s rudy a reas. A cylindricaJ projtttion is specified with a central meridian for ench six..c!cgrce
wide UT~,t zone.
VarUlble- di~tAll<'E' bufft>r: A buffering variant whcl't' t he buffer distance depends o n some value or
level of a feature ,mribu1e.
Vf'cto1· data model: A repre'St'nlation o f sp:atia l data based on coordinate loc.ation storage for shape-
definiog poi.ius a.ud associated aur1bute information.
Vertirid datum: A reference s mface against w hjch vertical he ights art' measured.
V('rte.i, \'trtk:('! Poims \1swed to specify 1be position aod sh.ape of Jines.
Ve1'y l..ong Baseline lnterferomelry (VLBl): A me3Sl11Cmen1 system collection signals from dis-
t.wt q,i:asar s~rs dlat :,;How millimeter.le\'el meas\lremems of position through time. 11 fonns
the founda1iou of our positional me.asu.rcment system.
Virtual rtff't'e'D<'<' $tarlon (YRS) nC'lwork: A set of ratbercloseJy-spaced GNSS base srntions plus
con_untuiica1ion links that simplify and Staod:utlize real-1imc differential correction within ll
region.
WAAS: W ide Area Augmentation Sys-tem. a satellite-based ir.,ns-.miu ion of cozrection signals to
improve Gf>S positional estim ales, largcly through lhe removaJ o f ionsophc ric and aun o-
spberic effects.
\Viwelenglh: T he distanoe bchvecn peak energy values tll an electromagnetic wave.
\VGS84: W01·ld Geodetic Sys1em. an E:uth-ecn1ered reference ellipsoid used for defining spatial
locations in three dimensions. Very similar to GRSSO e Jljpsoid. Commonly used as a basis
for map projectioo.s.
Zenith angle,: Th¢ angle measured betw~en a \'Mic.al line upward from a poinl on lhe Earth and the
line. from dun poiol to dle Sun.
666 GIS Fundamentals
Appendix B: Useful Conversions and Information 667
Area
1 hectare = 10,000 square meters 256* \
1 square kilometer = 100 hectares
1 acre = 43.560 square feet
1 square mfle = 640 acres
1 hectare = 2.47 acres
1 square kilometers = 0.3861 square
miles
Scale
1 centimeter distance on
Angles
map equals a distance
1 degree = 60 minutes of arc
Scale value on the ground of:
1 minute = 60 seconds of arc 1:5,000 50 meters
decimal degrees = 1:10,000 100 meters
degrees + minutes/GO+seconds/3600
1:25.000 250 meters
180 degrees= 3.14159 radians 1:50.000 500 meters
1 radian = 57.2956 degrees 1:100.000 1000 meters
Spherical angles on a globe:
c,..,.,"'
m.,.,.., 1 inch distance on a
map equals a distance
Seate value on the ground of:
1:6.000 500 feet
1:15.840 1.320 feet
1:24.000 2.000 feet
1:62,500 5.208 feet
1:1 00.000 8.333 feet
\
South Pole ·9C>
- - lcng, ludC - --
668 GIS Fundamentals
11
12 13 14 15 16 17
•
'
Appendix B: Useful Conversions and Information 669
Trigonometric Relationships
0 e
B
Coordlnat:e Geometry
Coordinate x 1 = X 0 • dx
geometry (COGO) ~·Y0 • dy
d, • L-cos (0)
J- d, - dy • L·sln (0)
l
: L
1herefore
If we Mow the location of o ~nt. xn, Yn, and have measured rhe
azimuth and dlstoflCe ,o another' po1m xu, Yu \\lhor ore the coor"d1nctes
for th2 unknown point, ><u, Yu?
Suppose Xn • 12. Yn•3. 0•6.8. end oz,murh • 242"
Fromobove.
O' We con calculat e O from the oz1mu1h
dy • D cos(O) 0 .. 242°
dx • D sin(O)
e • 242 - 180 (see fogun,)
~-Xn · dx
Yu =Yn - dy
So
270° •o' dy • 6.8 cos{62) • 3 2
I)
d, • 6.8 sm(62) • 6.0
Xu: 12 · 6 • 6
Yu 3 • 3.2 -0 2
l\i· Yu
180°
670 GIS Fundamentals
y
Lo1!1udc. loni;rtudo?: from known J...O Cor1c:slon
p • (x' • y1°) v de r,ned as aboYt:
x
IJt • ten i (t-f) e'' .ti~ bl'
long,:ude • tc,, 1 (y/x)
south pole-
101.llliCh'. ,ori
' ( Pz . o,e_:z,co.,)(',')
b·•'·•ln~¢))
h· p _ ,,
co,(qi)
SOUlh polt
Appendix B: Useful Conversions and Information 671
(...
·-
~$1flYC X-4xll
lnll!:rsttts 01
;..o +·o
w
p
·-----
d----·(x,.. Yw)
c
A
y
B
A 8 C A2 = B2 • C2 + 2BC · cosa
sina
::: sin~
= siny
2
8 2 = A2 , C • 2AC · cosfl
2
c2 • A2 • 8 • 2AB · cosy
Appendix B: Useful Conversions and Information 673
Deflection of Curvature
A B
d • 0.785·AB2
~
"'
.c
a.
"'
"'
.c
+-
....
0
(/)
:::,
'6 AB • R · sin(O)
~
er:
. AO= R · 6
0 The angle 9 is
expressed rn rod1ons
674 GIS Fundamentals
Appendix C: Answers to selected questions 675
·-~
~
1.4: GIS software differ from other software primarily in tracking geographic
coordinate location. tieing these locations 10 anribute data. and storing and
pr<Wessing large quantities of dala. While many softwares are designed 10
store and analyze large volumes of data (e.g .•video editing), and some otlier
softwares focus on coordinates (e.g.. computer assisted design programs for
three-dimensional objects). GJS records coordinates that are tied 10 real.
physical locations. Coordinates are defined relative to a physical origin. usu-
ally some point on the Eanh surface. or the ne.aJ tbe center of the Earth. and
stored in the computer. Sets of points are combined to characterize the loca.
tion and shape of geographic features, and non-spatial attributes are associ-
ated with these fearures.
t .6: By our definition in this chapter, paper records and maps are not a GlS,
because they are 001 computer based. However. they do serve in our collec·
tion. storage. analysis. and output of spatial data and information. so some
would argue tllat they are a GIS. just an extremely low 1cclmology , ,ersion.
6 76 GIS Fundamentals
Chapter 2
2.2: Our mulliple levels of abstraction from tl1e physical, "real.. world usually
include a data model. data structures. and machine code. Tue data model
describes the real-world objec1s witl1 a subset of simple objects and relation-
ships. Data models typically encompass our mental iniage of bow 1he real
world entities are connected, shaped, or related. These models may often be
illustrated by box and arrows diagrams. Data structures are how these objects
are organized in a computer, for example. what parts go in wha1 files. or bow
the files are linked on 10 another. Machine code are the O's and I's used 10
swre info:rma1 iou.
2.4: a) imerva Vratio, b) nominal. c) nominal, or ordinal (if read along a brightness
gradient),. d) ordinal. e) nominal. f) in1ervaVra1io.
2.6: a, b, f
2.10:
I 36°45'12" 36.75333
2 114°58·2· 114,9672
3 85°19'7" 85.31816
4 14°00'33" 14.00917
5 275°30'00" 275.00001
6 0°59'43" 0 .99528
7 182°19·22· 182.32278
Appendix C: Answers to selected questions 677
2:12
2.16; Topology is tile smdy of spatial rela1ionsllips. and it is imponant i.n <HS
b~ausc 1opological vector data strucrurcs have certain positive properties.
Topological relationships such as adjacency. connectivity. proximity. and
overlap are important in structuring and analyzing data and are often helpful
in ensuring da1a quality. Many 1opological characteristics are invariant 10
warping or bending spa1ial fearures. This is imponam because we o lien warp
spatial da1a througlt map proj ecrious.
2:18: a) I. b)2. c) 2. d) I. e) 2
678 GIS Fundamentals
,• ••
?Q·
. JllU , U·patl
·
c:? .)._ I
10
'""' !
Jro1y
....,.
lOl,3'0
' "'---
8 -~ '-
'
single-part
I
2
'
Mohlcnd
SOttt:ttiO
.
251.540
2.4,090
3 Sieil',' 2'.710
222:Mlxed cells: may be a problem under several condi1ions. for example, wbeo
there are veiy differenl values for materials within the cell. or when we are
interested primarily in one factor that is a minority presence in a cell bul not
recorded. ML,ed cells may be addressed by decreasing !he cell size. by care-
fully developing ~,e assigmnem ru.le for cell values when !here are mi.x,ed
cons1imeots. or by recording multiple anribu1es for each cell. including tbe
identity and proportion of values in each cell.
2.26:
2 2 1 2
nul1 2
null 1.75
2 2
I null 4 3
null 3.5
3 null 3 4
0 0 b 0
O b
0 b c b
c c c
null
null c
c c c c
2.28:
One-to-one
cell-IDi-fOUnt One-to-many
5 I attribute table
attribute t able 7 I
(rows first.start i cell-lD coun1
upper-lefl corner) 2 I
3_ i- I 2 2
3 I 3 6
9 I 4 5
10 I 5 I
4 I 6 I
6 I 7 5
7 I 8 J
8 I 9 I
8 I 10 I
J I
4 I
J I
7 I
7 I
4 I
I-
4 I
3 I
8 I
7
4
3
I
I
I
l
2 I I
680 GIS Fundamentals
2:30: All object data model defmes "narural" objects. from the point of view of
the model designer. that encompasses spatial and anribnte properties. as well
as functions or operations I bat may be specific to that object Rather !hat
breaking data inlo thematic layers. components from many themes may exist
within an object. Objects may relate 10 other objects through specific or spe-
ciaLized. unique correspondences or collrlections.
2.36: We compress data when data vohllrles are 100 large. particularly for raster
data sets. Cells are recorded for each location in a raster area. and gigabytes
to rerabyws are often stored. Vector data sets typically record shape-defining
locations. and only where features of interesl occur, for example, a road line.
This conttasts with a raster representation which records a sel of cells for a
road. plus cells for the surrollrlding area where there is no road.
Chapter 3
3.8: Mag11etic north is at the point where lines of magnetic anraction converge.
and a weightless magnet. if suspended in a frictionless media. woulld point
straigJu down towards the center of the Earth. Magnetic nonh is cunenlly
located near Greenland.
TI1e geographic north pole is tJ1e northern intersection of the Earth' s axis of
rotation with the Earth's surface. It is located in the Arctic Ocean.
3.10: Multiple danuus exjst because we have Unproved datums through time. and
bec.ause we develop different datums for different purposes. Datums are
required for measurements, and so most governments estimared datums
when a sufficient number of points were surveyed. Additional pointS with
improved methods will led 10 subsequent estimations. or versions. of national
datums. in most cases with higl1er accuracies. Satellite and other measure-
ment capabilities developed in the second half of the 20th centmy added
globaJ datums. increasing the number of available datums for most loc.ations.
682 GIS Fundamentals
3.12: Sea level is no longer used as a reference height because sea level is rising.
so rbe mean height would depend on the length ofrhe measurement record
and is nor stable, there are. loc.al variation.~ due to persistent currents. changes
in temperature, or changes in salinity. and sea level varies on a 19 year tidal
c-ycle. Technologies have developed such rhat we can quickly and easily
measure accurate height differences smaller tl10n chese sources of sea level
variability.
3.14:
NAD27 NAD83(8 6) HPGN
''" ....... lol 11.1:le .......... ....... """'""'
'°""'""
~10"11'
-
I G<lf {S) 32 U 15' nt'09"42' 3Z<O"l5t827' 111"09·,s 12or J1'u1s1a10· U7'09451l01'
3.16:
NAD63(ZOII) - 1966 NAD63(Z011) - 2015 SurfOGt shifr d1s1~t (cm)
1amudt b\gi,ude
''" I01rtucle ICf\iglrudt &cegl"\Jdt lornudt
3.18:
NAD27 NAD83(2011 )
Srart
""' l!IM\G('
""''""'
·11,.<!0'00"
c1-,1:.i (,,,l lc111t11dr. i0n9tlllX e:ao11on (ml
3 ~ti4'1CM
"'' )0'00' -s,n'oooo· 200 •s"zn, w,r ·IU"00041J.ll' eou•9
...,.,.
-
..ttoo·co- ·690000" ,co 47'00'00 llSO' -bt~S~'S! uu· Lt;9.8U
'
l)~OOOl '7:).)" ..s1'N~ ll34° 050,
' 25'00'00" &l'l0-00- I
Appendix C: Answers to selected questions 683
_
3.20:
•
1
,..,,,..
-
..toooo·
2:s•oo·oo·
·69-"oo·oo·
.fl-Woo·
200
l
IJ
"
.", L>
••
3.22: MH'\V - 4.99 above gauge 0. NAVD88 at gauge is 0.43 above gage zero.
So MH\V is 4.99 · 0.43 above NAVD88. which is 4.56. Hospital should be at
30 + 4.56. or 34.56 ft.
3.30: The UTM coordinate system defines map projections for all portions of tlte
globe,. Areas berweeu so• s latitude aud 84° N latitude are divided .imo 60
wide zones. each zone ru,uling from tlie equator 10 tlte northen1 or soutl1ern
limit. Separate transverse Mercator projections are fit to each zone. Negative
zone values are avoided by spec.ifying false castings and oorthings. coordi·
nate values added to intenuediate projection coordinates.
684 GIS Fundamentals
3.32: Benin. Israel - Transverse Mercator. because both are narrow with the main
territorial axes are nonh-soulh:
Bhutan - Lamben conformal conic. because they are relatively narrow and
have main axes oriented east-west
Slovenia - either Lambert coufomial conic or azimuthal, because although it
is bas a sLigh1 eas1-wes1elongation, the shape is son1ewba1 round. and so
could be well represented by either fom,s.
3.34: The Public Laud Survey System (PLSS) is a systematic subdivision of land
carried om in tl1e U.S. for the purpose of uniquely identifying property
boundaries. Principle meridians and baselines are established. and 1owuship
rutd range lines surveyed parallel 10 these at 6 mi.le imervals. The 1ownsbip/
range grid is further subdivided into I mile squares, in tum subdivided into
smaller units. The PLSS is nol a coordinate system.
Appendix C: Answers to selected questions 685
Chapter 4
4.2: a. c, and e.
4.6: Acompu1erscrec11 is now tbe most common map media: millions of maps
are rendered each hour through applications like googJe map and o,apquest.
Paper is the second most common media, and is most used when a .hardcopy
fom1 is required.
4.8: A large sc.ale map typically shows more detail. because each fealure is drawn
larger, and tliere is more opponunity to show variation in shape.
4.10: Complete the following table that shows scale measurements and calcuJa.
tions.
Ground distance Corresponding map dis·
Map Scale
and units iance and units
17;120 kilometers 16.85 inches 1 : 40,000,935
I
5 unll rod1us
·<\
,.':~:\.
O
- - ' ,/
,,
-.:,
10 unit./
tOdiUS
'
•
I
'
------ ··.,
~ u,-1 roebus
a ·:--.·\.
----..
JO urur ..·"'
rodius
)
4J 6; Aspline is a line smoo1bly fil lhrough a sel of poin1s. Splines are used 10
increase venex deusiry wilhou1 substantially slowing the digitizing process,
particularly for smoo11tly-curvh1g features. suclt as river meanders or "' llld·
ing roads. Splines fir piecewise polynomial flloc1ions while in1posiog smoo1b
join points.
Appendix C: Answers to selected questions 687
4. 18: Manual digitizi11g involves foting a map or displaying a scanned image. and
manually positioning a pointing device to indicate 1he location of each node,
vertex. or other sl1ape-definin_g coordioate. Digitized data are in vector fom1.
Scan digitizing uses a machine to record differences in colors or brightness
for a map or image documen1, usually into a raster grid. Lines. points, and
areas are defined by some lhresholding teclmique. and lines or points may be
u1inned and converted to a vector fo,m. as needed. Mam..1methods have u1e
advaruage of low costs for small maps. inexpensive equipment requ iremeots.
feature interpretation by humans when using substandard maps. and rela-
tively little training. Scan digi1izing is inexpensive for large nwnbers of very
detailed maps. may be automated. and may be more consistent.
4.20: Map registration fixes a map or image to a ground coordinate system so that
the coordinates of any poinl in the media may be delennined easily.. The pro·
cess consists of identifying control points that are visible in both the image/
map and on the ground. collecting coordinates of u,ese points in both the
image/map system and the projected "ground" coordinate system. fining a
system of 1:raosfom1ation equations to the coordinate data sets. and applying
these transformation equations t'o the image to convert it to the proj;ected
ground coordinate systeaL
This is a linear 1ransforma1ion because the x and y variables are 001 multi-
plied together or raised 10 a power largcrtban I. by definition the equation of
a straight line-.
4.24: The average positional error is likely 10 be the. same or larger than the
RMSE. The RMSE is usually minimized, or closely related 10 a minimized
quanrity when statistically fining the coordinate tn111sformation. Ifwe col-
lected a representative sample. we expect u,e Ri'v!SE 10 be approximately
equal 10 the average error. However. if our sampling was inadequate or
biased, often it is in areas where we have difficulty identifying good control
points_ and hence our RMSEs tend 10 be larger in these locations.
688 GIS Fundamentals
4.24: Transformation bis the mos1 Likely to have Lowest average e1TOr al inde-
pendeotl)' measured poims. 11 depends on 1he distribmion and number of con-
trol points. but in most cases higher order polynomials overfit. and wbUe
exhibiling lower R.M:SE values. they have larger errors.
4.28: Me1ada1;a arc 1he data abom da1a. They describe 1be exienl. l)']>e, coordinate
system. lineage. attributes, and other imponant characteristics o f a spatial
data sel. Metadnta are required to evaluate the adequacy of a data set for an
intended use.
Appendix C: Answers to selected questions 689
Chapter 5
5.2: GNSS is based on range dislanc.e measurements from multiple satellites to "tri-
angulate.. a location. Orbitlllg satellites transmit radio signals along with precise
positi.oning and timing information. The current distance beiween a satellite and
a receiver is a range measurement. A GNSS receiver combines multiple. simul-
taneo11s range measurements to estimate loc.ation in near-real time.
5.4: Typically 4 satelLites are required for a 3-dimensional fix. although a icx may be
detennined under some assumptions with data collection from 3 satellites over a
short period of time.
5.6: GNSS data range in accuracy. from sub-centimeter for !he highest accuracy
using carrier phase methods. to 1ens of meters usin.g real-time CIA positioning.
Accuracies are highe.st when using high quality receiving systems in fiat terrain,
with 110 buildings. trees.. or ot11er structures 10 block views of the sky. Accuracie.s
also i:mprove when satellites are widely spaced.
5.S: Fig ured depicts the lowest POOP. with tbe widest distribution of satellites.
closely followed by figure b. Figure a has the higl1est POOP, with t!1e tightest
di.stribU1ion.
5.12: DuaJ frequency receivers prin1arily reduce ionospheric delays. and hence
uncenainty in position. They may help somewhat with aimospheric delays. but
typically much less 1ban with ionospheric delays.
5.14: GNSS accuracy typically decreases as terrain becomes more varied. on,i1en
canopy or buildings obslJ1lct a ponion of !he sky. Positional accuracy decreases
beca11se sub-optimal constellatioos of sa,ellites are more likely to be observed.
Satellites are in closer proximity. and measuremeots are less independem, and
hence to not reinforce each other to improve accuracy.
5.16: WAAS is the Wide Arca Augmentation System. a real-time difforen1ial COITl!t·
tion system designed to aid navigation in U.S. civil aviation. Correction factors
are derived from a nationwide network of control stations and broadcast from a
geostationary satellite located over 1he equator. The system is desigc1ed primarily
for at1iation and related uses in North America.
690 GIS Fundamentals
5.18: COGO stands for coordinate geometry. COGO is the calculation of co<>rdinates
based on angle and distance measuremeurs.
5.20:
Xi- Y,
1- d.. - ·
... • •• •,
...
v, d,
I
L Y,
....
0 0 70 100 9•0 342 94 () 3<2
,s
.
15 130 336 1256 1606
L
d, 400 0 200 1<1)4 1'1J4 541 4 1•1 .C
,so 80 20 183 81 1683 88J
Point l 2 3 4 5 6 7
- -
Azimuth 278° 42° 103° 199° 359° 245° I oa 0 14·22"
5.24:
Stcrting point PO. X • J,200 Y • 400
p.,.,,, ID Aumrth Distance Oenox Oel1a Y x y
Pl 9, ,o, 10<.6 ·92 1,304 6 3908
P2 192 n ·160 -753 I 2886 3155
Pl 262 204 -2020 ·28 4 1,0866 287 l
P• 6 ,o, 10.9 103.4 1,0975 3906
P5 18 33 10 2 31< 1.107.6 4220
P6 105 88 850 ·228 l.1926 3992
Appendix C: Answers to selected questions 691
5.26
Point
r.r.Yts
(0,,0
~s-1
l
?1000.C&n
2
PIO A.66'60
3
,,w
P1D OY21'3
•
PIO P.1(()4,0:
-.vtv
5
Pll)CW,W
0..,1~
"°"""' """ AV$ APQJ TX $1~ (.A
Latitude 45°05' 454• 30° 17 29 7" 33'24' 15.9" 111.'° co· JJ.a· 41' 49' 551"
Longitude 93~00' 179• 91' 41' 34.3• 118°24' 54 2· 109' 30· 20 2· 119'59' 216°
Orthome1ric
Height (ml 280.15 167,45 442.12 3.61199 401.37
692 GIS Fundamentals
Chapter 6
6.10: The most common band combination is a blue-green,red set tltat corre-
sponds 10 wha1 the human eye observes. Another connnon combi11a1ion is a
green-red.-near infrared set. which provides bener vegetation discrimination.
6.24: The four systems. from Landsa, ETM+ 1hro1,g)J Q11ickbird represem a range
of resolutions (from 30 m through 0.6 m). spectral ranges (full color through
near and mid inf.rared). per scene coverage (from tens of thousands of square
kilometers through a few tens of square kilometers), and more lhe two week
to les.1 lhan two day repeat thnes. Finally, costs rise markedly along Ibis gra-
dient. Allhougl1 near the end of its functional life at the time of this writing.
the ETM+ data were available for h11ndreds of dollars for a full scene. while
the hlgher resolution data were from tens to hundreds of thousands of dollars
for au equivale11t area.
6.26: Image types are selected if they measure the phenomena of interest to the
required level of spatial and attribute accuracy, are wi1bio the 1eclm.ical c.a)Xl-
bilities of the organization. have an acceptable probability of suc,cessJul data
collection. and fil with lhe available budget for acquisition and processing.
694 GIS Fundamentals
Chapter 7
7.2: Do the data provide the required infonnation, for tlte required area. at the
necessary level of categorical and spatial detail. aud at the accuracy needed
for the intended use?
Chapter 8
8.2: Database management systems are computer software tools that aid in the
entry.. organizatioJL analysis. distribution. and presematiou of data.
8.4: A one.to-one relationship among table means thal fo r every row in one table
tl1at in some way is matched to a row in another table. there is only one row
in u,e second table that watches. A many-to-one relationship means that one
row in a rable may match many rows in a second table. Note that by match.
,ve do not mean completely march. Usually we are using a colmnn in each
table 10 match the tables: 1he rows are considered to match when th,e match
column has the same value in both tables.
8.8: Tite eight basic operations are illustrated in the section ''Priniary O~rations"
in Cb.apter 8. TI1ey are resiric,. project. product divide. union. imersec1. dif·
ference. and join.
8.10: Sets from OR conditions will have the same number or more members than
the component condirioos.
8:14: Nollllal fonns are a way of organizing database tables. Wbeo foUowed. they
optimally structure tables to remove redundancies. efficiently store data. and
orgalllize data in "natural .. groupings that speed analysis and increas:e flexibil-
ity.
696 GIS Fundamentals
8.16:
,..--.....,_
ldl l'Vl
y WO
z ••
td2 rm
x
y
5
I
y
z
..••
ldl pos tm
I
.
A rl A 6 A ,.. 6
y pr Q 4 v pr I
y nn
y nn I
R ,..
N 3
R
,.. -
L 2
Q mn
Q mn 4
S. 18: A functiQnal dependency occurs when knowing the value of one variable
define.s the value of another variable. For example, if I know a person 191a1 a
person was a German citizen in 2014, tl1en I know that tl1eir Chancellor was
Angela Merl<el. or iftha1 a person is a Chicago Cubs fan in 2015, !hen lheir
team has 1he longest ac1ivc period will1ou1 winning 1be baseball World Series.
Chapter 9
9.2: Selection operations apply criteria to features. and identify features that meet
u1ose criieria. The criieria may apply 10 spatial charac1eris1ics. for example.
u1e si:ze. shape. or location of a polygon: u1ey may apply 10 non-spa1ial attri-
butes of 1be fea111res. for example the value or condition of an anribute.
9.6
10 cotg
I A
2 7
re:clas:s 2 A
3 B
5
9 • A
3 5 B
6 6 B
7 A
8 A
9 c
10 c
A
c
B
698 GIS Fundamentals
9.8: Lruge are white. medium ligh1 gray. sinall darker grey.
14
6
9.10: The modifiable area u11i1 problem arises because sta1is1ics for aggrega-ted
areas depend on tile aggregations. We may combine adjacen, areas. and cal-
culare sums. means. medians. and other arrributes of the areal units. If we are
selective about how \Ve aggregate, we may dtange these statistics solely by
changing 1be aggregation unils.This is 1be modifiable areal uni, problem. The
zoning effect is bow aggregate s1a1is1ics change with zone boundaries. The
area effec:1 is how s1alis1ics change when changing 1he size of aggregati.on
areas.
Appendix C: Answers to selected questions 699
9.12
700 GIS Fundamentals
9:14:
I
New!O Closs Count
c a l 17
a
b b 2 44
c_L 1 l 7
9.18:
ID distance
I 250
2 0
3 1500
4 500
5 0
1,000
meters 6 500
7 250
8 250
9.22:
layer t - lf'lluttcr- Loyer 2 - 10 ettc1e - Ootput
• • ·' •
A
' ,2 '
' .•
B
c B
c
,J ,$ 3 ,5
• • •
' l
' 0
• ' , • •
CJ
'
10 1C00t] S1tS
A Cl'$, ftt I ""'
I ~JT)T
XC
IO
1 ""'
m
coot · ,11s-
..."" ...""
•c ....
Wl1 t~s~
l
l
..,.__
'""'
2 xc
srn,
'""' • CA,
• •' ~
Wl1
Gn
.~oi,e
! ftt
• au U(t>
'""'
9:24:
Spo!IOI Ool o
c
I
'
I~, ,•
A l
'
• •
10 ... - • -- .,.., ,.,...
10 ••• """
10 ... -...... -..• ,...,.,3.... ......
.....
.
1 E..,,. I 12J!j II
l
2 ""
""
l'H9
u .....
! CH l
A
c
l
-
II
"""'
• """" Zlll
2
3
'5
...,
!Wi
...,
1863
U ""M
V ((IM
II
'
Cumo
..,. ,,.
.,... ..,
l..JJl
•
7
19tt;l
19lt;I
U<OM
LC«
L C,:c .
o lcumo l.JJl
II
.,... ...
....... ,.,
• 1010 r c« o 1G1.1~ ~.Jii
Appendix C: Answers to selected questions 703
9:26:
A 0
2
c d
B
..
XfT
0 urban Un«iln null
b fcrest Wol~ nu11
c ..,
d
•~
urbon
fortsl ,..,
sond
clay_
nuU c
h nUII null .send
i urbOn Unc.oln dqy
fol'tsr+wonowo vun SOM
k ore.st W-0 l<WiO nu! nu
J u,1,an Lincoln null nu I
9:28:
Loyer I IAy<r 2
A B x x x x I I I 2
A
",. A
B B w VI x x 3 3 2 2
A c c B w VI w x 3 5 5 2
A c B B w \'I w w 3 5 • '
10 !yp< .....,, 10
"
,2 GO\~·
10
'"" ,
A
c.ount
I x , • •
2
•• w ,
"
x
'
3
' l 4
2 B 6 2 \'I 9
• •c ww ,
3 c 3 •
704 GIS Fundamentals
9.30: Ncrworl< models are conne<:tcd Linear graphs 1bro11gh wllicb resources flow.
or to which movement may be constrained. TI1ere may be both source and
demand features connected to these networks. Networks are different from
many other spatial models in that movement oroccurrence is limited to the
network. .and they often track time-varying
9:32: Note answer values in c.ase.s large address ranges may be off by one address
unit
Oegilbo Rd Bend<90 SI
,. lnoloo R
!Iii
. ,r,01 Yaffll Rd
S !:]
"
.... Inoloo Ref 0<,91lb0 Rd
"" 8erid1go s,
Appendix C: Answers to selected questions 705
9.34:
a)
I 3 Recloss
Tobie
4 b) operation
5
0
0 b
o) in out
b)dissolve
l 0
2 0 c
3 b
4 c d
5 d
0 d
706 GIS Fundamentals
Chapter 10
10.2: Compal~ble cell sizes are ofte n required in raster operations because other·
wise the input is ofien ambiguous. If one input cell is substantially laiger or
mismatcb:ed 10 another. ii may be uncenain which input cell value 10 clloose.
10.4:
on me
) 2 • ~~ on t l'C e.retic
• J • 2•C•l•Do5o7,}l,2
II 10
• YCI.IC' n:,rigc.
• lM
$O\IOl'C
I ©
' " ' . "'
5
. • 2
>
[i]
• "•
• 16
I•
• 1 , - - , - - , , - - , tnlffiiCtl Of'I !he: 'llo<r
5 3 M ·.9.IU2.13.IC>
• & 11
I
•• 10
10.8: CI • 1. C3 • I. C4 • 0. CJ0 • 4
IO. JO: Any nested operation. something like con(isnull(layerl. layer2. layerl). or
sqn(abs(layer I)).
L0.14: In most systems, a NULL value is returned when it appears as any i.npu1 of
an operation, unless there are explicit instructions to ignore null values.
Appendix C: Answers to selected questions 707
l0.16:
I I 0 0 0 1 0 1 0 1 0 0
0 0 0 I 1 0 0 I 0 0 0 1
ond :
I I 0 0 I I I 0 I I 0 0
J 0 0 I I 0 I 1 I 0 0 I
I 1 0 0 0 1 0 I I 1 0 I
0 0 0 I I 0 0 I I 0 0 I
or :
I I 0 0 I I I 0 I 1 I 0
1 0 0 I I 0 I I I 0 I I
lO. J8: A clip function may be implemented in a rasrer environment by ere.a ring a
clip layer which have ce.lls with a value of J wherever data in the targel layer
are to be.kept, and Ocell values corresponding to are.as where the ta:rget layer
data is to be discarded.
10.20: TI1e kernel is a high-pass filter. It would highlight local differences in the
target raster. with values changed liule in areas "1tere cell values are about
u1e same, aud exaggerating values that are higher or lower than u1eir neigh-
bors.
10.22: High spatial covariance means cells near each other tend 10 have similar
value:s: low values tend 10 be clustered near low values. and high values clus-
tered near other higJ1 values.
708 GIS Fundamentals
10.24:
5 1 4 7 tM d1a9ono1 of 5
"•R'7 9
7
row/column
COS!• 2· 5·3' 10•2'1'10·<•10•
a squoreis 1 41
'2 1·10,2· 5
l 4 l 2 x the edge 1 4 ~
• 120
Appendix C: Answers to selected questions 709
Chapter 11
11.4:
.,
62·2"6<4 ° 66°
b) 67 63 62 6.4 • 05 • 005 dlldic. -67 • 2' 65 • 70 • ·O 14
dz/d'x •
20 80
65 •• l>l • 68
62 • 2'63 • 67 ,
70 68 66 d?/f!y• • ·Ol.5 dzJ<tt • -66 - 2"68 - 70 • -0.21
20
80
17 • 2'19 • 18 •
c) 18 19 • 21 -0 I ·18 • 2' 2t • 20 • ·009
23 17 dl/dX. dztdi<
lO 80
21 2' 19 17 • 2'2) • 18 •
dz/dv• 23 • 22 • 005 dlldy • -18 • 2'22 • 20 • -0.01
20 22 18 20 80
710 GIS Fundamentals
-
710 706 704 700 696
b) dZ/dx • •0,26. d'z/liy • ..02Q.
708 705 705 "697 700 slope · 21 l~-
- o,c)CCI • 4l t'
'709
71L
- 705 696 694 <) 0,1~ • -0.J~. di/Oy • 0 ll.
.iope; • 20 78~
714 712 708 703 698 c»peCI • 107 24°
1000
900
j I l,
~
.,c
+-
500
.,'-
<J
400
.,"-
~
300
"-
.!a 200
Vl
100
0
0 20 40 60 80
S lope (degrees)
Appendix C: Answers to selected questions 711
l 1.12
0
""'
10)8
.01!
..... '!lo,
'""' "'''
U)47•
104t '
..
,, .... """ 1o;o
~
•
.... ...
.... fr!,r()
_... ...
!'>I> •
,,.. 0
1016
[ ~
where d2 is the distance from the upper point to the point on the contour. H1
and lib are the upper and lower e!e,,ations at known points. and d1 is the dis-
tance between points H1 and Hi,.
712 GIS Fundamentals
11.16:
A.210
•
d,.. ~.260 ¢Ol'!OJr 3,0 Oo'l 1#\c: rroM
d!llO'IU A ro (QrlfOl,I" VCIA Z~-0 s
o 10 r 11 ;u, "'" trom o F.420
118 ,, • 1 ,a . 12w • 110>
(240 ?JO)
C.180' '°"''
E 1:, °"'
0 ItJOO °"
102.li;,,,c ll'OC'l'I
unitJ tro,11 E
II . IS
____ ,..,,
...........
- .....,..., - , - - strtom
8cely
IO<Ot!On
Appendix C: Answers to selected questions 713
l l .20: A solar zenilb angle is 1be angle measured a1au obseJ"\'lnion point be1ween
the venical line. "straight up:· and the sun's location. The solar azimuth
angle is 1be angle rurned clockwise from geographic north to the sun's loca-
tion. The solar incidenc.e angle is the angle betwe.en an incoming sun 's ray
and !be surface normal line.
l 1.22: A v iewslted for a point is the combination of areas that are visibl•e from a
point. Viewsheds are used in landscape design to maximize scenic vislas or
bide powerlines. roads. or other features. and they are used in teleconmuuti·
ca1ioos 10 calcula1e in<er-visibiliiy and commuoic,uioo oe1works. They are
calcu!a1ed by tracing rays from a view point 10 all viewable locations. using a
DEM: 10 estin11te if the angles 10 all intervening points are lower 1han the
angle to the potentially visible point.
71 4 GIS Fundamentals
Chapter 12
12.4: Adaptive sampling will likely give a better estimate of the surface over all
numbers of sampling points.
12.6:
Appendix C: Answers to selected questions 715
12.10:
ID x y z
0 153.9519 4,478.714.6 2040.6
·"•• . l
.8 I 151.280.9 4.479.647.3 18630
,o
• .5 .t, .
7 2 148.228 5 4.472.143 4 19921
·" .z " 3
4
148.906 8
152.3832
4.464.9786
4,466.92$ 8
2540 I
2106 3
5 158.827 3 4.475.746 9 22832
·" ' , ,10 ,9
6 160.6079 4,475.874 2 1933 5
.l ' •• .12
)J
7 163.024 4 4.477.357 .9 18364
8 164.465 8 4,481.173.5 1838 3
9 166.416.0 4,468,285.4 2523.9
Unknown points. 10 164.169.1 4,468.370.4 2138.8
ID x y z 11 160.692.7 4,468. 709.3 1854.2
12 156.537.9 4,464.724.2 18669
0 155.859.6 4.477,159.0 2.200.95
b 147.580.6 13 167.306 3 4,462.816 5 2453.8
4.478.884.2 1.872.06
c 162.535 7 14 143.946.6 4,482 44f> 4 1837.9
4.469,9602 2.059 26
d 151.71<.9 15 144.709.7 4,475,323:.0 1912 8
4,463.755.2 2.288.4
12.14:
mo.xinwm distance
circle
•
• •
• ••
.
• • ,• ••
.....
• •
.. •
-
•
• •
.~ ·
• •
i'.stor'ICe circ.le
•
12.16:
Appendix C: Answers to selected questions 717
12.18: Kentel mapping assigns a density surface around observation points. The den-
sity surface represents the likelihood or probability that the organism or popula-
tion occupied the area. "Stacking," or adding together tbe density surface.s for a
set ofobservations creates a combined probability map. giving an estimate oflhe
occupancy across combined observations or samples. The density surface is typ-
ically described mathematically by au equation. with the shape cotnrolled by
parameters. TI1ere is often a "bandwidth" parameter that affects the· "peaked-
ness'" of each individual density surface.
12.20: B s hows tlie wider bandwidth. because the disiributions are broader around
eachpoim. and the maximum values are lower. indicated by less san>rated sltad-
ing near the poi.Ju loca1ions.
71 8 GIS Fundamentals
Chapter 13
13.2: Criteria often must be refined bec.ause they are expre.ssed in a way tha1 may
not be directly applied. Distances or groupings lllll)' be ambiguously
described, e.g,. near. far. large. or small. and olle111hese must be quamiiied
before en1ry into a car1ograpl1ic model.
13.4: A discrete weighting bas specific. distinc1 categories. Roads may be pass-
able or impassable for large vehicles. rivers deep or shallow. or fores1s ever-
green or conifer domina1ed. Weightings may be defined which are specific 10
1.hese categories. e.g.. passable roads receive a weight of0.5. while impass-
able roads a weight of zero. ln contrast. weightings may also be continuous,
in 1ba1 each road may have a measured width, e.g., 12.4 meters. and 1he
weight may be some continuous func1iou of 1bis wid1b. e.g.. width • t 32.7.
13.6: A. original
B • reclassed by T, with low and high input values reclassed to high output
values. and intennediate input values reclassed to low ompu1 values:
C · reclassed by W. with low inpm values set 10 zero when below a threshold,
and then a lfoear increase in output with input above the threshold:
0 - Reclassed by R. High values above a thres.hold are set to zero. and low
values befow the threshold are set to 1.
13.10:Correc1 answer is C
A · applies area test 100 early. later intersections may split candid.ite poly·
gons. rendering smaller area polygons than the size limit specified for I be
ru1alysi.s:
B • commits same error. later in the proces.s:
D • omi1s raster 10 vec1or conversion on elevation. and alJows areas in same
parcel that are satisfactory and adjacen1. bu1 split by od1er criteria 10 be in dif·
feren1 polygons, so may rejec1 acceptable areas.
Appendix C: Answers to selected questions 719
720 GIS Fundamentals
Chapter 14
14.2: Data lrnusfer is perhaps the process most commonly aided by spatial data s1an·
dards. Among otlier things. s1andards require 1ba1 spatial da1a sources. methods,
and characteristics be described in a consisteot manner. Spatial dara s1a11dards
provide a predictable way of organizing data so that it may be transferred among
organi.za1i ons. Standards allow data to be transferred wi1hom loss of information.
14.4: TI1e mean reports the central error. the average error one would expec:a when
sampLing from a population. A freq11e11cy threshold describes the percentage of
errors above or below a value. 111,is give.s some general ootion of 1.he li.kelihood
of large o:r small errors.
14.6: Position.al acc11racy repons how close the represented locations of objects are
oear 1he tn1e locations of the objects. Attribute accuracy reflects how often the
value of a categorical attribute is correct (discrete) or how close and int'eival/
ra1io anribute is 10 the true value (con1inuous). Logical consis1ency does no1
in1ply either spatial or anribute accuracy. but just 1ha1 mul1iple themes or 1ypes of
are consis1en1. e.g.. there are no roads in a lake. fires on salt OalS. or oil deposits
in granitic rocks.
14.8: The steps in applying the NSSDA are I) identify 1es1 points. 2) identify source
for •1n1e.. poinlS. and extract inuh corresponding 101es1 points. 3) Calcula1e 11le
positional error for eacl1 tme/test pair. 4) record the error data in a standardized
table, which includes calculation of e.rror statistics. 5) create 1he documentation/
metadala describing the accuracy assessment.
14.10: Good candida1e poims are ru1y feanires 1ha1 are well-defined and may be visi-
ble on both the data set 10 be leSled and in 1t1e source used for trulh. This oflen
means construcred fea tures. for example road intersections.. curbs, manhole cov.
ers. geode1ic markers. u1ili1y poles. or fire bydrams or other rela1ively immobile
fea1ures.
14.12: Me1ada1a are 1he "data about data." They are imponant because 1he describe
the characteristics about any data we might wish to use. The allow us to evaluate
the data s11itabiliiy for imended uses. maimain the invesune111 over multiple
organiza1ions or changes in personnel. and help 11s in explaining or describing
our data t,o others.
Index 71 1
u x
U.S. ,Uf\·C)· foo1 116 XYl Earth-centered coordinates. 34
UAV.$Cc drones
usidcri boott 1$ 7 z
union 397, 400 zoiu1l function 459
DBMS 332
\ani\'crul wi:I lo,, cqu..iiou .SS?
mU\"<::tsal 1rn.n,vcnc Mcttator. , cc
projection, t.n'M
USLE
k c 1u1i,'crul so.ii Jou cqulllion SS2
v
,-ariogr.un S32
Vdatum 110
\'CCIQr
;rid\'autag<:s 51
41
:lll'cl\:t
com~rcd to r;,.stCT 56-57
conversion 10 raster .S7
d.inglc 4$
data model 40-SO
dcftni1ion 39
S,Jno,otb.i.ng 59