Lecture Notes GIS

1/30/2018
Geographic Information System

Arthur Godfrey, Dr. Batte
Lecture 1
A gentle introduction to GIS
1
1/30/2018
DATA
Definitions
“…a system of hardware, software, and procedures designed to support the capture,
management, manipulation, analysis, modeling, and display of spatially referenced data
for solving complex planning & management problems” (Rhind, 1989).
“…a computer system capable of assembling, storing, manipulating, and displaying
geographically referenced information…” (USGS, 1997)
“…a set of computer-based systems for managing geographic data and using those
data to solve spatial problems” (Lo & Yeung, 2002).
“….a computer system that allows the analysis and display of data with a spatial
component (Phillips, 2002).
4
2
1/30/2018
We may distinguish three important stages while working with geographic

data:
- Data preparation and entry
- Data analysis
- Data presentation
Spatial data and Geo-information

Another subtle difference exists between the terms DATA and INFORMATION.
Most of the time, we use the two terms almost interchangeably, and without the risk of
being ambiguous.
By DATA, we mean representations that can be operated upon by a computer.

More specifically, by spatial data we mean data that contains positional values.
By INFORMATION, we mean data that has been interpreted by a human being.
Humans work with and act upon information, not data.
6
3
1/30/2018
Lecture 2
Geographic information and spatial data types
Objectives
Distinguish between different types of geographic phenomena like fields and
objects and between discrete and continuous phenomena.
Understand the principle of “spatial autocorrelation”
Understand the concept of the following computer representation:
• Regular tessellation
• Irregular tessellation
• Point, Line and Polygon
Can apply the topological properties of interior and boundary to define

relationships between spatial features.
8
4
1/30/2018
Notions to be grasped
❖ Select a suitable computer representation, based on the different types of
geographic phenomena.
❖ Difference between the real world and computer representations.
❖ Different types of geographic phenomena.
❖ Boundaries (crisp – fuzzy).
❖ Regular versus irregular tessellations (quadtrees).
❖ Vector representations (TIN, point, line, area).
❖ Properties of geometric space used for spatial data (Euclidean space, metric space,
topological space, interior and boundary).
❖ Two dimensional topology.
❖ Scale and Resolution. 9
Spatial Data Modeling
10
5
1/30/2018
Different types of geographic phenomena

Some phenomena manifest themselves essentially everywhere in the study area, while
others only occur in certain localities.
1. A Field is a geographic phenomenon for which, for every point in the study area, a
value can be determined. (temperature, barometric pressure and elevation).
2. Objects populate the study area, and are usually well distinguishable, discrete,
bounded entities. The space between them is potentially empty.
11
Geographic Fields
Fields can be discrete or continuous, and if they are continuous, they can even be
differentiable.
In a continuous field, the underlying function is assumed to be continuous
(temperature). Continuity means that all changes in field values are gradual.
In a differentiable field, we can determine a measure of change per unit of distance
anywhere. (for elevation, this measure would be slope).
12
6
1/30/2018
Geographic Fields
In a discrete field,
Discrete fields, cut up the study space in mutually exclusive, bounded parts,
with all locations in one part having the same field value. (land classification,
geological units)
One may note that discrete fields are a step from continuous fields towards
geographic objects:
• Discrete fields as well as objects make use of ‘bounded’ features.
• A discrete field still assigns a value to every location.
13
Different types of geographic phenomena
14
7
1/30/2018
Continuous Field Discrete Field
Geological units
Elevation 15
Spatial Autocorrelation
• locations that are close are more likely to have similar values than
locations that are far apart.
• Principle is used when an interpolation is performed.
• Continuous field phenomena are based on the spatial autocorrelation
concepts.
• These fields although characterized by a continuous function must be
finitely represented.
• This also applies for lines.
16
8
1/30/2018
Boundaries
Which phenomena have boundaries:
-Discrete fields
-Objects
Two different types of boundaries:
• Crisp boundaries
• Fuzzy boundaries
17
Fuzzy boundaries (dynamic) Fuzzy boundaries (natural)
18
9
1/30/2018
Computer representations of geographic information

Regular tessellations (mosaic)
▪ Is a partition of space into mutually exclusive cells that together make
up the complete study area.
▪ In a regular tessellation, the cells are the same shape and size.
▪ The field attribute assigned to the cell is associated with the entire area
occupied by the cell.
▪ Square, regular tessellations are known under various names: raster,
grid.
▪ The size of the area that a grid cell represents is called the raster’s
resolution.
19
Examples of Regular Tessellations
20
10
1/30/2018
Regular Tessellation Cont..
Advantages:
• We know how they partition space. This leads to fast algorithms.
Disadvantages:
• They do not adapt to the spatial phenomenon we want to represent.
• Cell boundaries are both artificial and fixed: they may or may not
coincide with the boundaries of the phenomenon of interest.
21
Irregular Tessellations
Again, these are partitions of space into mutually exclusive cells, but now,
the cells vary in size and shape, allowing them to adapt to the spatial
phenomena they represent.
One example: region quadtree.
It splits up the area into four quadrants.
22
11
1/30/2018
Region Quadtree
This procedure stops when all cells in a quadrant have the same field value.
23
Vector representations
In vector representations, an attempt is made to associate
georeferences with the geographic phenomena explicitly.
Coordinate pair from some geographic space also known

as a vector
Note: rasters do not explicitly store georeferences of the phenomena they

represent.
24
12
1/30/2018
Georeferences
Instead, they provide a georeference of the lower left corner of the raster, for
instance, together with an indicator of the raster’s resolution, thereby
implicitly providing georeferences for all cells in the raster.
25
Triangulated Irregular Network (TIN)

It is built from a set of locations for which we have a measurement.
The locations can be arbitrarily scattered in space (not a nice regular grid).
Observe that in 3-D space, three
points uniquely determine a plane,
as long as they are not positioned on
the same line.
If we restrict the use of a plane to the
area between its three anchor points,
we obtain a triangular tessellation of
the complete study area. 26
13
1/30/2018
TIN
A plane fitted through the anchor points has a fixed aspect and gradient and
can be used to compute an approximation of e.g. elevation of other
locations.
27
Does TIN belong to vector?
YES, each anchor point has a stored georeference.

NO, as the chosen triangulation provides a tiling of the entire study space.
However, the cells of the tiling do not have an associated stored value as is
typical of tessellations.
28
14
1/30/2018
Point representations
Points are defined as single coordinate pairs (x,y), when we work in 2D or
coordinate triplets (x,y,z) when we work in 3D
Line representations
• Used to represent one-dimensional objects
(roads, railroads, canals, rivers…).
• Line is defined by 2 end nodes and 0-n internal nodes.

• An internal node or vertex is like a point that only serves to define the
line. 29
Line representation
A network is a collection of connected lines.
Area representations
When area objects are stored using a vector approach, the
usual technique is to apply a boundary model. This means
that each area feature is represented by some arc/node
structure that determines a polygon as the area’s
boundary.
30
15
1/30/2018
Area representation.
A simple but naive representation of area features would be to list for each
polygon simply the list of lines that describe its boundary.
Why is this not working?
-Redundancy
-If we want to find neighboring polygons, we have to do time-consuming analysis
(topology)
31
Area representation
32
16
1/30/2018
Topology and Spatial Relationships
Topology deals with spatial properties that do not change under a

transformation
E (polygon, area) in figure above is still inside D (another polygon or area)

33

The neighborhood relationships between the areas remain, and the boundary
lines have the same start and end points.
The areas are still bounded by the same boundary lines, only the shapes and
lengths of their perimetry have changed.
34
17
1/30/2018
We can define within the topological space, features that are easy to handle
and that can be used as representations of geographic objects.
These features are called simplices as they are the simplest geometric
shapes of some dimension:
▪ point (0-simplex),
▪ line segment (1-simplex),
▪ triangle (2-simplex),
▪ and tetrahedron (3-simplex).
35
Topology of 2D
We use the topological properties of interior and boundary to define
relationships between spatial features.
Region A →Boundary A and Interior A
Region B →Boundary B and Interior B
When the two regions meet this is defined as the Boundary of A intersects
the boundary of B.
The interiors do not intersect and the boundary of one does not intersect the
interior of the other.
36
18
1/30/2018
Topology of 2D
The mathematical definition of meets.
A meets B = interior (A) ∩interior (B) = Ø∧
interior (A)∩boundary (B)= Ø∧
boundary (A)∩interior (B)= Ø∧
boundary (A)∩boundary(B)≠Ø
There are 7 other spatial relationships that can be defined in a

similar way.
37
Spatial Relationships
38
19
1/30/2018
Rules of topological consistency
39
2½-D
• Rules of 2D topology apply.
• Nodes have a z-value.
• A node with an x-and y-coordinate can have only one z-value.
• Two different nodes, with identical x, y values, cannot have different z-
values.
• Consequently, true solids cannot be represented in a 2½D GIS.
40
20
1/30/2018
2½-D
41
Scale and Resolution

Map scale can be defined as the ratio between distance on a paper map and
distance of the same stretch in the terrain.
1:50,000 →1 cm on map = 50,000 cm in reality

1 cm on map = 500 meters in reality
Large-scale →much detail of a small area

Small-scale →few detail (world map)
42
21
1/30/2018
Scale and Resolution

The size of the area a pixel cell represents is called resolution.
43
Representation of geographic fields
44
22
1/30/2018
Representation of geographic objects
45
Spatial Data Modeling
46
23
1/30/2018
Raster and vector compared
47
Lecture 3
Data Processing Systems
48
24
1/30/2018
Contents
• What are data processing systems?
• Functional GIS components
– input
– output
– data storage
– spatial queries & analysis
• GIS & databases
-databases
• DBMS
• Example
- using GIS & DBMS together 49
Data - Processing - Systems?
50
25
1/30/2018
Data Processing Systems

Data processing systems are computer systems with:
1. appropriate hardware components for processing, storage and

transfer of data.
2. software components for management of the hardware and the
data.
51
Hardware and Software trends

• From minicomputers to PC’s
• Increasing performance
– Speed
– Storage
• Software boom
• Computer networks
• Mobile phones
• Open systems
52
26
1/30/2018
Computers CPU: 3 GHz

Mem: 1 Gb
Disc: 160 Gb
More than 30 years of development!
CPU: 1500 times faster

CPU: 2 mHz
Mem: 8000 times larger
Mem: 128 Kb
Disc: 16000 times larger
Disc: 10 Mb
53
Disciplines in GIS
Geography
Remote Sensing
Surveying
Cartography
Photogrammetry
Computer Science Geo-informatics
Mathematics
Archeology
Forestry
Earth Sciences
Urban Planning
Legal Sciences
54
Economy
27
1/30/2018
GIS Components
A GIS is a computer-based system that provides the following four sets of
capabilities to handle geo-referenced data :
1. data capture and preparation
2. data management
3. data manipulation and analysis
4. data presentation
55
GIS Functional Components
Data Capture
And Preparation Storage and Maintenance
Manipulation and
Data Presentation
Analysis
56
28
1/30/2018
Data Input
• Available data
– Tape, CD, Net
• Manual input
– Keyboard, digitizer
– On-screen, photogrammetry
• Semi-automatic digitizing
– Line-following
• Automatic digitizing
– Scanner
57
Data Preparation Operators

• Format transformation functions
• Geometric transformations
• Map projections
• Edge matching
• Graphic element editing
• Coordinate thinning
58
29
1/30/2018
Spatial Proximity
Objects that are near in geographic space should be near in
storage space as well.
59
Raster data storage
Values of neighboring cells are close together.

60
30
1/30/2018
Spatial analysis
GIS uses spatial analysis functions:
Operators that use spatial data to derive new geo-information.
61
Analysis of Spatial Data

• Classification
• e.g. determine land use
• Overlay functions
• e.g. spatial logic
• Neighborhood functions
• e.g. buffering
• Connectivity functions
• e.g. network analysis
62
31
1/30/2018
Data Output
• Hardcopy
– printer
– plotter
• Softcopy
– on-screen
• Data sets
– CD, DVD, Internet
63
Database Management Systems (DBMS)
64
32
1/30/2018
Reasons for using a DBMS

A DBMS:
• Supports very large datasets
• Maintains data correctness
• Supports concurrent use
• Provides a high-level, query language
• Uses a data model
• Provides backup and recovery functions
• Reduces data redundancy
65
The Relational Model

A Data Model is a language for the definition of:
• the data structures for storing the base data.
• the integrity constraints.
• the computer programs used to manipulate the data.
66
33
1/30/2018
The relational data model
67
The relational data model Foreign Keys
68
34
1/30/2018
69
70
35
1/30/2018
Other DBMS’s
Relational databases are well suited for:
• Large amount data
• Simple data types
Not setup well to deal with spatial data
• Object-oriented / object-relational models
• GIS applications will rely on DBMS add-on application packages
71
Vector Data Storage

Vector data and relationships are stored in tables.
72
36
1/30/2018
Geometric Features in GIS

The user can examine information about:
where things are and what they are like.
• From spatial data to attribute data and vice versa
Raster • geometric attributes

• non-geometric attributes
• relationships
Vector
73
Topology
Topological data structure provides:
• Automated way to handle digitizing errors
• Reduced storage space (Poly boundaries stored only once)
• No overlapping polygons
• Advanced spatial analysis (adjacency, networks)
74
37
1/30/2018
Paper maps versus GIS

Spatial information -
• Acquisition
• Storage
• Maintenance
• Analysis
• Output
75
GIS and DBMS support

GIS software provides support for
– spatial data
– thematic or attribute data
DBMSs are much better in table functionality.
GIS applications now make use of external DBMS for attribute data
support.
76
38
1/30/2018
Lecture 4
• Data quality
• Spatial referencing
77
Objectives
Data quality
• Explain the relationship between accuracy and precision.
• List main components of spatial data quality.
• Understand and apply the RMSE for measuring location accuracy (section 4.3.1).
Spatial Referencing
• Understand basic concepts of spatial referencing pertinent to published maps and spatial data.
• Know the meaning of items in a map legend (vertical and horizontal datum, ellipsoid, coordinate
types, etc).
• Distinguish between local and global reference systems.
• Understand the concepts of height measurements (in relation to the geoid or an ellipsoid).
• Comprehend datum transformations.
• Know the classification of map projections.
• Know the distortion properties of map projections.
• Describe the process of selecting a suitable map projection.
• Grasp the logic of map projection equations and the principles of transforming
78
maps from one
projection system to another.
39
1/30/2018
Components of Spatial Data Quality

• Positional accuracy
• Attribute accuracy
• Temporal accuracy
• Lineage
• Logical consistency
• Completeness
These components are considered to be an adequate description of
data quality.
79
Inaccurate data
80
40
1/30/2018
Inconsistent data (I)
The overlapping parcel boundaries is the result of digitizing from

existing parcel field maps !
Case: Ghana 81
Inconsistent data (II)
Corrupted registration system !

Case: Ghana 82
41
1/30/2018
Incomplete data
83
Lineage
Refers to the history of the data set

• Source of data.
• Methods of acquisition or derivation.
• Date and scale of source material.
• Structure of source material into map sheets or administrative
units.
• Accuracy and precision of instrumentation.
84
42
1/30/2018
Lineage
Provide information about;
• transformations,
• interpolations,
• cartographic displacements,
• exaggerations etc.
Any data set created from distinct sources should be provided
with lineage information to identify the quality of the data set
in terms of ‘fitness for use’. 85
Attribute accuracy in Remote Sensing: Example of an error matrix
The overall accuracy is (62+18+12) / 100 = 92 % 86
43
1/30/2018
How to test positional accuracy

For discrete objects, e.g. cadastral boundaries, use an independent source
of higher accuracy.
• use a larger scale map.
• use check points determined by field survey.
For fuzzy objects, e.g. vegetation and soil boundaries, use fuzzy set
techniques.
These have the ability:
• to handle logical modeling (map overlay) operations on inexact data.
• of using a variety of natural language expressions to qualify uncertainty.
87
How to estimate positional accuracy

Compute accuracy from knowledge of the errors introduced by different sources
Example
1.0 mm in source document
0.5 mm in map registration for digitizing
0.2 mm in digitizing
If sources are combined independently, an estimate for the overall accuracy, will be:
( 1.02 + 0.52 + 0.22)1/2 = 1.1 mm

88
44
1/30/2018
How to measure positional accuracy
Position accuracy is normally measured as a Root Mean

Square Error (RMSE)
Average deviation of all measured positions from their
true positions.
89
Computation of RMSE
90
45
1/30/2018
Computation of Systematic Error

The observed errors should be checked for a systematic error
component, which may indicate a, possibly repairable, lapse in the
method of measuring.
91
Accuracy and Precision
Measurement errors are generally

described in terms of accuracy. The
accuracy of a single measurement is; “the
closeness of observations, computations
or estimates to the true values”
Accuracy should not be confused with precision, which is “a

statement of the smallest unit of measurement to which data can be
92
recorded”.
46
1/30/2018
Spatial Referencing
Contents
➢ Spatial reference surfaces and datums
• The Geoid – vertical datum
• The Ellipsoid – horizontal (geodetic) datum
➢ Local and global datums
➢ From Earth to map
➢ Coordinate transformations
➢ Map projections
• Classification of map projections
• Universal Transverse Mercator (UTM)
• Map projection selection 93
The Geoid – Vertical datum for height referencing
94
47
1/30/2018
The Geoid, the height reference surface

• To describe heights, we need an imaginary surface of zero
height.
• This surface must also have a physical meaning.
• A surface where water does not flow, a level surface, is a
good candidate.
• The most obvious choice is the level surface that most
closely approximates all the Earth’s oceans.
• This surface is known as a Geoid.
• The Shape of the Geoid is influenced by Gravity.

• The Geoid represents an Imagery Sea Level.
95
• Every point on a Geoid has the same height all over the world.
Vertical datums
Starting from the mean sea level points, the heights of points on the
Earth can be measured using a technique known as geodetic
leveling.
A leveling network implements a local vertical datum:

• network of leveling lines starting from the Amsterdam tide-gauge, showing some of
the benchmarks;
• how height is determined for some point, working from the nearest benchmark.
96
48
1/30/2018
The Ellipsoid – horizontal datum for horizontal referencing
97
The ellipsoid and the horizontal datum

We also need a reference surface for the description of the
horizontal coordinates of points of interest.
Since the geoid is defined physically and not mathematically, it
cannot be used as a reference surface for horizontal coordinates.
The mathematical surface that is simple enough and most closely
approximates the local mean sea level is the surface of an oblate
ellipsoid.
98
49
1/30/2018
Local and global ellipsoids
The ellipsoid is chosen in such way that it best fits the

surface of the area of interest (the country)
99
The Ellipsoid Reference surface for location

measurements
An ellipsoid with specific dimensions ‘a’ and ‘b’ as half the
length of the major and minor axis respectively is chosen
which best fits the local mean sea level.
Then the ellipsoid is positioned and oriented with respect
to the local mean sea level by adopting a latitude (Φ) and
longitude (λ) and height (h) of a so-called fundamental
point and an azimuth to an additional point.
We say that a local horizontal datum is defined by

(a) dimensions (a, b) of the ellipsoid,
(b) the adopted geographic coordinates Φ, λ and h of the
fundamental point, and
(c) azimuth from this point to another.
The local horizontal datum is implemented through a so-called triangulation network.

100
50
1/30/2018
Commonly used ellipsoids and datums
101
Commonly used ellipsoids and datums
102
51
1/30/2018
Global and Local Datums

Global ellipsoids and datum’s to approximate
the earth-as-a-whole - with the aid of
satellites- are becoming more in use (e.g.
WGS84, ITRS, ETRS89).
Changing or re-adjustment of the local
ellipsoids and datums is taking place in many
countries. 103
Datum change (example)
Mapping organizations do not only coach the user community about the
implications of the geocentric datum. They also develop tools to enable
users to transform coordinates of spatial objects from the new datum to the
old one. This process is known as datum transformation.
The tools are called datum transformation parameters.
104
52
1/30/2018
Datum re-adjustment (example)
Why do the users need these transformation parameters?

• Because, they are typically collecting spatial data in the field using satellite
navigation technology.
• They also typically need to represent this data on a published map based
on a local horizontal datum.
The good news is that a transformation from datum A to datum B is a
mathematically straightforward process.
The bad news is that the estimated parameters may be inaccurate if the 105coordinates
of the common points are wrong.
Datum Transformations
This is often the case when we transform coordinates

from a local horizontal datum to a geocentric datum.
The coordinates in the local horizontal datum may be

distorted by several tens of metres because of the
inherent inaccuracies of the measurements used in the
triangulation network.
106
53
1/30/2018
From Earth to Map
107
Overview
108
54
1/30/2018
109
Coordinate Transformations
110
55
1/30/2018
Projection change using projection equations
111
Change a map projection

Including a datum transformation
112
56
1/30/2018
Datum transformation Molodensky 2D-transformation
113
Datum Transformations
114
57
1/30/2018
Datum transformation from ITRF to Potsdam
115
Overview of Coordinate Transformations
116
58
1/30/2018
2D (direct) Cartesian transformations are used for the

rectification of images using 2D Ground Control points.
117
Map Projections
118
59
1/30/2018
Map projection principle
119
Map projection equations
A map projection is a mathematical function by which 2D

Geographic coordinates are transformed into 2D
Cartesian map coordinates.
Forward equation
Inverse equation
120
60
1/30/2018
Map projection equations (example)
The map projection equations for the Mercator

projection (spherical assumption)
Forward equations
Inverse equations
121
Map projection class
122
61
1/30/2018
Map projection property

Conformal
• Shapes/angles are correctly represented (locally).
Equivalent ( or equal-area )
• Areas are correctly represented.
Equidistant
• Distances from 1 or 2 points or along certain lines are
correctly represented. 123
Conformal map projection

Cylindrical
124
62
1/30/2018
Equivalent map projection

Cylindrical
125
Equidistant map projection

Cylindrical
126
* also called Plate Carrée projection
63
1/30/2018
Robinson projection (compromise)

Pseudo-Cylindrical
127
Classification of map projections
128
64
1/30/2018
Universal Transverse Mercator
129
Transverse Mercator projection
130
65
1/30/2018
UTM-projection
131
UTM-Zones
132
66
1/30/2018
UTM Zone numbering system
133
A standard map projection system

The use of a standard map projection system for all countries
is becoming more in use ( U.T.M. )
Re-adjustment of the local map projection systems is taking
place in many countries.
134
67
1/30/2018
Map projection selection
135
Criteria for the selection of a Map projection

Shape of the area (determines the projection class).
Purpose of the map (determines the property of the
projection).
Position of the area (determines the aspect of the
projection).
136
68
1/30/2018
Shape of the area
137
Purpose of the map

Conformal
• maps which require measuring angles (aeronautical charts,
topographic maps).
Equivalent
• maps which require measuring areas (distribution maps).
Equidistance
• maps which require reasonable area and angle distortions
(several thematic maps).
138
69
1/30/2018
Position of the area
139
Choosing a Map Projection

The ideal map projection for any country would be either
azimuthal, cylindrical, or conical projection, depending on the
country’s shape, with a secant projection plane located along
the country’s main axis.
The property of the map projection depends on the map
purpose.
140
70
1/30/2018
CHAPTER 5
Data entry and Data preparation
141
From Alpha to Omega

• Spatial data input
– Direct spatial acquisition
– Digitizing paper maps
– Obtaining spatial data elsewhere
• Data preparation
• Point data transformation
• Advanced operations on rasters !!!!
142
71
1/30/2018
Direct Spatial Data Acquisition
143
Digitizing Paper Maps

A cost-effective method of data capture is the DIGITIZING of
existing maps.
This requires the conversion of an analogue map into a
digital map.
A number of digitizing techniques exist.
144
72
1/30/2018
Digitizing Process
145
On-tablet Digitizing (manual)
146
73
1/30/2018
On-screen Digitizing
147
On-screen vs. Manual Digitizing

• It is more comfortable for the operator.
• It is more accurate.
– zooming facilities
• It is faster.
– semi-automatic, digitizing and editing at the same time
• There are updating procedures.
– geometrically corrected satellite imagery and scanned aerial photo’s
can be overlaid with the old vector data.
148
74
1/30/2018
Map Registration
149
Map Registration
150
75
1/30/2018
What to Digitize ?
Point, Line, Polygon, or...?
151
Digitizing Errors
152
76
1/30/2018
Point & Stream Mode Digitizing
153
The Scanning Process
154
77
1/30/2018
Working Principle
A digital scanner illuminates the document and measures
with a sensor the intensity of the reflected or transmitted light.
155
156
78
1/30/2018
Scanning Resolution
Scanning Resolution can be expressed in:
• in millimeters
• in microns
• in dots per inch
Example:
pixel size = 0.05 mm = 50 μm = 508 dpi
157
Scanner Resolution (pixel size)

Minimum required resolution depends on the details in the map and
the digitizing technique:
• Manual on-screen map digitizing: around 200-300 dpi

• (Semi-) automatic digitizing: thickness >1/3 of the thinnest line
– lines should be at least 3 pixels wide
– a resolution of 300-600 dpi is mostly sufficient
• Hardcopy output: 2x the resolution of the printer:
– nowadays 1200 dpi
• Photogrammetric applications: 800-2400 dpi 158
79
1/30/2018
Scanning Modes
Line Art, 1 bit
159
Grey Scale, 8 bit
160
80
1/30/2018
Index Color, 8 bit
161
Full Color, 24 bit
162
81
1/30/2018
Scanner Output
The scanner output is only a digital copy of the source
document in raster cell values.
• Data are NOT structured into classified and coded objects

• To obtain this, the data have to be vectorized and further
structured
163
The Vectorization Process

The conversion from raster to vector
Step 1. Skeletonizing
Step 2. Feature Forming
164
82
1/30/2018
165
Original Line Work
166
83
1/30/2018
Scanned Line Work
167
Skeletonized Line Work and Line Forming
168
84
1/30/2018
Original and Vectorized Line Work
169
Vector Editing
170
85
1/30/2018
Vectorized Lines (after vector editing)
171
Feature Forming
• Splitting lines to form line segments and nodes.
• Joining line segments to form polygons and features.
• Feature coding
172
86
1/30/2018
Vectorized Lines
(after vector editing and data structuring)
173
Vector / Raster
174
87
1/30/2018
Selecting a Digitizing Technique
175
Spatial Data elsewhere
176
88
1/30/2018
Spatial Data Clearinghouses
177
Spatial Data elsewhere
Means:
Data in different formats and standards, and with different
quality !
178
89
1/30/2018
Data Preparation
179
Spatial Data Preparation
Spatial data preparation consists of editing data that is

to be entered into the GIS database.
180
90
1/30/2018
Data Preparation cont…

Images may need enhancements and (re)classification
Vector data may require editing, such as the removal of errors
and generating polygons.
Data may need to be vectorized or rasterized to match
existing data sets.
181
Data Preparation cont…
Additionally, processing includes associating attribute data

with objects, either through manual input or reading digital
attribute files into the GIS.
182
91
1/30/2018
Data Checking and Clean-up
Data sets must be checked for consistency and

completeness.
This requirement applies to the geometric, topological and
semantic quality of the data.
183
Data Quality
• Geometric
Are the coordinates correct?
• Topological
Are the relationships correct?
• Semantic ≈ "meaning"
Does the data make sense?
184
92
1/30/2018
Clean-up operations for vector data
185
Cleaning & structuring of vector data
186
93
1/30/2018
Topological and Semantic Problems
187
Associating Attributes
188
94
1/30/2018
Data Structure Conversion

Rasterization: assigning point, line and polygon attribute
values to raster cells.
Vectorization: producing vector data from a scanned
image; used to identify features in satellite images or aerial
photographs .
189
Topology Generation
• Topology deals with spatial relationships between features.
• Topological relations are used in a GIS to perform spatial
operations like:
• overlaying
• buffering
• network tracing
• shortest-path routing
190
95
1/30/2018
Topological Relationships
“Green area is … covered by … blue area” (by 2D geographic extent)
191
Combining Multiple Data Sets

Three fundamental cases:
Data sets of:
• same area, different accuracy
• same area, different representation
• adjacent areas, which must be merged into a single data
set
192
96
1/30/2018
Differences in Accuracy
Combining two data sets derived from different maps may
lead to sliver polygons.
193
Differences in Representation
194
97
1/30/2018
Merging Data in a Database
195
Merging data in a database

Some GIS systems have merge or functions to solve
problems arising from merging adjacent data sets.
196
98
1/30/2018
Point Data Transformation
197
Interpolation
Very often, we only have a limited number of
observation points, but we're interested in every
location in the area.
How do we find the 'missing' points?
Answer: using interpolation techniques.
198
99
1/30/2018
Two Fundamental Cases
Discrete Field
• Qualitative
• What is it?
Continuous Field
• Quantitative
• How much is it?
199
Field Representation from Point Measurements
Qualitative point Quantitative point

measurements measurements
(e.g. geological, soil units) (e.g. elevation, temperature)
200
100
1/30/2018
Discrete & Continuous Field
Land use Elevation
Qualitative Quantitative
P = Forest P = 112 meter
Q = Grassland Q = 168 meter
201
Discrete Field from Point Data
Thiessen polygons
Any location is assigned the

value of the closest measured
point.
202
101
1/30/2018
Continuous Field from Point Data
Interpolation techniques:
• Trend surface fitting
• Moving window averaging
• Triangulation
Many other techniques exist!!
203
Trend Surface Fitting
The key in trend surface fitting is to find a

mathematical surface f(x, y).
Uses regression techniques to best fit the original
measurements.
204
102
1/30/2018
Example, 1-D
205
Examples of trend surfaces obtained from regression

techniques, 2-D
206
103
1/30/2018
Trend Surface Fitting
Simple tilted surface:

• z = f(x,y) = c1.x + c2.y + c3
• f(x,y) = -1.83934.x + 1.61645.y + 708782
This formula can be used to compute a value for any

location (x, y).
207
Global and Local Trends
Global trend surface fitting assumes that the entire

study area can represented by the same
mathematical surface.
In many cases, it is more realistic to partition the
study area (e.g. at mountain ridges for elevation
data) , and apply trend surface fitting techniques
for each part.
208
104
1/30/2018
Moving Window Averaging
Computes new raster cell values from a

number of quantitative point measurements
within a square window.
209
Moving Window Averaging

Possible parameters settings:
• Resolution of the raster cells
• Shape/size of the moving window
• Selection criteria
which measurements do participate
• Averaging function
equal weight or weight factor
210
105
1/30/2018
Inverse Distance Weighting
Squared
Computed field value = ?
211
Inverse Distance Weighting
10.66
Squared
Computed field value = 10.66
212
106
1/30/2018
Triangulation
Another way of interpolating quantitative point

measurements is by triangulation.
Delaunay
213
TIN construction
A TIN is typically based on a Delaunay triangulation

of a set of input data points.
214
Bad triangulation Good triangulation
107
1/30/2018
Delaunay & Thiessen
215
Advanced Operations on Continuous Fields
• Slope angle calculation

• Slope aspect calculation
• Slope convexity / concavity calculation
• Slope length calculation
• Hill shading
216
108
1/30/2018
Advanced ops., cont….

• Three-dimensional map display
• Change in elevation through time
• Catchment delineation
• Dynamic modeling
– (e.g. surface run-off, erosion)
• Visibility analysis
All these operations apply a technique known as Spatial Filtering
217
Spatial Filtering
Linear Filtering or Convolution
The difference with moving window averaging is that the moving window in
filtering is itself a little raster, which contains cell values that are used in the
computation for the output cell value. This little raster is known as the filter
218
109
1/30/2018
Computation of Slope Angle
gradient
219
Slope & Aspect
220
110
1/30/2018
Spatial Filtering
Figure (b) and (c) illustrate an x-gradient filter, and y-gradient

filter, respectively.
221
The x-gradient filter determines the slope increase ratio

from west to east.
If the elevation to the west of the centre cell is for
example 1540m and that to the east of the centre cell is
1552m, then apparently along this transect the elevation
increases 12m per two cell widths, i.e. the x-gradient is
6m per cell width.
The y-gradient filter operates entirely analogously,
though in south-north direction.
222
111
1/30/2018
X- and Y-gradient Calculation
223
Slope Angle & Aspect Angle Ψ
Observe that both filters express elevation gain

per cell width. 224
112
1/30/2018
Here, f stands for the elevation field as a

function of x and y, and δf/δx, for instance, is
the elevation gain per unit of length in the x-
direction.
225
To obtain the real slope angle α along path p,

observe that both the x- and y-gradient contribute
to it.
This is illustrated in Figure above.
A, not-so-simple, geometric derivation can show
that always
226
113
1/30/2018
Slope Angle Calculation
227
Slope Angles (example)
228
114
1/30/2018
Slope Aspect Calculation
229
Summary
From Points to Fields
1. Discrete Fields (Vector!)
2. Continuous Fields (Raster!)
Interpolation
1. Thiessen Polygons
2. Surface fitting, MWA, IDW, Delaunay
Operations on Rasters (compulsory!!)

• Filtering
• Slope & Aspect Computation
230
115
1/30/2018
CHAPTER 6
Spatial data analysis
231
Spatial Data Analysis Part I
The solution of a spatial problem

depends on a large number of
parameters.
Parameters are often interrelated.
Interaction of parameters is more
precisely described in an
application model.
232
116
1/30/2018
Introduction
Application model: description of the behavior of relevant

geographic phenomena, it tries to simulate the geographic
phenomena that are present.
Models for planning and site selection are prescriptive
They quantify environmental, economic and social factors
233
Introduction
In predictive models, a forecast is

made of the likelihood of future
events, for example, of a future
landslide.
This lesson however does not
discuss analytical modeling, it
discusses analytical functions that
form the building blocks for analytical
models. 234
117
1/30/2018
Introduction
• There are many ways to classify analytical functions.

• The one suggested by Aronoff, (1989) is used here:
• Measurements, retrieval and classification
• Overlay functions
• Neighborhood functions
• Connectivity functions
• These functions are split over 2 lectures.
• Every function is discussed for vector and for raster data.
235
Overview of this lesson:

1. Measurement operations
1.1. Vector measurements

1.2. Raster measurements
2. Spatial selection queries
2.1 Spatial Selection by Attribute conditions

2.2 Spatial Selection using topological relationships
3. Reclassification
3.1 User controlled classification

3.2 Automatic classification
4. Overlay techniques
236
118
1/30/2018
Measurements
Only Geometric Measurements are discussed, no

measurements on attribute values.
All measurement functions are discussed both in
raster and vector (on points, lines and polygons).
Vector measurements include: location, length,
distance and area size.
237
Measurements -Vector
Vector measurements include: location,

length, distance and area size
Location → always stored by GIS

• One coordinate pair for points
• List of pairs for lines and polygons
238
119
1/30/2018
Measurements -Vector
• Polygons have a centroid.

• The centroid is the center of mass of the
polygon, the point on which it would balance
under a homogeneously applied force like
gravity.
• Note that the centroid can be outside the
polygon
• The location of the centroid of a polygon can
be either stored or computed on the fly
239
Measurements -vector
Length is associated with

polylines, and with polygon
boundaries.
It can be stored by the GIS or
computed on the fly.
240
120
1/30/2018
Measurement -Vector
241
Measurement -Vector
If one or both features are not

a point, we will measure the
minimal distance between the
two features.
242
121
1/30/2018
Measurement -Vector
A bounding box is the minimal

rectangle that covers the feature.
Bounding box computation is a
Support function.
Example: when the bounding boxes do

not intersect, the features do not
intersect.
243
Measurement -Raster
Raster measurements include: location,

distance and area size
Location of an individual cell →derived
from anchor point and resolution
The cell’s location can be its lower left
corner or midpoint
244
122
1/30/2018
Measurement -Raster
Area size →number of cells * cell size

When you know the resolution, you can calculate
the area of a single cell. In this example 30 x 30
meters = 900 m2.
The number of cells is also called the frequency
or count.
245
Measurement -Raster
Distance →standard distance function

applied to the locations of their mid-
points.
When a raster is used to represent line
features as strings of cells, the length
of a line is computed as the sum of the
distances between the cells.
246
123
1/30/2018
Measurement -Summary
Vector:
• Vector measurements include: location (coordinates and
centroids), length, distance and area size.
• Another geometric measurement is the minimal bounding box
of lines and polygons.
• Pythagorean distance function is used.
Raster:
• Raster measurements include: location, distance and area
size.
• Standard Pythagorean distance function applied to the
locations of their mid-points.
247
Spatial selection queries
Interactive
Spatial Selection by Attribute conditions
• Relational operators
• Logical operators
• Combining attribute conditions
Spatial selection using topological relationships

• Selecting features that are inside selection objects
• Selecting features that intersect
• Selecting features adjacent to selection objects
• Selecting features based on their distance 248
124
1/30/2018
Spatial selection queries
Interactive spatial selection is a

selection in which you select
features by clicking on the screen
(on the feature to select) or
drawing a graphic, to select all
objects within this graphic.
249
Attribute queries are selections in which

we use a selection condition on the
features attributes.
The condition is specified in a query
language.
This query language can be SQL when the
data is stored in a relational database or in
a software specific language.
250
125
1/30/2018
A condition that tests a single criterion is

called an atomic condition.
Atomic conditions use a predictive symbol
such as < (less than).
Any of these symbols is combined with an
expression on the left and on the right.
251
The expression can be an attribute

name (Area), a constant (400000), or
an arithmetic expression:
Ranking > MinRanking + 5
Area / Length < 1
252
126
1/30/2018
Logical connectives are used to

combine two atomic conditions to one
composite condition.
Examples of logical connectives are:
AND, OR and NOT.
253
OR returns true if one or both of

the expressions a and b is true.
NOT returns true if the expression
is false.
254
127
1/30/2018
Spatial Selection using topological relationships
Spatial selections use topological

relationships
The steps carried out are:

1. Select one or more selection objects.
2. Apply a chosen spatial relationship to
determine the features that have that
relationship with the selection objects.
255
Selecting features that are inside

selection objects.
Polygons can contain polygons, lines or
points, and lines can contain lines or
points.
256
128
1/30/2018
An example of a selection using the inside

relationship is shown on the left.
In picture 1 the state of Georgia is
selected, this is the selection object.
In picture 2 all the cities that are inside this
state are selected.
257
Selecting features that intersect.

In picture 1, the state of Georgia is
selected.
In picture 2, the interstates that run
through Georgia (intersect).
258
129
1/30/2018
Selecting features adjacent to selection

objects.
Adjacency is the same as the meets
relationship.
259
Selecting features based on their distance.

Will search within a given distance from
the selection object, at a given distance, or
beyond a given distance.
Example (left) select all the cities within
100 kilometers from Atlanta.
260
130
1/30/2018
Spatial Selection -Summary
Two types of selections, attribute selections and

selections using topological (spatial) relationships
In attribute queries, predictive symbols and Logical
connectives are used.
We distinguish atomic conditions and composite
condition.
261
(Re) Classification
Remove detail from an input dataset to reveal important spatial

patterns.
Reduce the number of classes and eliminate details.
If the input dataset itself is the result of a classification we call it a
reclassification.
Spatial selection is a two step process:
1. Select one or more selection objects.
2. Apply a chosen spatial relationship to determine the features that
have that relationship with the selection objects.
262
131
1/30/2018
(Re) Classification –post processing
Vector reclassification can be performed with (or without) post-

processing.
With post-processing we mean that adjacent features that after

classification have the same class are merged together.
This is called spatial merging, aggregation or dissolving.
263
(Re) Classification –post processing
For raster datasets, post-processing is not

possible, only the value of the cell will
change.
Post-processing of vector data is only
sensible for lines and polygons.
264
132
1/30/2018
(Re) Classification
3.1 User controlled classification

• Classification table
3.2 Automatic classification
• Equal interval technique
• Equal frequency technique
265
User controlled classification
In user-controlled classification we
indicate the classification attribute and
the classification method.
This is normally done via a
classification table.
266
133
1/30/2018
Automatic classification
• User specifies the no. of output

classes.
• Computer decides the class break
points.
• Two techniques of determining the
class breaks are discussed:
• Equal interval
• Equal frequency
267
• Equal interval is calculated as (Vmax–

Vmin) /n
• Vmax is the maximum attribute value, Vmin
is the minimum attribute value and n is
the number of classes.
• In our example: (10-1) / 5 ≈2
• Each class will have two values.
268
134
1/30/2018
Equal frequency, is also called

quantile.
Total number of features / number of
classes (n)
The objectives is to create
categories with roughly equal
number of features (or cells).
269
The two techniques are illustrated here
again. On the horizontal axis, the original
values.
The classes are the blue vertical lines. In the
equal interval example, the blue lines are
equally spaced. On the vertical axis, the
frequency.
In the equal frequency technique, when the
frequency goes up, the classes become
smaller.
270
135
1/30/2018
(Re) Classification -Summary
• Reclassification is used to reduce the complexity of a layer in

order to show patterns
• Two types of classifications, user controlled classification and
automatic classification
• In user controlled classification the user will create a
classification table.
• In automatic classification the user will determine the number
of classes and the classification technique.
• Two classification techniques were discussed:
• Equal interval
• Equal frequency
271
Overlay
Standard overlay operators take two input
data layers and assume they are geo-
referenced in the same system and
overlap in study area. If either condition is
not met, the use of an overlay operator is
senseless.
The principle of spatial overlay is to
compare the characteristics of the same
location in both data layers, and to
produce a new output value for each
location.
272
136
1/30/2018
Overlay
Overview of the topics discussed:

4.1 Vector overlay techniques
• Intersection
• Clip by
• Overwrite by
4.2 Raster overlay techniques
• Arithmetic operators
• Comparison and logical operators
• Conditional expressions
• Decision table
273
Vector overlay -intersection
The standard operator for two layers of

polygons is the polygon intersection
operator.
The result of this operator is the collection of
all possible polygon intersections.
The attribute table combines the information
of the two input tables ( spatial join).
274
137
1/30/2018
Vector Overlay Clip
Clip takes a polygon data layer and restricts its

spatial extent (the area that it covers) to the
outer boundary of a second input layer (clip
layer).
No other polygons from the clip layer play a role
in the result.
This technique can be used to reduce the area of
a thematic layer to that of the study area.
275
Vector Overlay Overwrite
The polygon overwrite creates a layer with the

polygons of the first layer except where polygons
exist in the second layer (as they take priority).
This operator can be used to overwrite a layer
with “updates“ stored in a second layer.
276
138
1/30/2018
Vector Overlay
Vector overlays are usually also defined

for point and line data layers.
When a polygon layer is intersected by a
line layer the result will be a line layer (the
layer of the lowest order).
277
Raster overlay operators
Vector overlay operators are geometrically

complicated, raster overlays are mostly cell by
cell computations.
GISs that support raster processing have a full
language to express operations. This is called a
raster calculus.
The syntax used here is based on the ILWIS
raster calculus.
278
139
1/30/2018

New cell values are calculated using calculus -map algebra.
• Performed on cell-by-cell basis.
• No geometric calculation
Output_raster_name : = Raster_calculus_expression
279
Overview:
• Arithmetic overlay operators
• Comparison and logical operators
• Conditional expressions
• Decision table
280
140
1/30/2018
Raster overlay -arithmetic
Arithmetic operators
• +, -, *, /
• MOD (modulo division)
• DIV (integer division)
• Goniometric operaters: sin, cos, tan, asin, acos, atan.
• For example:
Raster2 : = Raster1 * 5
281
Raster overlay –comparison/logical

Comparison operators
• <, <=, =, >=, >, <>
C:=A<>B is true when the cell’s value in A
differs from the cell’s value in B. It is false
if they are the same.
Logical operators
• AND,OR,NOT, XOR (exclusive)
a XOR b is true if either a or b is true, but not
both.
282
141
1/30/2018
Raster overlay – comparison/logical
The cell values of the output raster is

either true or false.
Other values are not possible.
283
Raster overlay –conditional expr.
Conditional expressions test a condition, if the

expression is true, the then_expression will be
evaluated, if it is false the else_expression.
A conditional expression can lead to many
different types of outputs, numeric value, but also
true/false.
Format: output_raster: = IFF (condition,
then_expression, else_expression).
284
142
1/30/2018
Raster overlay – decision table
Decision tables are the same statement

as in a Conditional statement, but
presented in a different way.
• The decision table will guide the
overlay process.
• It lists all possible combinations of input
values, and the output values.
285
Overlay operations -RASTER
286
143
1/30/2018
Overlay - Summary
Vector overlay techniques, intersection, clip by and overwrite by.

• Intersection is the fundamental operator, the attribute table is a
spatial join (fields from both input tables.
• Clip by is like a cookie cutter, cutting out the map extend of the
second layer.
• For types of raster overlay techniques, using arithmetic operators,
comparison and logical operators, conditional expressions and
decision table.
• Comparison and logical operators only evaluate to true and false.
• Conditional expressions and decision table lead to the same
result
287
Spatial Data Analysis Part II
Three topics will be discussed:

1. Neighborhood functions
2. Network analysis
3. Error propagation
288
144
1/30/2018
Neighborhood functions
Will find out characteristics of the

vicinity (neighborhood) of a location.
289
To perform neighborhood analysis, we

must state which target locations are of
interest to us, and what is their spatial
extent.
Define how to determine the neighborhood
for each target.
Define which characteristic(s) must be
computed for each neighborhood.
290
145
1/30/2018
Proximity computation makes use of the geometric

distance function.
Spread computation assumes that the phenomenon

spreads in all directions, but not necessarily equally
easily in all direction.
In seek computation the phenomenon will choose a
least-resistance path. 291
➢ Buffer generation (Vector and Raster)

➢ Thiessen polygons (Vector)
➢ Spread computation (Raster)
➢ Seek computation (Raster)
292
146
1/30/2018
Buffer zone generation
Principle is simple, we select one or more target

locations and determine the area around them.
• Buffer generation can be performed on vector
as well as raster data.
• Target locations can be point, lines or

polygons in a vector environment.
293
• Buffers can be simple, or zonated.

• With a zonated buffer, the buffer
consists of multiple rings each
representing a different distance.
• In vector buffer generation, the buffer
will become a new polygon in the
output layer.
294
147
1/30/2018
Raster layers:
• Need target cell(s).
• The distance function applies the Pythagorean
distance between the cell centers.
• Using cell resolution as the unit.
• The distance from a non-target cell to the
target is the minimal distance one can find
between that non-target cell and any target
cell.
295
Thiessen polygons
Divide an area into polygons, so that each

polygon contains locations that are closer to
the midpoint than to any other midpoint.
It will generate a polygon around each target
location that identifies all those locations that
‘belong to’ that target.
296
148
1/30/2018
Thiessen polygons
The easiest way to construct Thiessen

polygons is via a Delaunay triangulation of
the same input points.
The construction first creates the
perpendiculars of all the triangle sides.
297
Spread Computation
In spread computation the neighborhood of

a target location not only depends on
distance, but also on direction and
differences in the terrain.
Examples are pollutions and radio waves.
298
149
1/30/2018
Spread Computation
Spread computation involves one or more target locations, which

are better called source location in this context, as they may be for
example the pollution source.
Spread computation also involves a local resistance raster, which

for each cell provides a value that indicates how difficult it is to
pass by that cell.
299
Spread Computation
While computing total resistance, the GIS

takes proper care of correct spread path
lengths.
The spread from a cell to its neighbor cell to
the east is shorter than to its northeast
neighbor
300
150
1/30/2018
Spread Computation
The GIS computes the total resistance for

spreading from csrc to cn as ½(val (csrc) +
val(cn))
This is half of the resistance value of the
source cell (csrc), because this cell is only
traveled half, plus half of the resistance
value of the cell north of it
The value of the source cell of course is 0.
301
Spread Computation
The GIS computes the total minimal

resistance raster for a diagonal neighbor
as ½(val(csrc) + val(Cne)) * √2.
This is half of the resistance value of the
source cell plus half of the resistance
value of the cell to the north-east
multiplied by the square root of 2.
302
151
1/30/2018
Spread Computation
Since the source material has the habit of

taking the easiest route to spread, we must
determine the minimal cost.
We must consider all possible paths to
reach the cell and assign the minimal
value.
Lowest Value
303
Spread Computation
Note that the accumulated resistance along

a path of cells is simply the sum of these
incurred resistances.
We can use the value 4.00 and add the
resistance for moving from this cell to its
north east neighbor cell.
304
152
1/30/2018
Seek Computation
Seek computation applies when a

phenomenon does not spread in all
directions, but chooses a least-cost
path.
A typical example is a drainage
pattern in a catchment.
305
Seek Computation
Input for a seek computation is an elevation

raster.
For each cell, the steepest downward slope
to a neighbor cell is determined.
The direction of this downward slope is
stored in the flow direction raster.
306
153
1/30/2018
Seek Computation
1. For each cell, first eliminate all cells

that are not downhill (have a higher
elevation value)
2. You have two types of neighbors: direct
neighbors and diagonal ones. For each
type, pick the steepest.
3. Compensate for the difference in path
length.
307
Seek Computation
From the flow direction raster, the GIS will

calculate the accumulated flow count
raster.
Cells with a high accumulated flow count
represent streams.
308
154
1/30/2018
Seek Computation
The value of the accumulated flow for

each cell is the number of cells that
flow into this particular cell.
Cells with a value 0 have no other cells
flowing into them, and represent higher
areas.
309
Summary –proximity Computation
310
155
1/30/2018
Network analysis
2.1 Types of networks

• Directed network and undirected network
• Planar network and non-planar network
• Two types of analysis
• 2.2 Optimal path finding
• Ordered
• Unordered
• 2.3 Network partitioning
• Network allocation
• Trace analysis
• 2.4 Turntables
311
Types of Networks
Network is a set of connecting lines

Network can represent rivers, roads,
pipelines, telecommunication lines etc.
Network analysis analyze the way
‘goods’ can be transported along these
lines.
Network analysis can be done in raster
or in vector.
312
156
1/30/2018
Types of Networks
Networks can be directed, where

transportation is only in one
direction, for example rivers, or it can
be undirected, the goods can be
transported in both directions
(roads).
313
Types of Networks
Networks can be planar, this means they

are 2-dimensional.
Planar networks do not have overpasses
or underpasses.
Example of a planar networks are rivers.
314
157
1/30/2018
Types of Networks
Non-planar networks have multi-level

crossings, underpasses and overpasses.
When they are modeled in 2D these
overpasses and underpasses should be
modeled in a special way.
Example of non-planar networks are roads.
315
Analysis on networks
Optimal path finding
Which generates a least cost-path on a network between a pair of

predefined locations using both geometric and attribute data.
Network partitioning
Which assigns network elements (nodes or line segments) to

different location using predefined criteria.
• Network allocation
• Trace
316
158
1/30/2018
Optimal path finding is used when a least cost

path between two nodes in a network must be
found.
You need a cost function!
• Also called Impedance.
• One of the attributes in the feature
attribute table.
• Length, travel time, etc.
• The least-cost path is the one that has
the min. value of the total cost between
two nodes
317
Cost factors
• The cost can be defined on both lines
and nodes.
• For lines, the cost can be the same or
different along and against the line
direction.
• The cost on nodes is used to define the
turns.
318
159
1/30/2018
Two costs, one for each driving

direction can be applied in rush
hour.
In the morning it takes much longer
to go into the city than it takes to
drive in the opposite direction.
319
Cost can be associated with line

segments, but also with nodes.
Passing a traffic light for example can
take a considerable amount of time.
320
160
1/30/2018
Ordered optimal path finding: the

sequence in which the places have to
be visited matters.
Unordered optimal path finding: the
sequence does not matter.
321
Network partitioning
In network partitioning, the purpose is to assign lines and

nodes (parts of the network) to a number of target
locations (for example which part of the network belongs
to a hospital, school or fire station).
There are two types of network partitioning problems:

• 2.3.1 network allocation
• 2.3.2 trace analysis
322
161
1/30/2018
Network Allocation
We have a number of resource

centers, and the problem is which part
of the network can be assigned to
which service center.
In principle this is a simple problem,
each line segment is assigned to the
service center that is the nearest.
323
Network allocation
Problems associated with network

allocation are:
• The capacity with which a centre
can produce the resources
• The consumption of the resources
which may vary amongst lines or
line segments.
324
162
1/30/2018
Trace Analysis
Trace analysis is performed when

we want to understand which part of
a network is connected to the trace
origin.
A condition can be applied for
example, trace only in the direction
of the origin (upstream).
325
Summary –Network Analysis
326
163
1/30/2018
Error propagation
How errors propagate:

• Errors already present in the input data will
propagate through the manipulations.
• New errors arise from the computer processing
(analytical operations performed).
327
Error propagation
Error propagation analysis:

• Testing the accuracy of each state by measurement
against the real world.
• Modeling error propagation, either analytically or by
means of simulation techniques.
328
164
1/30/2018
Error propagation
Initially the complexity of spatial data led to the

development of mathematical models describing only
the propagation of attribute errors.
Modern models incorporate both spatial and attribute
errors.
329
CHAPTER 7
Data Visualization
330
165
1/30/2018
Contents
GIS and maps
The visualization process
Visualization and strategies
The cartographic ‘toolbox’
• data characteristics,
• representation of these characteristics
Examples, how to map:
•qualitative data,
•quantitative data, (terrain or statistical) elevation, time
Map cosmetics: the finishing touch

Map dissemination 331
GIS and Maps
In a GIS environment, maps can be used

as:
•Input for GIS
•Communicate GIS results
•Support spatial analysis
Maps are not only final products (output)!
332
166
1/30/2018
Map Characteristics
Main characteristics of maps

Maps provide answers (in graphical
form) to questions related to the 3
basic components of geographic
data
• geographic location (Where?)
Where did the cartography
students come from?’ • thematic attributes (What?)
• time (When?)
333
Map Characteristics
reality more abstract

Main characteristics of maps!!!!!
Maps offer abstract representations (models) of reality,
that are:
• simplified
map. • classified
• symbolized
334
167
1/30/2018
Scale
Main characteristics of maps

Maps are representations at
scale
Scale: ratio between distance on
the map and corresponding
distance in reality.
335
Scale
Scale indications
• verbal e.g. one-inch-to-the mile
• representative fraction e.g. 1 : 100 000
• graphical (scale bar) suitable in digital
environments!
Digital Graphical Scale
Advantage of scale bar in digital environment is that

its length changes when zooming in or out.
336
168
1/30/2018
Definition of a Map
A map is:
• a representation or abstraction of
geographic reality; a tool for
representing geographic information in a
way that is visual, digital.
• a reduced and simplified representation
of (parts of) the Earth’s surface on a
plane. 337
Types of Maps
Traditional distinction in maps

• topographic maps accurate representation of
the Earth’s topography.
• thematic maps one or more particular themes
are emphasized.
Less relevant distinction in a digital environment
338
169
1/30/2018
Map Dimensions
You can distinguish types of maps based

on the number of dimensions used for the
representation.
• Flat
• Flat + Height
• Flat + Height + Time
339
The visualization process
Maps are the result of a visualization process.
340
170
1/30/2018
Cartographic Tools
Visualization methods and techniques are

applied using cartographic ‘tools’:
• Functions (e.g. algorithms)

• Rules (e.g. generalization, cartographic
grammar)
• Habits or conventions (e.g. water is
represented in blue).
An algorithm can be used to smooth lines and improve the

appearance of features. 341
Visualization and strategies
‘Visualization’ has several meanings:

• generic: to make info visible (presentation in graphical form)
• more specific: to use sophisticated computer technology and
‘toolboxes’ to make data/ info visible for specific use:
visual exploration
this process is often called:

scientific visualization: meant to stimulate thinking
keywords: interaction, dynamics
342
171
1/30/2018
Geo-visualization
Two main strategies of visualization:

exploration and presentation.
• private visual thinking: involves an

individual playing with the spatial data to
determine its significance.
• public visual communication: concerns
maps aimed at a wide audience.
If maps are visually explored, we also

talk about geo-visualization.
343
Geo-visualization
Geo-visualization is accelerated by:
• The possibility to generate maps at any stage in Geo-

information processing.
• Hard- and software developments
• New output media
• Changing needs / expectations of users.
• Availability of abundant data, from different sources.
344
172
1/30/2018
Cartographic Visualization
The cartographic communication

process, based on “How do I say
what to whom, and is it effective?”
Information loss or gain:
Information derived by the map user
is not the same as the information
that the cartographic communication
process started with.
345
The Cartographic ‘Toolbox’
Analysis of the characteristics of data
What is the common ‘denominator’? Used

for the title of the map (theme, area, year)
What is the nature of the data or What are
the measurement scales ?
The common denominator refers to a common label for all the
attributes/attribute values that are mapped (here: geological units).
346
173
1/30/2018
Measurement scales are linked to the way in which

people perceive visual variables (see further).
347
Nominal data exists as discrete, named values without a

natural order amongst the values.
Examples are the different soil types (e.g. sand, clay,
peat) or the different land use categories (e.g. arable
land, pasture).
Ordinal data are measured along a relative scale, based
on hierarchies. For instance, one knows that one value is
’more’ than another value, such as ’warm’ versus ’cool’.
Another example is a hierarchy of road types: highway’,
’main road’, ’secondary road’ and ’track’.
348
174
1/30/2018
Quantitative data can be measured, either along an

interval or ratio scale.
For data measured on an interval scale, the exact
distance between values is known, but there exists no
absolute zero on the scale. Temperature is an example:
40oC is not twice as warm as 20oC, and 0oC is not an
absolute zero.
Quantitative data with a ratio scale have a known
absolute zero.
An example is income: someone earning 1000shs earns
twice as much as someone with an income of 500shs.
349
Basic elements of a map:

• Point symbols
• Line symbols
• Area symbols
• Text
These elements can all be varied in
appearance
350
175
1/30/2018
Bertin’s Visual Variables
Bertin’s visual variables: an elementary

way in which, point, line and area
symbols can be graphically varied.
• Size
• Color
• Value (lightness)
• Grain/ texture
• Orientation
• Form/shape
351
Bertin’s Visual Variables

The visual variables enable observers to perceive:
• What belongs together, or is of equal importance (e.g. all red
symbols represent danger).
• Order
(e.g. the population density varies from low to high, represented
by light and dark color tints, respectively).
• Quantities
(e.g. symbols changing in size with small symbols for small
amounts).
• An instant overview of the whole representation.
352
176
1/30/2018
Measurement Scales and Visual Variables
353
How to Map Qualitative Data
What is the common denominator of the

data? Watersheds
What is the nature of the data? qualitative
(nominal)
Solution: Colors of equal visual weight or
brightness which allow the user to quickly
differentiate between watersheds.
354
177
1/30/2018
How not to Map Qualitative Data
355
How to Map Absolute Quantitative Data
What is the common denominator of

the data? number of inhabitants.
What is the nature of the data?

absolute quantitative (ratio)
Solution: symbols varying in size
356
178
1/30/2018
How not to Map Absolute Quantitative Data
357
How to Map Relative Quantitative Data
What is the common denominator of the

data? Number of inhabitants/ km2.
What is the nature of the data? relative
quantitative (ratio).
Solution: Value has been used to display
the density from low (light tints) to high
(dark tints).
358
179
1/30/2018
How not to Map Relative Quantitative Data
359
Test Yourself
360
180
1/30/2018
Map Dissemination (Output)

Map design is influenced by:
• Data characteristics & user strategies (see before)
• Output medium, e.g.: paper or screen maps
Screen maps:
• Often smaller
Legend is not always visible and they enable:

• Access to a data base
• Links to other data
• Embedding in multi media
• Dynamics and interaction 361
How to Map the Terrain elevation
362
181
1/30/2018
How to Map the Thematic Data in 3D
Statistical (socio-economic) data can

also be represented as an elevated
surface.
Here, the municipalities in the
province of Overijssel are elevated
proportionally to their number of
inhabitants.
The resulting map is called a 'prism
map' 363
How to Map Time Series
Single static map: specific graphic

variables and symbols are used to indicate
change or represent an event
Series of Static Maps: A single map in the
series represents a ‘snapshot’ in time.
Together, the maps depict a process of
change.
Animated map: Change is perceived to
happen in a single image by displaying
several snapshots after each other, just like
a video. 364
182
1/30/2018
Map Cosmetics: The Finishing Touch
1. Additional information
marginal info (or metadata): makes
the map more usable.
2. Adding text improves the
identification of features.
3. Contrast improves overall map
legibility.
365
Web Maps
The Web as output medium increases the functions of maps:
• next to: insight in spatial data (traditional role)

• interface to additional information / services
• previews of data that can be acquired
Classification of maps on the Web
366
183
1/30/2018
Summary
Role of map in a GIS environment: more than just output

Basics: how to translate characteristics of spatial data into symbols on a
map, with particular attention to:
• analysis of the measurement scale of the data,
• measurement scales can be linked to perception properties of
visual variables
• selection of those variables that best translate the nature of
the data.
After translation of the data, the map has to be finalized to make it usable
(taking care of visual contrast, adding text, marginal information)
Finally the map is disseminated to users, e.g. via the Web.
367
References
Arbia, G., Griffith, D., and Haining, R. Error propagation modeling in raster gis: overlay operations. International
Journal of Geographical Information Science 12, 2 (1998), 145–167.
Aronoff, S. Geographic Information Systems: A Management Perspective. WDL Publications, Ottawa, Canada,
1989.
Bertin, J. S´emiology Graphique. Mouton, Den Haag, The Netherlands, 1967. BIJKER, W. Radar for rain forest—A
monitoring system for land cover change in the Colombian Amazon. PhD thesis, International Institute for
Aerospace Survey and Earth Sciences, Enschede, The Netherlands, 1997.
Board, C. Report of the working group on cartographic definitions. Cartographic Journal 29, 1 (1990), 65–69.
Bonham-Carter, G. F. Geographic information systems for geoscientists : Modeling with GIS, vol. 13 of Computer
methods in the geosciences. Pergamon, Kidlington, U.K., 1994.
Burrough, P. A. Natural objects with indeterminate boundaries. In Geographic objects with indeterminate
END OF COURSE
boundaries, P. A. Burrough and A. U. Frank, Eds. Taylor and Francis, London, U.K., 1996, pp. 3–28.
Chrisman, N. R. Errors in categorical maps: testing versus simulation. In Proceedings AutoCarto (1989), pp. 521–
529.
Good Luck
Clarke, D. G., and Clark, M. Lineage. In Elements of Spatial Data Quality, S. C. Guptill and J. L. Morrison, Eds.
Elsevier Science, Oxford, U.K., 1995, pp. 13–30.
Date, C. J. An Introduction to Database Systems, seventh ed. Addison- Wesley, Reading, Ma, 2000.
Dibiase, D. Visualization in earth sciences. Earth and Mineral Sciences, Bulletin of the College of Earth and Mineral
Sciences 59, 2 (1990), 13–18.
ElmasrI, R., AND Navathe, S. B. Fundamentals of Database Systems, second ed. Benjamin/Cummings, Redwood
City, Ca, 1994.
368
184
1/30/2018
Heuvelink, G. B. M. Error propagation in quantitative spatial modeling- Applications in Geographical Information

Systems. Nederlandse Geografische Studies. Koninklijk Aardrijkskundig Genootschap, Utrecht, 1993.
Iliffe, J. Datums and Map Projections for Remote Sensing, GIS and Surveying. Whittles Publishing, CRC Press,
2000.
Intergovernmental Committee on Surveying and Mapping (ICSM). Get in step with the geocentric datum. Tech. rep.,
Office of the Surveyor General, Land Information New Zealand (LINZ), 1999.
Janssen, L. L. F. Principles of Remote Sensing, vol. 2 of ITC Educational Textbook Series. International Institute for
Aerospace Survey and Earth Sciences, Enschede, The Netherlands, 2000.
Kerle, N., Janssen, L. L. F., AND Huurneman, G. C. Principles of Remote Sensing, third ed., vol. 2 of ITC
Educational Textbook Series. International Institute for Geo-Information Science and Earth Observation, Enschede,
The Netherlands, 2004.
KiiverI, H. T. Assessing, representing and transmitting positional accuracy in maps. International Journal of
Geographical Information Systems 11, 1 (1997), 33–52.
Knuth, D. E. The TEXbook. Addison-Wesley, Reading, Ma, 1984.
Kraak, M.-J., and Brown, A., Eds. Web cartography, developments and prospects. Taylor & Francis, London, U.K.,
2000.
Kraak, M.-J., and Ormeling, F. J. Cartography: Visualization of Spatial Data. Addison-Wesley Longman, London,
U.K., 1996.
Land Information New Zealand (LINZ). Strategic business plan. Tech. rep., Land Information New Zealand (LINZ),
October 1999.
LaurinI, R., and Thompson, D. Fundamentals of Spatial Information Systems, vol. 37 of The APIC Series. Academic
Press, London, U.K., 1992.
Leunen, M.-C. V. A Handbook for Scholars, revised ed. Oxford University Press, New York, NY, 1992.
369
Longley, P. A., Goodchild, M. F., Maguire, D. M., and Rhind, D. W., Eds. Geographical Information Systems:
Principles, Techniques, Management, and Applications, second ed., vol. 1. John Wiley & Sons, New York, N.Y.,
1999.
Mccormick, B., Defanti, T. A., and (eds.), M. D. B. Visualization in scientific computing. ACM SIGGRAPH Computer
Graphics—Special issue 21, 6 (1987).
National mapping Division, U. S. Geological Survey. Spatial data transfer standard. Tech. rep., U. S. Department of
the Interior, 1990.
Office of the Surveyor General of Land Information New Zealand. A proposal for geodetic datum development. Tech.
Rep. OSG TR2.1, Land Information New Zealand (LINZ), 1998.
Openshaw, S., Charlton, M., and Carver, S. Error propagation: a Monte Carlo simulation. In Handling geographical
information: methodology and potential applications, I. Masser and M. Blakemore, Eds. Longman, Harlow, U.K.,
1991, pp. 78–101.
Preparata, F. P., and Shamos, M. I. Computational Geometry—An Introduction. Springer-Verlag, New York, NY,
1985.
Robinson, A. H., Morrison, J. L., MuehrckE, P. C., Kimerling, A. J., and Guptill, S. C. Elements of Cartography, sixth
ed. John Wiley & Sons, New York, N.Y., 1995.
Samet, H. The Design and Analysis of Spatial Data Structures. Addison- Wesley, Reading, Ma, 1990.
Strunk Jr., W., and White, E. B. The Elements of Style, third ed. MacMillan Publishing Company, New York, NY,
1979.
Veregin, H. Developing and testing of an error propagation model for GIS overlay operations. International Journal
of Geographical Information Systems 9, 6 (1995), 595–619.
Zadeh, L. A. Fuzzy sets. Information and Control 8 (1965), 338–353.
370
185

Lecture Notes GIS

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Lecture Notes GIS

Uploaded by

Copyright:

Available Formats

1/30/2018

Geographic Information System

We may distinguish three important stages while working with geographic

Spatial data and Geo-information

By DATA, we mean representations that can be operated upon by a computer.

Can apply the topological properties of interior and boundary to define

Spatial Data Modeling

Different types of geographic phenomena

Different types of geographic phenomena

Continuous Field Discrete Field

Fuzzy boundaries (dynamic) Fuzzy boundaries (natural)

Computer representations of geographic information

Examples of Regular Tessellations

Regular Tessellation Cont..

Coordinate pair from some geographic space also known

Note: rasters do not explicitly store georeferences of the phenomena they

Triangulated Irregular Network (TIN)

Does TIN belong to vector?

YES, each anchor point has a stored georeference.

• Line is defined by 2 end nodes and 0-n internal nodes.

Topology and Spatial Relationships

Topology deals with spatial properties that do not change under a

E (polygon, area) in figure above is still inside D (another polygon or area)

Topology and Spatial Relationships

Topology and Spatial Relationships

There are 7 other spatial relationships that can be defined in a

Rules of topological consistency

Scale and Resolution

1:50,000 →1 cm on map = 50,000 cm in reality

Large-scale →much detail of a small area

Scale and Resolution

Representation of geographic fields

Representation of geographic objects

Spatial Data Modeling

Raster and vector compared

Data - Processing - Systems?

Data Processing Systems

1. appropriate hardware components for processing, storage and

Hardware and Software trends

Computers CPU: 3 GHz

CPU: 1500 times faster

GIS Functional Components

Data Preparation Operators

Raster data storage

Values of neighboring cells are close together.

Analysis of Spatial Data

Database Management Systems (DBMS)

Reasons for using a DBMS

The Relational Model

The relational data model

The relational data model Foreign Keys

Vector Data Storage

Geometric Features in GIS

Raster • geometric attributes

Paper maps versus GIS

GIS and DBMS support

Components of Spatial Data Quality

Inconsistent data (I)

The overlapping parcel boundaries is the result of digitizing from

Inconsistent data (II)

Corrupted registration system !

Refers to the history of the data set

Attribute accuracy in Remote Sensing: Example of an error matrix