Spatial Sampling and Interpolation: How Are Raster Surfaces Made?

Lecture 11
------Using GIS--
Introduction to GIS
Spatial Sampling and Interpolation
Raghunath Jha
Raghunath Jha
Introduction to GIS
How are raster surfaces Made?

•Raster surfaces are generally either made:
• From remote sensing (covered later) which
collects reflectance values at every pixel within the
geographic extent and can be classified later on or
•From sample points whose Z values are
Interpolated across space to fill in all the blank
areas.
©2005 Austin Troy
1
Lecture 11
Introduction to GIS
What is interpolation?
•Process of creating a surface based on values at
isolated sample points.
•Sample points are locations where we collect data on
some phenomenon and record the spatial coordinates
•We use mathematical estimation to “guess at” what
the values are “in between” those points
•We can create either a raster or vector interpolated
surface
•Interpolation is used because field data are expensive
to collect, and can’t be collected everywhere
©2005 Austin Troy
Introduction to GIS
How does it Look

•Let say we have our ground water pollution samples
This
gives
us
©2005 Austin Troy
2
Lecture 11
Introduction to GIS
How does it work

•This can be displayed as a 3D trend surface in 3D analyst
©2005 Austin Troy
Introduction to GIS
How does it work

•We can also use interpolation methods to create contours
©2005 Austin Troy
3
Lecture 11
Introduction to GIS
Sample points
• Also known as “control points.”
•These are points where you or someone else has
collected data (attributes) for a spatial coordinate (point)
•Any number of attributes can be collected at that point
•E.g.1 weather stations collect data on temperature,
rainfall, wind, humidity, etc.
•E.g. 2 soil invertebrate samples would record
abundance of numerous species at each location
©2005 Austin Troy
Introduction to GIS
What isn’t interpolation?

•Interpolation only works where values are spatially
dependant, or spatially autocorrelated, that is, where
nearby location tend to have similar Z values.
•Examples of spatially autocorrelated features:
elevation, property value, crime levels, precipitation
•Non-autocorrelated examples: number of drum sets per
city block; cheeseburgers consumed per household.
•Where values across a landscape are geographically
independent, interpolation does not work because value
of (x,y) cannot be used to predict value of (x+1, y+1).
©2005 Austin Troy
4
Lecture 11
Introduction to GIS
Interpolation examples
•Elevation:
Source: LUBOS
MITAS AND
HELENA
MITASOVA,
University of Illinois
©2005 Austin Troy
Introduction to GIS
•Elevation:
•Elevation values tend to be highly spatially
autocorrelated because elevation at location (x,y) is
generally a function of the surrounding locations
•Except is areas where terrain is very abrupt and
precipitous, such as Patagonia, or Yosemite
•In this case, elevation would not be
autocorrelated at local (large) scale, but still may
be autocorrelated at regional (small scale)
©2005 Austin Troy
5
Lecture 11
Introduction to GIS
•Imagine this elevation cross section: If each dashed line
represented a sample point (in 1-D), this spacing would
miss major local sources of variation, like the gorge
©2005 Austin Troy
Introduction to GIS
•Our interpolated surface (represented in 1-D by the blue
line) would look like this
©2005 Austin Troy
6
Lecture 11
Introduction to GIS
•If we increased the sampling rate, we would pick up that
local variation
©2005 Austin Troy
Introduction to GIS
•Here our interpolated surface is much closer to reality at
the local level, but we pay for this in the form of higher
data gathering cost
©2005 Austin Troy
7
Lecture 11
Introduction to GIS
•Weather
•Weather tends to be modeled on a regional level (e.g.
your local weather report) because, in most places,
weather systems and trends happen over a very large
area. Hence the need for sample point density is not
so great
•In other places, local climate variability is very great,
such as in the SF Bay Area where temperatures can
vary 50 degrees within 10 miles due to ocean effects.
©2005 Austin Troy
Introduction to GIS
•Weather
Example: precipitation varying over a
•Weather is also season
extremely variable
over time, so
samples must be
continually taken.
This is why weather
stations are usually
permanent
Source: LUBOS MITAS AND

HELENA MITASOVA, University
of Illinois
©2005 Austin Troy
8
Lecture 11
Introduction to GIS
•Groundwater contamination:
•The needed density of points will depend on the
geology and the type of terrain
•Areas where geology allows for free groundwater
flows across large areas will have less local variation
and need less dense points, while areas with geologic
features that inhibit or redirect flow (e.g. karst
topography) will need denser points
©2005 Austin Troy
Introduction to GIS
Where interpolation
does not work
•Cannot use interpolation where values are not spatially
autocorrelated
•Say looking at household income—in an income-
segregated city, you could take a small sample of
households for income and probably interpolate
•However, in a highly income-integrated city, where a
given block has rich and poor, this would not work
©2005 Austin Troy
9
Lecture 11
Introduction to GIS
Sampling
•As you can see, the density and spacing of samples
depends on many things
•A key component of any study with spatially
referenced field data is the sampling strategy
•If the values in your interpolation surface (layer A)
depend on some factor in layer B, then we can design
our sample of A based on layer B
•We can do this by conducting a stratified random
sample
©2005 Austin Troy
Introduction to GIS
Sampling
•Example: let’s say want to make an average precipitation layer
and we find that in our study zone precipitation is highly spatially
variable within 10 miles of the ocean
•We’d a coastline layer to help us sample.
•We’d have high density of sampling points within 10 miles of
the ocean a much lower density in the inland zones
©2005 Austin Troy
10
Lecture 11
Introduction to GIS
Sampling
•Say we were looking at an inland area, far from any ocean, and
we decided that precipitation varied with elevation. How would
we set up our sampling design?
•In this case, flat areas would need fewer sample points, while
areas of rough topography would need more
•In our sampling design we would set up zones, or strata,
corresponding to different elevation zones and we would make
sure that we get a certain minimum number of samples within
each of those zones
•This ensures we get a representative sample across, in this case,
elevation;
©2005 Austin Troy
Introduction to GIS
Sampling
•The number of zones we use will determine how
representative our sample is; if zones are big and broad,
we do not ensure that all elevation ranges are represented
©2005 Austin Troy
11
Lecture 11
Introduction to GIS
Sampling
•The number of samples we want within each zone
depends on the statistical certainty with which we want
to generate our surface
•Do we want to be 95% certain that a given pixel is
classified right, or 90% or 80%?
•Our desired confidence level will determine the
number of samples we need per strata
•This is a tradeoff between cost and statistical certainty
•Think of other examples where you could stratify….
©2005 Austin Troy
Introduction to GIS
Sampling
•A common problem with sampling points for interpolation
is what is not being sampled?
•Very frequently people leave out sample points that are
hard to get to or hard to collect data at
•This creates sampling biases and regions whose
interpolated values are essentially meaningless
•So spacing of sample points from interpolation should be
based on some meaningful factor—if they are dense in a
region in sparse in a region, it should be because the values
are variable in the first area and homogeneous in the other
©2005 Austin Troy
12
Lecture 11
Introduction to GIS
Sampling and Scale dependency

•Sampling strategy for interpolation depends on the
scale at which you are working and the scale
dependency of the phenomenon you are studying
•In many cases interpolation will work to pick up
regional trends but lose the local variation in the process
•The density of sample points must be chosen to reflect
the scale of the phenomenon you are measuring.
©2005 Austin Troy
Introduction to GIS
Scale dependency
•If you have a high density of sample points, you
will capture local variation, which is appropriate for
large-scale (small-area) studies
•If you have low density of sample points, you will
lose sensitivity of local variation and capture only
the regional variation; this is more appropriate for
small-scale (large-area) studies
©2005 Austin Troy
13
Lecture 11
Introduction to GIS
How does interpolation work

•In ArcGIS, to interpolate:
•Create or add a point shapefile with some attribute
that will be used as a Z value
•Click Spatial Analyst>>Interpolate to Raster and
then choose the method
©2005 Austin Troy
Introduction to GIS
Inverse Distance Weighting

•IDW weights the value of each point by its distance to
the cell being analyzed and averages the values.
•IDW assumes that unknown value is influenced more by
nearby than far away points, but we can control how rapid
that decay is. “Influence diminishes with distance.”
•“To predict a value for any unmeasured location, IDW will use the
measured values surrounding the prediction location. Those
measured values closest to the prediction location will have more
influence on the predicted value than those farther away. It weights
the points closer to the prediction location greater than those farther
away, hence the name inverse distance weighted.” From ArcGIS help
©2005 Austin Troy
14
Lecture 11
Introduction to GIS
Spline Method
•Another option for interpolation method
•This fits a curve through the sample data assign values to
other locations based on their location on the curve
•Thin plate splines create a surface that passes through sample
points with the least possible change in slope at all points,
that is with a minimum curvature surface
•SPLINE has two types: regularized and tension
•Tension results in a rougher surface that more closely
adheres to abrupt changes in sample points
•Regularized results in a smoother surface that smoothes out
abruptly changing values somewhat
©2005 Austin Troy
Introduction to GIS
Kriging Method
•Semivariograms measure the strength of statistical correlation as a
function of distance; they quantify spatial autocorrelation
•Because Kriging is based on the semivariogram, it is probabilistic,
while IDW and Spline are deterministic
•Kriging associates some probability with each prediction, hence it
provides not just a surface, but some measure of the accuracy of
that surface
•Kriging equations are determined by fitting line through points so
as to minimize weighted sum of squares between points and line
•These equations are weighted based on spatial autocorrelation,
which is determined from the semivariograms
©2005 Austin Troy
15
Lecture 11
Introduction to GIS
Example
•Here are some sample elevation points from which surfaces were
derived using the three methods
©2005 Austin Troy
Introduction to GIS
Example: Spline
•Note how smooth the curves of the terrain are; this is because
Spline is fitting a simply polynomial equation through the points
©2005 Austin Troy
16
Lecture 11
Introduction to GIS
Example: IDW
•Done with P =2. Notice how it is not as smooth as Spline. This is
because of the weighting function introduced through P
©2005 Austin Troy
Introduction to GIS
Example: Kriging
•This one is kind of in between—because it fits an equation
through point, but weights it based on probabilities
©2005 Austin Troy
17
Lecture 11
Introduction to GIS
Other methods of interpolation

•Thiessen polygons
•This method builds
polygons, rather than
a raster surface, from
control points
•“grows” polygons
around sample points
that are supposed to
represent areas of Source: Jens-Ulrich Nomme http://www.tu-harburg.de/sb3/pssd/GIS-
Methods/thiessen.html
homogeneity
©2005 Austin Troy
Introduction to GIS
Density Functions
•We can also use sample points to map out density raster surfaces.
This need to require a z value in each, it can simply be based on the
abundance and distribution of points.
©2005 Austin Troy
18
Lecture 11
Introduction to GIS
Density Functions
•These settings would give us a raster density surface, based just on
the abundance of points within a “kernel” or data frame. In this
case, a z value for each point is not necessary.
©2005 Austin Troy
19

Spatial Sampling and Interpolation: How Are Raster Surfaces Made?

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Spatial Sampling and Interpolation: How Are Raster Surfaces Made?

Uploaded by

Copyright:

Available Formats

Lecture 11

Spatial Sampling and Interpolation

How are raster surfaces Made?

©2005 Austin Troy

How does it Look

©2005 Austin Troy

How does it work

©2005 Austin Troy

How does it work

©2005 Austin Troy

©2005 Austin Troy

What isn’t interpolation?

©2005 Austin Troy

©2005 Austin Troy

©2005 Austin Troy

©2005 Austin Troy

©2005 Austin Troy

©2005 Austin Troy

©2005 Austin Troy

Source: LUBOS MITAS AND

©2005 Austin Troy

©2005 Austin Troy

©2005 Austin Troy

©2005 Austin Troy

©2005 Austin Troy

©2005 Austin Troy

Sampling and Scale dependency

©2005 Austin Troy

©2005 Austin Troy

How does interpolation work

©2005 Austin Troy

Inverse Distance Weighting

©2005 Austin Troy

©2005 Austin Troy

©2005 Austin Troy

©2005 Austin Troy

Other methods of interpolation

©2005 Austin Troy

©2005 Austin Troy

You might also like