Professional Documents
Culture Documents
Systems
(CS 497)
Seyum A. (PhD)
Objectives and Outcomes
Objectives:
Outcomes :
2
About The Course
● Course Materials
○ Principles of Geographic Information Systems. ITC Educational TextBook Series, International
Institute for Geo-Information Science and Earth Observation (2004) (Third Edition)
○ Principles of Remote Sensing. ITC Educational TextBook Series, International Institute for
Geo-Information Science and Earth Observation (2004) (Third Edition)
○ Geographic Information Systems: A Management Perspective. Stan Aronoff (1989), WDL
Publications, Ottawa, Canada.
○ Concepts and Techniques of Geographic Information Systems. C. P. Lo et al
(2002),Prentice-Hall, Inc., New Jersey, U. S. A.
○ ArcGIS Tutorial Books.
3
Course Evaluation
● Mid exam – 20%
● Final Exam – 40%
● Projects and Assignments – 40%
4
Geographic Information System
5
6
What is GIS ?
Geographic
● Implies that locations of data items are known, or can be calculated, in terms
of geographic coordinates (latitude, Longitude )
Information
● Implies that the data in GIS are organized to used useful knowledge, often as
colored maps and images, but also statistical graphics, tables and various on
screen responses to interactive queries
System
● Implies that a GIS is made up from several inter-related and linked
components with different functions. Thus, GIS have functional capabilities
for data capture, input manipulation, transformation, visualization etc
7
What is GIS?
Geographical Information System (GIS)
● Links databases and maps
● Manages information about places
● Helps answer questions such as:
○ Where is it?
○ What else is nearby?
○ Where is the highest concentration of ‘X’?
○ Where can i find things with characteristics ‘Y’ ?
○ Where is the closest ‘Z’ to my location ?
8
9
10
What is GIS ?
● A technology
○ Hardware and software tools
● An information handling strategy
● The objective: to improve the overall decision making
strategy
11
GIS: a formal Definition
12
In Short, a GIS is
13
Why GIS Unique?
14
15
Pandemic/Public Health
16
GIS concepts
17
GIS: Historical Background
18
GIS Process
19
GIS: Summary description
20
How GIS differs from Related Systems
•DBMS--typical MIS database contains implicit but not explicit locational information
–city, county, zip code, etc. but no geographical coordinates
–is 100 N. High around the corner or across town from 200 E Main?
•automated mapping (AM) --primarily two-dimensional display devices
–thematic mapping (choropleth,etc such as SAS/GRAPH, DIDS, business mapping
software) unable to relate different geographical layers (e.g zip codes and counties)
–automated cartography--graphical design oriented; limited database ability
•facility management (FM) systems--
–lack spatial analysis tools
•CAD/CAM (computer aided design/drafting)--primarily 3D graphic creation
(engineering design) & display systems
–don’t reference via geographic location
•CAD sees the world as a 3-D cube, GIS as a 3-D sphere
–limited (if any) database ability (especially for non-spatial data)
•scientific visualization systems--sophisticated multi-dimensional graphics, but:
–lack database support
–lack two-dimensional spatial analysis tools
21
Examples of Applied GIS
•Urban Planning, Management & Policy •Civil Engineering/Utility
–Locating underground facilities
–Zoning, subdivision planning
–Designing alignment for freeways, transit
–Land acquisition
–Coordination of infrastructure maintenance
–Economic development
–Code enforcement • Business
–Housing renovation programs –Demographic Analysis
–Emergency response –Market Penetration/ Share Analysis
–Crime analysis –Site Selection
–Tax assessment •Education Administration
•Environmental Sciences – Attendance Area Maintenance
–Monitoring environmental risk – Enrollment Projections
–Modeling stormwater runoff – School Bus Routing
–Management of watersheds, floodplains, wetlands, forests, •Real Estate
aquifers –Neighborhood land prices
–Environmental Impact Analysis –Traffic Impact Analysis
–Hazardous or toxic facility siting –Determination of Highest and Best Use
–Groundwater modeling and contamination tracking •Health Care
•Political Science –Epidemiology
–Redistricting – Needs Analysis
–Analysis of election results Service Inventory
–Predictive modeling
22
What GIS Applications Do:manage, analyze, communicate
• make possible the automation of activities involving geographic data
–map production
–calculation of areas, distances, route lengths
–measurement of slope, aspect, viewshed
–logistics: route planning, vehicle tracking, traffic management
•allow for the integration of data hitherto confined to independent domains
(e.g property maps and air photos).
•by tieing data to maps, permits the succinct communication of complex
spatial patterns (e.g environmental sensitivity).
•provides answers to spatial queries (how many elderly in Richardson live
further than 10 minutes at rush hour from ambulance service?)
•perform complex spatial modelling (what if scenarios for transportation
planning, disaster planning, resource management, utility design)
23
GIS System Architecture and Components
24
Knowledge Base for GIS
25
The GIS Data Model
26
The GIS Data Model: Purpose
27
28
A layer-cake of information
31
•Data is organized by layers, coverages or themes
(synonymous concepts), with each layer representing a common
feature.
• Layers are integrated using explicit location on the earth’s
surface, thus geographic location is the organizing principal.
32
The GIS Model: example
Here we have three layers or themes:
e
ud
--roads,
tit
--hydrology (water),
la
34
Spatial and Attribute Data
● Spatial data (where)
–specifies location
–stored in a shape file, geodatabase or similar geographic file
● Attribute (descriptive) data (what, how much, when)
–specifies characteristics at that location, natural or human-created
–stored in a database table
● GIS systems traditionally maintain spatial and attribute data
separately, then “join” them for display or analysis
– for example, in ArcView, the Attributes of … table is used to link a
shapefile (spatial structure) with a database table containing attribute
information in order to display the attribute data spatially on a map
35
36
Spatial Entity
● Spatial data describes the location and shape of geographic features, and their
spatial relationship to the features
● The Spatial data can be represented through using
1. The Point
2. The Line
3. The Area
4. Network
5. The Surface
37
GIS Data Modeling
● A data model describes the thematic layers used in the GIS application, their
spatial representation, their attributes, their integrity rules and relationships,
their cartographic portrial, and their metadata requirement
● A conceptual description of how spatial data are organized for use by the GIS
● Represents a set of guidelines to convert the real world to the digitally and
logically represented spatial objects consisting of the attributes and geometry
● The attributes are managed by thematic or semantic structure
● The geometry is represented by geometric-topological structure
38
Representing Data with Raster and Vector Models
Raster Model
•area is covered by grid with (usually) equal-sized, square cells
•attributes are recorded by assigning each cell a single value based on the majority
feature (attribute) in the cell, such as land use type.
•Image data is a special case of raster data in which the “attribute” is a reflectance
value from the geomagnetic spectrum
– cells in image data often called pixels (picture elements)
Vector Model
The fundamental concept of vector GIS is that all geographic features in the real work can be
represented either as:
•points or dots (nodes): trees, poles, fire plugs, airports, cities
•lines (arcs): streams, streets, sewers,
•areas (polygons): land parcels, cities, counties, forest, rock type
Because representation depends on shape, ArcView refers to files containing vector data as shapefiles
39
Vector Data Model
40
Vector Data
● Vector data provide a way to represent a real world features within a GIS
environment
● A vector feature has its shape represented using geometry
● The geometry is made up of one or more interconnected vertices
● A vertex describe a position in space using an X, Y and potentially Z axis
● In the vector data model, features on the earth are represented as:
○ Points
○ lines/ routes
○ Polygones / regions
○ TINs (Triangulated Irregular Networks)
41
● Vector data model uses sets of coordinates and associated attribute data to
define discrete objects
● Point objects in spatial database represent location of entities considered to
have no dimension
● Simplest type of spatial objects
○ E.g. wells, sampling points, poles, telephone towers, etc.
● Line objects are used to represent linear features using ordered set of
coordinate pairs
○ E.g. infrastructure networks (transport networks: highways, railroads, etc.) ; utility networks:
(gas, electric, telephone, water, etc. ); airline networks: hubs and routes, etc.); natural networks
such as river channels
42
● Polygon objects in spatial database represent entities which covers an area
○ E.g. lakes, Buildings, parcels, etc.
● Boundaries may be defined by natural phenomena (e.g. lake), or by man
made features (e.g census tracts, neighborhoods)
○ E.g. Land cover data: forest, wetlands, urban areas, etc.
○ Soil data – soil types
43
● In vector world the point is the
basic building block from which all
spatial entities are constructed
44
Point
● A point is a zero dimensional object and has only the property of location
● Points can be used to model features such as a well, building, power pole,
sample location etc.
● Other names for a point are vertex, node
45
Line
46
Polygon
● Polygon features are made of one or more lines that encloses an area
● A polygon is a two dimensional object with properties of area and perimeter
represented by a closed sequence of lines
● A polygon can represent a city, geologic formation, lake, river
47
➔ The vector data structure mainly ensure the following
points. They are
● No node or line segment is duplicated;
● Line segment and nodes can be referenced to more
than one polygon;
● All polygon have the unique identifiers
● Polygon can be adequately represented.
48
➔ In vector structure topology is concerned with connectivity between entities
and not their physical shape.
➔ Boundaries are identified through network of arcs, checking polygons for
closure, and linking arcs into polygons.
➔ The area of polygon can calculated and unique identification numbers are
attached
➔ This identifier would allow non spatial information to be linked to a specific
polygon.
49
Vector Data Advantages
50
Vector data Disadvantages
51
Raster Data Model
● The term raster implies a regularly spaced grid
● Raster data consists of rows and columns of cells (or pixels)
● In this format a single value is stored against each cell
●
52
Raster Data
● Raster Data Model defines the world as a regular set of cells in a uniform grid
pattern
● Cells are square and evenly spaced in the x and y directions
● Each cell represent attribute values and cell location of phenomena or entities
● Cell dimension specifies the length and width of the cell in surface units
● Raster data models represent continuous phenomena or spatial features
○ E.g. Elevation/DEM, bathymetry, precipitation, slope, etc.
● Rasters are digital aerial photographs, imagery from satellites, digital pictures, or
even scanned maps
53
Raster Data
● Cell based data such as aerial imagery and digital elevation models is
characterized by pixel values
● Basically a raster file is a giant table, where each pixel is assigned a specific
value from 0 to 255.
● The meaning behind these values is determined by the user
○ They can represent elevations
○ Temperature
○ Hydrology etc
54
Cell size of Raster Data
● The level of detail represented by raster is often dependent on the cell (pixel)
size or spatial resolution of the raster
● The cell must be small enough to capture the required detail but large enough
so computer storage and analysis can be performed efficiently
55
56
Three types of raster data
● Thematic raster data
● Imagery raster data
● Spectral raster data
57
58
➔ In the raster data model the basic building block is the individual grid cell, an
➔ the shape and character of an entity is created by the grouping of cells.
➔ The size of the grid cell is very important as it Raster is a method for the
storage, Processing and display of spatial data.
➔ Each area is divided into rows and columns, which form a regular grid
structure. Each cell must be rectangular in shape, but not necessarily square.
➔ Each cell within this matrix contains location coordinates as well as an
attribute value. The origin of rows and column is at the upper left corner of
the grid.
➔ Rows function as the “y”coordinate and column as”x”coordinate in a two
dimensional system. A cell is defined by its location in terms of rows and
columns.
59
Several methods exist for encoding raster data from scratch. Three of these models
are as follows:
60
Raster Data Advantages
61
Raster Data Disadvantages
● It is especially difficult adequately represent linear features depending on the
cell resolution
● Network linkages are difficult to establish
● Processing of associated attribute data maybe cumbersome if large amount of
data exists
● Raster maps inherently reflect only one attribute or characteristics of an area
● Most output maps from grid cell systems don’t confirm to high quality
cartographic needs
62
Difference between Raster and Vector
63
64
Triangulated Irregular Network (TIN)
65
66
Types of TIN methods and Processes
There are different types of TIN interpolation methods. Some of them are
● Natural Neighbour
● Kriging
● Spline
● Nearest Neighbour
● Inverse distance weighting
67
Digital Elevation Model (DEM)
● DEM is a data model which represents the surface of a terrain in 3 dimension
● Can be represented as a raster or as a TIN
● The TIN DEM dataset is also referred to as a primary DEM or measured DEM
● Raster DEM is referred to as secondary DEM or computed DEM
68
Types of DEM
69
Spatial Data Structure
70
Spatial Data Structure
71
Raster Data Structure
73
Run-Length Encoding Data Structure
● Run-Length Encoding (RLE) algorithm was developed to handle the problem
that a grid often contains redundant or missing data
● When the raster data contains more missing data, the cell-by-cell encoding
method cannot be suggested
● In RLE method, adjacent cells along a row with the same value are treated as
a group called a run
● If a whole row has only one class, it is stored as the class and the same
attributes are kept without change
● Instead of repeatedly storing the same value for each cell, the value is stored
once together with the number of the cells that makes the run
● RLE data compression method is used in many GIS packages and in standard
image formats.
74
● run-length encoding structure of a
polygon.
75
Quadtree Data Structure
● To compress the data as well as to save the space in original grid, quadtree
data structure can be used
● A quadtree works by dividing a grid into four quadrants for the available
data
● The available data quadrant is again split into four half-size quadrants and so
on until the individual pixel is reached
● The attribute data for all the pixels of the quadrant remains the same even if it
is divided.
76
77
Vector Data Structure
● Vector files were simply lines and were having only starting and ending point
● The vector file consists of a few long lines, many short lines, or even a mix of
the two
● The files are generally written in a binary or ASCII (American Standard Code for
Information Interchange) code
● two types of data structure.
○ Topological Data Structure
○ Non-Topological Data Structure
78
Topological Data Structure
● Topology is the arrangement of how point, line, and polygon features share
geometry
● Topology is used for the following:
○ Constrain how features share geometry
● Topological data structure is often referred to as an intelligent data structure
○ because spatial relationships between geographic features are easily derived when using them
● Topological vector data structure is important in undertaking complex data
analysis
● In a topological data structure, lines cannot overlap without a node
79
● The arc-node topological data structure is now used in most of the systems
● In the arc-node data structure, the arc is used for the data storage and it also
works when it is needed to reconstruct a polygon
● In file of arcs, point data is stored and linked to the arc file
● Node refers to the end points of the line segment
● The arc has information not only related to that particular arc but also to its
neighbours in geographic space
80
● Important topological vector data structures
○ Topologically Integrated Geographic Encoding and Referencing (TIGER)
○ Coverage Data Structure.
81
Topologically Integrated Geographic Encoding and Referencing (TIGER)
82
Coverage Data Structure
● A coverage is a georelational data model that stores vector data—it contains
both the spatial (location) and attribute (descriptive) data for geographic
features
● Coverages use a set of feature classes to represent geographic features
● Coverage data structure was practised by many GIS companies like ESRI, in
their software packages in 1980s to separate GIS from CAD (Computer Aided
Design)
● A coverage data structure is a topology based vector data structure that can
be a point, line or polygon coverage
● A point is a simple spatial entity which can be represented with topology
● The point coverage data structure contains feature identification numbers
(ID) and pairs of x, y coordinates
83
Point coverage data structure
84
Line coverage data structure
85
Polygon coverage data structure 86
GIS Database
87
What is a database
● A database is any organized collection of data
● Some examples common examples
○ a telephone book
○ T.V. Guide
○ airline reservation system
○ motor vehicle registration records
○ papers in your filing cabinet
○ files on your computer hard drive.
88
Database Definitions
What is a database?
It’s an organized collection of data, it need not be a computer
based system.
89
Spatial Database
● A spatial database is a general-purpose database (usually a relational
database)
○ is a collection of spatial data types, operators, indices, processing
strategies, etc.
○ that has been enhanced to include spatial data that represents objects
defined in a geometric space, along with tools for querying and
analyzing such data
○ Most spatial databases allow the representation of simple geometric
objects such as points, lines and polygons
○ Some spatial databases handle more complex structures such as 3D
objects, topological coverages, linear networks, and triangulated
irregular networks (TINs)
90
What is SDBMS?
● A SDBMS is a software module that
○ can work with an underlying DBMS
○ supports spatial data models, spatial abstract data types (ADTs) and a
query language from which these ADTs are callable
○ supports spatial indexing, efficient algorithms for processing spatial
operations, and domain specific rules for query optimization
91
SDBMS Example
93
● SDBMS focusses on
○ Efficient storage, querying, sharing of large spatial datasets
○ Provides simpler set based query operations
○ Example operations: search by region, overlay, nearest neighbor,
distance, adjacency, perimeter etc.
○ Uses spatial indices and query optimization to speedup queries over
large spatial datasets.
● SDBMS may be used by applications other than GIS
○ Astronomy, Genomics, Multimedia information systems, ...
94
Components of SDBMS
● Components include
○ spatial data model,
○ query language,
○ query processing,
○ file organization and indices,
○ query optimization, etc.
95
Architecture of SDBMS
96
Spatial Analysis
97
What is spatial Analysis
● Spatial analysis is any of the formal techniques which studies entities using
their topological, geometric, or geographic properties
● Spatial analysis includes a variety of techniques using different analytic
approaches, especially spatial statistics
● Through spatial analysis you can interact with a GIS to answer questions,
support decisions, and reveal patterns
● it includes all of the transformations, manipulations, and methods that can be
applied to geographic data to turn them into useful information
98
Types of Spatial Analysis
99
● Queries and reasoning are the most basic of analysis operations, in which the
GIS is used to answer simple questions posed by the user. No changes occur
in the database and no new data are produced
● Transformations are simple methods of spatial analysis that change data sets
by combining them or comparing them to obtain new data sets and
eventually new insights. Transformations use simple geometric, arithmetic, or
logical rules, and they include operations that convert raster data to vector
data or vice versa. They may also create fields from collections of objects or
detect collections of objects in fields.
100
● Descriptive summaries attempt to capture the essence of a data set in one or two
numbers. They are the spatial equivalent of the descriptive statistics commonly
used in statistical analysis, including the mean and standard deviation.
101
Spatial Data Acquisition
102
Primary sources
● SURVEYING,
● PHOTOGRAMMETRY,
● GNSS AND REMOTE SENSING
103
Secondary Methods
● Digitization
● Automatic Line Following
● Scanning
104
Surveying
105
Photogrammetry
● It is a technique of mapping features using aerial
photographs
● Photos taken by set of cameras on board an airplane
● Instruments that can be used to help in the interpretation
are
○ Stereoscope
○ Stereo plotters
106
Remote Sensing
● Mapping features using satellite imagery
● Images taken by a set of sensors on board a satellite
● Features are mapped from satellite imagery using
○ Visual interpretation or
○ Automatic classification
107
Digitizing and Scanning
● Digitization is the process of converting existing maps to
digital form (Vector format) - called Vectorization
● Scanning is a process of converting existing maps to
digital form (Raster format) - called Rasterization
108